Review

State of the Art in Parallel and Distributed Systems: Emerging Trends and Challenges

Fei Dai 1,*, Md Akbar Hossain 1 and Yi Wang 2

1 School of Computing, Eastern Institute of Technology, Napier 4104, New Zealand; ahossain@eit.ac.nz
2 School of Mathematical and Computational Sciences, Massey University, Palmerston North 4410, New Zealand; y.wang24@massey.ac.nz
* Correspondence: tdai@eit.ac.nz

Abstract: Driven by rapid advancements in interconnection, packaging, integration, and computing technologies, parallel and distributed systems have significantly evolved in recent years. These systems have become essential for addressing modern computational demands, offering enhanced processing power, scalability, and resource efficiency. This paper provides a comprehensive overview of parallel and distributed systems, exploring their interrelationships, their key distinctions, and the emerging trends shaping their evolution. We analyse four parallel computing paradigms—heterogeneous computing, quantum computing, neuromorphic computing, and optical computing—and examine emerging distributed systems such as blockchain, serverless computing, and cloud-native architectures. The associated challenges are highlighted, and potential future directions are outlined. This work serves as a valuable resource for researchers and practitioners aiming to stay informed about trends in parallel and distributed computing while understanding the challenges and future developments in the field.

Keywords: parallel computing; distributed systems; emerging trends; system challenges; future directions

1. Introduction

In the continually advancing field of computing, parallel and distributed systems have emerged as indispensable tools for addressing the escalating demands for computational power, scalability, and efficient resource utilisation. For instance, the rapid growth of artificial intelligence (AI) workloads has driven the need for computing systems capable of processing datasets exceeding petabyte scales, such as those required for training large language models like GPT-4, which involves hundreds of billions of parameters [1]. With advancements in interconnection networks, packaging technologies, system integration, and computational architectures, these systems have demonstrated remarkable improvements in performance, enabling the management of increasingly large-scale and complex workloads [2]. By facilitating the concurrent execution of tasks across multiple processors and nodes, parallel and distributed systems underpin modern solutions to critical computational challenges, including big data analytics, AI, real-time simulations, and cloud-based services.

The significance of parallel and distributed systems extends beyond their computational capabilities, as they play a pivotal role in driving innovation across various industries.
For example, in high-performance computing (HPC), these systems enable climate modelling [3] and molecular dynamics simulations [4], while distributed architectures power applications like global-scale content delivery networks [5] and decentralised finance [6]. A recent study indicates that distributed systems in finance have great potential to improve processing speeds for decentralised applications [7]. However, these benefits come with significant challenges, including scalability, security, interoperability, fault tolerance, legal compliance, and the integration of diverse and heterogeneous resources [7]. Addressing these challenges is essential for ensuring the sustained evolution and utility of parallel and distributed systems.

Despite their critical importance, many existing reviews of parallel and distributed systems either focus narrowly on specific aspects or lack comprehensive analyses of their historical development, emerging trends, and future challenges. This paper aims to bridge this gap by providing a holistic overview of these systems and exploring their evolution, interrelationships, and distinctions. Furthermore, it examines key challenges associated with parallel and distributed systems and proposes actionable future research directions to guide the field’s continued advancement.

The organisation of this paper is illustrated in Figure 1, which provides a clear roadmap of the topics discussed. Section 2 defines parallel and distributed systems, introduces key categories, and explores their interrelationships and distinctions. Section 3 examines emerging trends in parallel systems, focusing on heterogeneous computing, quantum computing, neuromorphic computing, and optical computing. Section 4 explores emerging trends in distributed systems, highlighting blockchain and distributed ledgers, serverless computing, cloud-native architectures, and distributed AI and machine learning (ML) systems. Section 5 discusses the primary challenges facing these systems, providing specific metrics and real-world examples. Section 6 outlines actionable future research directions to address these challenges. Finally, Section 7 concludes this paper.

Figure 1. Logical overview of this paper’s structure. This figure illustrates the organisation of sections, their interdependencies, and the logical progression of topics in this review.

2. Overview of Parallel and Distributed Systems

This section defines parallel and distributed systems, introduces various categories and common architectures, and explores their relationships and synergies. This foundational understanding sets the stage for a deeper examination of their historical context, key concepts, and terminologies.

2.1. Parallel Systems
Parallel systems are computational architectures designed to execute multiple tasks simultaneously by dividing computations into smaller sub-tasks processed concurrently across multiple processors or cores within a single machine or a closely connected cluster [8]. Their primary objective is to reduce computation time and improve performance efficiency, with applications in scientific simulations, image processing, and large-scale data analysis [9]. Key features of parallel systems include concurrency, coordination among processors, and efficient utilisation of shared resources.

Traditional parallel systems can be categorised into three main types: Shared Memory Systems, where multiple processors share a common memory space, allowing for direct communication through shared variables—examples include multi-core processors and symmetric multiprocessors (SMPs); Distributed Memory Systems, in which each processor has its own private memory and communicates with others by passing messages—examples include cluster computing and massively parallel processing (MPP) systems; and Hybrid Systems, which combine shared and distributed memory approaches, often seen in modern supercomputers and HPC clusters to leverage the advantages of both architectures.

Common architectures include Central Processing Units (CPUs), found in everyday devices like laptops and smartphones, enabling parallel task execution to improve performance and efficiency; General-Purpose Graphics Processing Units (GPGPUs), used in gaming, video rendering, and AI applications to perform massive parallel computations; Application-Specific Integrated Circuits (ASICs), custom-designed hardware optimised for specific applications such as cryptocurrency mining and specialised AI algorithms, providing high performance and energy efficiency; and Field-Programmable Gate Arrays (FPGAs), which are reconfigurable silicon devices that can be electrically programmed to implement various digital circuits or systems [10], commonly used in scientific research, aerospace, and defence.

The origins of parallel computing can be traced back to the late 1950s with the advent of vector processors and early supercomputers like the IBM Stretch [11] and the CDC 6600 [12]. Significant advancements occurred in the 1980s with the introduction of MPP systems [13], including the Connection Machine [14] and the Cray series [15]. These systems utilised thousands of processors to perform simultaneous computations, paving the way for modern parallel architectures. In the 1990s and 2000s, the development of multi-core processors [16] and GPGPUs [17] revolutionised parallel computing by making it more accessible and efficient. The rise of ML, big data, and deep learning advancements led to a surge in demand for high-performance parallel processing hardware. However, traditional parallel hardware began to show limitations in providing the necessary processing capacity for AI training. Challenges such as insufficient interconnection bandwidth between cores and processors and the “memory wall” problem—where memory bandwidth cannot keep up with processing speed—became critical bottlenecks. To address these challenges, scientists and engineers have been developing innovative parallel computing systems tailored for AI and other demanding applications.
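To make the divide-and-combine pattern at the heart of parallel systems concrete, the following is a minimal illustrative sketch of our own (not drawn from the cited works): a large computation is split into sub-tasks that worker processes execute concurrently, with the data size, chunking, and worker count chosen arbitrarily.

```python
# Illustrative sketch: dividing one large task into sub-tasks executed
# concurrently by worker processes, then combining the partial results.
from multiprocessing import Pool

def partial_sum(chunk):
    """One sub-task: the sum of squares over a slice of the data."""
    return sum(x * x for x in chunk)

if __name__ == "__main__":
    data = list(range(100_000))
    n_workers = 4                       # hypothetical core count
    step = len(data) // n_workers
    chunks = [data[i:i + step] for i in range(0, len(data), step)]
    with Pool(n_workers) as pool:
        # Each worker processes its chunk concurrently; results are combined.
        total = sum(pool.map(partial_sum, chunks))
    print(total)
```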
Recent innovations, including heterogeneous computing, quantum computing, neuromorphic systems, and optical computing, aim to address these limitations, as discussed in Section 3.

2.2. Distributed Systems

Distributed systems are computational architectures where multiple autonomous computing nodes, often geographically separated, collaborate to achieve a common objective [18]. These nodes communicate and coordinate their actions by passing messages over a network [19]. Distributed systems emphasise fault tolerance, scalability, and resource sharing, making them essential for various applications, including cloud computing, distributed databases, and blockchain networks. Key features of distributed systems include the ability to handle node failures gracefully, scale out by adding more nodes, and efficiently manage distributed resources.

Distributed systems can be categorised into several types: Client–Server Systems, where clients request services and resources from centralised servers—examples include web applications and enterprise software; Peer-to-Peer (P2P) Systems, in which nodes act as both clients and servers, sharing resources directly without centralised control—examples include file-sharing networks and blockchain platforms; Cloud Computing Systems, which provide scalable and flexible resources over the Internet—examples include Amazon Web Services (AWS) and Google Cloud Platform (GCP); and Edge Computing Systems, which process data near the source of generation to reduce latency and bandwidth usage—examples include Internet of Things (IoT) devices and real-time analytics systems. Common architectures in distributed systems include the Client–Server Model, used in web services where web browsers (clients) communicate with web servers to fetch and display content; Cloud Infrastructure, utilised for on-demand resource provisioning, hosting applications, and data storage, as seen in platforms like AWS and GCP; and IoT Networks, which connect various smart devices, enabling them to communicate and perform tasks collaboratively in real time.

The concept of distributed systems emerged in the 1970s with the development of ARPANET, the precursor to the modern Internet [20]. Early distributed systems focused on resource sharing and remote access to computational power. The 1980s and 1990s witnessed the growth of distributed databases [21] and the Client–Server Model [22], which became fundamental in enterprise computing. The 2000s marked the rise of cloud computing and big data, epitomising the distributed system paradigm by providing scalable, on-demand computing resources over the Internet [23]. Technologies like Hadoop and MapReduce [24] enhanced the capability to process large datasets in a distributed manner. More recently, edge computing [25] and the IoT [26] have extended the reach of distributed systems to the periphery of networks, enabling real-time processing and decision-making at the edge. The development of digital cryptocurrencies and advancements in AI have further propelled the growth of distributed systems. In this paper, we focus on emerging trends such as blockchain and distributed ledgers, serverless computing, cloud-native architectures, and distributed AI and ML systems, which will be explored in Section 4.
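As an illustrative sketch of the message-passing coordination just described (our own example, not tied to any specific system in the text), the following simulates two autonomous nodes that cooperate purely by exchanging messages, with queues standing in for the network:

```python
# Illustrative sketch: two autonomous "nodes" coordinating only by message
# passing, the defining communication style of distributed systems.
from multiprocessing import Process, Queue

def worker_node(inbox: Queue, outbox: Queue):
    """A node that serves requests it receives over the 'network'."""
    while True:
        msg = inbox.get()
        if msg == "shutdown":
            break
        # Perform the requested work and reply with a message.
        outbox.put(("result", msg["task"] ** 2))

if __name__ == "__main__":
    to_worker, from_worker = Queue(), Queue()
    node = Process(target=worker_node, args=(to_worker, from_worker))
    node.start()
    to_worker.put({"task": 12})    # a client node sends a request message
    print(from_worker.get())       # ('result', 144)
    to_worker.put("shutdown")
    node.join()
```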
2.3. Relationship and Synergy Between Parallel and Distributed Systems

Parallel and distributed systems are integral to modern computing, each contributing to the efficient execution of large-scale and complex tasks. While they serve distinct purposes, their relationship is characterised by complementary roles and overlapping functionalities. Parallel systems are designed to maximise computational speed within a single machine or tightly coupled cluster [27]. By dividing a large task into smaller sub-tasks and processing them simultaneously across multiple processors, parallel systems achieve significant reductions in computation time. This makes them ideal for HPC applications like AI training and real-time data processing. Distributed systems, on the other hand, are engineered to leverage multiple autonomous nodes that collaborate over a network to achieve a common goal. This architecture prioritises scalability, fault tolerance, and resource sharing, making distributed systems suitable for applications that require robust, scalable, and reliable infrastructure, such as cloud computing and distributed databases.

In some scenarios, parallel and distributed systems can overlap, creating hybrid systems that combine the strengths of both architectures. For instance, a distributed system might employ parallel processing within individual nodes to further enhance performance. Conversely, a parallel system might distribute tasks across closely connected clusters, incorporating distributed computing elements. Both parallel and distributed systems aim to improve computational efficiency and handle large-scale problems, but they do so with different focuses and methods. The primary distinction between parallel and distributed systems lies in their architecture and operational focus:

• Architecture: Parallel systems use multiple processors or cores within a single machine or a closely connected cluster to perform concurrent computations [8]. Distributed systems, on the other hand, involve multiple independent machines that communicate over a network [19].
• Coordination and communication: In parallel systems, communication between processors is typically fast and direct due to their close proximity. Distributed systems require communication over potentially large distances, often leading to higher latency and the need for sophisticated communication protocols.
• Scalability and fault tolerance: Distributed systems are designed to scale out by adding more nodes and are built with fault tolerance in mind [28], allowing them to continue functioning even if some nodes fail. Parallel systems focus on scaling up by adding more processors to a single machine [29], with fault tolerance often a secondary consideration.
• Resource sharing: Distributed systems emphasise resource sharing and collaboration among independent nodes, each potentially equipped with its own local resources, such as distributed memory. Parallel systems concentrate resources within a single system, focusing on components like cache systems to enhance computational power.

Understanding the relationship and differences between parallel and distributed systems is crucial for engineers, researchers, and students as they explore the diverse applications and challenges within these fields. Both systems play vital roles in advancing computational capabilities and addressing the demands of modern technology.

3. Emerging Trends in Parallel Systems

The development of parallel systems has primarily followed two main directions: enhancing existing computing architectures and creating new parallel architectures to adapt to new applications, such as ML.
Industry leaders like Intel, AMD, and NVIDIA exemplify this trend by producing new products based on advanced architectures annually, targeting general tasks, servers, AI training, etc. The rapid development of deep learning has spurred the proposal of many innovative architectures, such as near-memory computing architecture, heterogeneous computing architecture, quantum computing architecture, neuromorphic computing architecture, and optical computing architecture, aimed at overcoming the memory wall of the traditional von Neumann architecture [30]. In response to the increasing volumes of processing data and advancements in AI, we explore the emerging trends in parallel systems across four key areas: heterogeneous computing, quantum computing, neuromorphic computing, and optical computing.

3.1. Heterogeneous Computing

Heterogeneous computing integrates different types of processors and specialised computing units to work together, leveraging their unique strengths to enhance overall system performance and efficiency. As new architectures are proposed and technological advancements continue, heterogeneous computing continues to evolve. To explore the emerging trend of heterogeneous computing within parallel systems, we first examine the evolution of computing and then focus on advanced ultra-heterogeneous computing (UHC). Specifically, we discuss the software and hardware architectures that support UHC and provide an outlook on its future developments.

Figure 2 outlines the evolution of computing, beginning with single-engine serial processing, followed by homogeneous computing, then heterogeneous computing, and culminating in UHC. The evolution of heterogeneous computing can be described in four stages. In the first stage, a single processor handles all computational tasks sequentially, limiting performance to the capabilities of a single processing unit. As the demand for higher performance grew, this led to the second stage, which marked the introduction of homogeneous parallel processing. Here, multiple cores of the same type, such as multi-core CPUs or ASICs, work together to perform tasks in parallel. This approach improves performance by distributing workloads across several identical processors. However, the need to optimise diverse tasks pushed the transition to the third stage: heterogeneous computing. In this stage, two types of processors, such as CPUs and GPUs, are combined to handle various computational tasks more effectively, with each processor type optimised for specific operations, thereby enhancing overall efficiency. Finally, as applications became more complex and diverse, the necessity to maximise computational efficiency and performance led to the final stage: UHC. This stage integrates multiple types of processors, such as CPUs, GPUs, neural processing units (NPUs), and data processing units (DPUs), combining their specialised strengths to address complex computational needs.

Figure 2. Evolution of various computing eras. This figure outlines the evolution of computing, plotted against performance (FLOPS): Stage 1, single-engine serial (single-core processor); Stage 2, homogeneous parallel (multi-core/many-core CPU, GPU, ASIC, etc.); Stage 3, heterogeneous parallel (CPU+XPU); and Stage 4, ultra-heterogeneous parallel (CPU+GPU+NPU+XPU). The different colours in the squares represent the various processor types utilised in each stage.
With the development of technology, we are entering the early stages of UHC, which promises higher performance than in previous eras. For instance, systems integrating CPUs, GPUs, and DPUs have already demonstrated significant improvements in handling various AI tasks [31]. However, such systems rely on the support of both software and hardware. Figure 3 illustrates the software and hardware layers required for UHC systems.

The software layer is responsible for effectively managing and optimising diverse processing units. Software frameworks support seamless communication and coordination between different types of processors, allowing tasks to be dynamically assigned to the most suitable processing unit. Advancements in frameworks like CUDA and OpenCL have significantly enhanced interoperability and workload allocation across processors, enabling efficient dynamic task management [32]. This involves developing sophisticated schedulers, resource managers, and communication protocols that can handle the complexities of UHC environments. Additionally, programming models and languages (e.g., CUDA, OpenCL, OpenMP, MPI, etc.) must evolve to provide abstractions that simplify the development of applications for UHC systems, enabling developers to leverage the full potential of diverse computing resources without needing to manage low-level hardware details [33].

Figure 3. Hardware and software layers of UHC. This figure depicts the essential software and hardware components required for UHC systems, emphasising interoperability and workload distribution: application software; the OS, AI frameworks, databases, etc.; and tool chains, drivers, and software abstractions on the software side; with CPUs, GPUs, NPUs, and XPUs on the hardware side.

The hardware architectures for UHC integrate multiple processing units into a cohesive system. This involves designing interconnects that provide high-bandwidth, low-latency communication between CPUs, GPUs, NPUs, DPUs, and other specialised processors. Memory architectures will also evolve to support efficient data sharing and movement between different processing units, minimising bottlenecks and maximising throughput. Innovations like 3D stacking and advanced co-packaging technologies play a pivotal role in enabling UHC systems by reducing communication delays and improving system performance [34].

The future of UHC is promising, with potential applications spanning various fields, including AI, scientific computing, and real-time data processing. As demand grows for more powerful and efficient systems, UHC architectures are poised to become increasingly prevalent. Advances in both technological infrastructure and development frameworks will be instrumental in driving this evolution, facilitating systems that seamlessly integrate diverse processing units to deliver unparalleled performance and efficiency.
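To give a flavour of the kind of decision a UHC software layer makes, here is a deliberately simplified, hypothetical Python sketch of affinity-based task dispatch; the unit names and affinity table are invented for illustration and do not correspond to any specific framework or scheduler.

```python
# Illustrative sketch: assigning each task to the most suitable processing
# unit, the core job of a UHC scheduler. The affinity table is hypothetical.
TASK_AFFINITY = {
    "matrix_multiply": "GPU",   # dense, massively parallel arithmetic
    "neural_inference": "NPU",  # tensor operations on a dedicated engine
    "packet_filtering": "DPU",  # data-path offload near the network
    "control_logic":    "CPU",  # branchy, latency-sensitive code
}

def dispatch(task_name: str) -> str:
    """Return the processing unit a scheduler might choose for this task."""
    return TASK_AFFINITY.get(task_name, "CPU")  # fall back to the CPU

for task in ["neural_inference", "matrix_multiply", "control_logic"]:
    print(f"{task} -> {dispatch(task)}")
```

A production scheduler would, of course, also weigh runtime factors such as current load, data placement, and transfer costs rather than a static table.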
3.2. Quantum Computing

Quantum computing represents a significant departure from classical computing paradigms, utilising the principles of quantum mechanics to perform computations. Unlike classical computers that process information as binary bits (0’s and 1’s), quantum computers leverage quantum bits (qubits), which can exist in multiple states simultaneously due to the phenomenon of superposition. This enables quantum computers to process vast amounts of information in parallel, making them particularly powerful for certain types of computations. Quantum computing research began in the 1980s [35]. Although its initial development was slow due to technological barriers, it has accelerated rapidly in recent decades with the scaling up of qubit numbers in superconducting systems [36]. To explore the emerging trends in quantum computing, we start by discussing quantum computers and their applications, followed by an explanation of the different types of qubits and their development trends. Finally, we conclude with an overview of the current state and future prospects of quantum computing.

Quantum computers leverage qubits, which can exist in multiple states simultaneously (superposition) and be entangled with one another, enabling exponential increases in computational power for certain types of problems [37]. To illustrate superposition, consider a coin spinning in the air: unlike a classical bit that is either heads or tails, a qubit remains in a combination of both states until measured. Similarly, entanglement can be visualised as a pair of dice that always show the same number, regardless of their distance from each other. Despite these advantages, qubits are highly sensitive to environmental noise and interactions, leading to stability issues and significant error rates. These limitations present a major challenge to the development of practical quantum systems, as maintaining coherence and minimising errors often require complex error correction protocols and cryogenic environments.

Quantum gates are designed to manipulate the coefficients of basis states, performing general functions akin to logic gates in traditional computing systems [38]. Another essential concept, quantum interference, allows quantum algorithms to amplify correct solutions while cancelling out incorrect ones, significantly improving computational efficiency. Quantum algorithms specifically exploit the principles of superposition, entanglement, and quantum interference to execute computations more efficiently than classical computers [39]. Building on these unique properties, quantum computing holds promise for solving complex problems currently intractable for classical computers, such as large-scale optimisation, cryptography, and quantum physical system simulation [40]. Major technology companies and research institutions are heavily investing in quantum computing research, driving rapid advancements in practical quantum computers and efficient quantum algorithms.
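As a small numerical illustration of superposition and entanglement, the following sketch of ours simulates a two-qubit Bell state with NumPy on a classical machine; it is purely pedagogical, since real quantum hardware cannot be simulated this simply at scale.

```python
# Illustrative sketch: a Hadamard gate creates superposition, a CNOT gate
# entangles two qubits, and measurement yields only correlated outcomes.
import numpy as np

H = np.array([[1, 1], [1, -1]]) / np.sqrt(2)   # Hadamard gate
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]])                # flips qubit 1 if qubit 0 is 1

state = np.array([1, 0, 0, 0], dtype=float)    # start in |00>
state = np.kron(H, np.eye(2)) @ state          # qubit 0 into superposition
state = CNOT @ state                           # entangle the two qubits

probs = state ** 2                             # measurement probabilities
samples = np.random.choice(["00", "01", "10", "11"], size=10, p=probs)
print(probs)     # [0.5 0. 0. 0.5]: like the paired dice, only 00 and 11 occur
print(samples)
```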
There are various physical systems to realise qubits, each offering distinct advantages and contributing to the overall progress in quantum computing. Superconducting qubits utilise superconducting circuits and are among the most mature technologies in this domain [36]. However, they require extremely low temperatures, increasing operational complexity and cost. Silicon qubits, based on semiconductor technology similar to classical computer chips [41], offer compatibility with existing fabrication techniques but face scalability challenges, as quantum coherence deteriorates with size. Trapped-ion qubits use ions trapped in electromagnetic fields and manipulated with lasers [42], known for their high fidelity, but their operations are inherently slower, posing limitations for large-scale computations. Neutral atom qubits employ neutral atoms trapped in optical lattices [43], facilitating scalable quantum computing, yet achieving consistent trapping and manipulation across large arrays remains challenging. Diamond-based qubits utilise nitrogen-vacancy centres in diamonds [44], which can be manipulated at room temperature but often suffer from low qubit density and complex fabrication. Photonic qubits use photons to encode quantum information [45], providing advantages in communication due to their speed and low loss, but their integration into computational frameworks and achieving scalable photonic processors remain significant hurdles.

The current state of quantum computing demonstrates a promising trajectory, with continuous advancements in qubit technology and quantum algorithms. Despite earlier bottlenecks in qubit stability, fidelity, and scalability, ongoing research has successfully addressed many of these issues, enabling steady progress in increasing qubit numbers. As depicted in Figure 4, the number of qubits in quantum processors has been steadily increasing across different technologies. IBM’s roadmap outlines plans to scale its Flamingo systems to 1000 qubits by 2027 and deliver quantum-centric supercomputers with thousands of logical qubits by 2030 and beyond [46]. This trend highlights quantum computing’s potential to revolutionise fields requiring immense computational power, such as materials science, AI, and high-energy physics.

Figure 4. Qubit growth in quantum computers over recent years (1998–2024). This figure presents the increasing number of qubits in quantum processors across superconducting, silicon, trapped-ion, neutral-atom, diamond-based, and photonic technologies, reflecting advancements in quantum computing technology.

Current applications of quantum computers span a wide range of domains, demonstrating tangible benefits in solving complex problems that challenge classical systems. In cryptography, quantum computers are revolutionising secure communication by exploiting superposition and entanglement to enhance encryption and decryption processes [47]. Similarly, in molecular simulation, quantum algorithms enable precise modelling of molecular structures and interactions, crucial for drug discovery [48], materials science [49], and other chemistry-related fields [50]. These advancements hold the potential to accelerate breakthroughs in healthcare, energy, and environmental sustainability. Moreover, financial modelling is another promising domain, where quantum computers optimise portfolios, predict market trends, and manage risk with unprecedented speed and accuracy [51].

The rise of quantum machine learning (QML) adds a new dimension to the application of quantum computers. QML leverages quantum algorithms to enhance ML tasks such as classification, pattern recognition, and autonomous decision-making [52]. By leveraging quantum speed-ups, QML can process complex datasets more efficiently than classical methods, offering advantages in fields such as finance, healthcare, and AI. Figure 5 illustrates the workflow of QML, highlighting the interaction between quantum data, quantum gates, and ML models in tasks such as image classification and dynamic decision-making in autonomous systems.
In conclusion, quantum computing represents one of the most transformative trends in the evolving landscape of parallel systems. By harnessing the fundamental principles of quantum mechanics, quantum computing is poised to complement classical HPC, unlocking unprecedented computational power for scientific discovery and industrial applications.

Figure 5. Overview of QML. This figure illustrates the integration of quantum computing principles in ML, showing how quantum algorithms leverage qubit-based computation for tasks such as pattern recognition and classification, with quantum data flowing between processing units over quantum channels (indicated by the green arrows).

3.3. Neuromorphic Computing

Neuromorphic computing is a class of brain-inspired computing architectures which, at a certain level of abstraction, simulate the biological computations of the brain. This approach enhances the efficiency of compatible computational tasks, achieving computational delays and energy consumption comparable to those of biological computation. The term “neuromorphic” was introduced by Carver Mead in the late 1980s [53,54], referring to mixed analogue–digital implementations of brain-inspired computing. Over time, as technology evolved, it came to encompass a wider range of brain-inspired hardware implementations. Specifically, unlike the von Neumann architecture’s CPU–memory separation and synchronous clocking, neuromorphic computing utilises neurons and synapses, the fundamental components, to integrate computation and memory. It employs an event-driven approach based on asynchronous event-based spikes, which is more efficient for brain-like sparse and massively parallel computing, significantly reducing energy consumption. At the algorithmic level, the brain-inspired Spiking Neural Network (SNN) serves as an essential algorithm deployed on neuromorphic hardware, efficiently completing ML tasks [55,56] and other operations [57,58]. Recent advancements in VLSI technology and AI have propelled neuromorphic computing towards large-scale development [59]. This section introduces developments in neuromorphic computing from both hardware and algorithmic perspectives and discusses future trends.

IBM TrueNorth is based on distributed digital neural models designed to address cognitive tasks in real time [60]. Its chip contains 4096 neurosynaptic cores, each core featuring 256 neurons, with each neuron having 256 synaptic connections. On the one hand, the intra-chip network integrates 1 million programmable neurons and 256 million trainable synapses; on the other hand, the inter-chip interface supports seamless multi-chip communication of arbitrary size, facilitating parallel computation. By using offline learning, various common algorithms such as convolutional networks, restricted Boltzmann machines, hidden Markov models, and multi-modal classification have been mapped to TrueNorth, achieving good results in real-time multi-object detection and classification tasks with milliwatt-level energy consumption.

Neurogrid, a tree-structured neuromorphic computing architecture, fully considers neural features such as the axonal arbor, synapse, dendritic tree, and ion channels to maximise synaptic connections [61]. Neurogrid uses analogue signals to save energy and a tree structure to maximise throughput, allowing it to simulate 1 million neurons and billions of synaptic connections with only 16 neurocores and a power consumption of only 3 watts.
Neurogrid’s hardware is suitable for real-time simulation, while its software can be used for interactive visualisation. As one of the neuromorphic computing platforms contributing to the European Union Flagship Human Brain Project (HBP), SpiNNaker is a parallel computation architecture with a million cores [62]. Each SpiNNaker node has 18 cores, connected by a system network-on-chip. Nodes select 1 neural core to act as the monitor processor, assigned an operating system support role, while another 16 cores support application roles, with the 18th core reserved as a fault-tolerance spare. Nodes communicate through a router to complete parallel data exchange. SpiNNaker can be used as an interface with AER sensors and for integration with robotic platforms.

Intel’s Loihi is a neuromorphic research processor supporting multi-scale SNNs, achieving performance comparable to mainstream computing architectures [63,64]. Loihi features a maximum of 128,000 neurons per chip with 128 million synapses. Its unique capabilities include a highly configurable synaptic memory with variable weight precision, support for a wide range of plasticity rules, and graded reward spikes that facilitate learning. Loihi has been evaluated in various applications, such as adaptive robot arm control, visual–tactile sensory perception, modelling diffusion processes for scientific computing applications, and solving hard optimisation problems like railway scheduling. Loihi 2 [65], a new generation of neuromorphic computing and an upgrade of Loihi, is equipped with generalised event-based messaging, greater neuron model programmability, enhanced learning capabilities, numerous capacity optimisations to improve resource density, and faster circuit speeds. Importantly, besides the features of Loihi, Loihi 2 has shared synapses for convolution, which is ideal for deep convolutional neural networks.

SNNs are an essential algorithmic component of neuromorphic computing. To accomplish a task, we should consider how to define a tailored SNN and deploy it on hardware [54]. From a training perspective, algorithms can be categorised into online learning and offline learning. The online-learning approach first deploys the SNN on neuromorphic hardware and then uses its plasticity features to approximate backpropagation; this is a real-time method that optimises plasticity directly in hardware. Offline learning involves training an Artificial Neural Network (ANN) on a CPU or GPU based on specific tasks and data, then converting the ANN to an equivalent SNN and deploying it on neuromorphic hardware. As a key to training algorithms, various studies have analysed backpropagation.

An Energy-Efficient Backpropagation approach successfully implemented backpropagation on TrueNorth hardware [56]. Importantly, this method treats spikes and discrete synapses as continuous probabilities, allowing the trained network to map to neuromorphic hardware through probability sampling. This training method achieved 99.42% accuracy on the MNIST dataset with only 0.268 mJ per image. Furthermore, backpropagation through time (BPTT) has been implemented on neuromorphic datasets, providing a training method for recurrent structures on neuromorphic platforms [66]. Benefiting from these training optimisations, SNNs in neuromorphic computing have been applied in various ML tasks such as simultaneous velocity and texture classification [67], real-time facial expression recognition [68], and EMG gesture classification [69]. Similarly, they have been used in neuroscience research [70,71].
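To give a flavour of the event-driven computation these platforms perform, here is a minimal, illustrative leaky integrate-and-fire (LIF) neuron in Python, the basic unit from which SNNs are built; the parameter values are arbitrary and do not correspond to any specific chip or study cited above.

```python
# Illustrative sketch: a leaky integrate-and-fire neuron. Computation is
# event-driven: a spike is emitted only when the membrane potential
# crosses threshold, after which the potential resets.
import numpy as np

def lif_neuron(input_current, threshold=1.0, leak=0.9, v_reset=0.0):
    """Integrate input over time; emit a spike when the threshold is crossed."""
    v, spikes = 0.0, []
    for i in input_current:
        v = leak * v + i              # leaky integration of incoming current
        if v >= threshold:            # threshold crossing: an output event
            spikes.append(1)
            v = v_reset               # reset after the spike
        else:
            spikes.append(0)
    return spikes

rng = np.random.default_rng(0)
print(lif_neuron(rng.uniform(0.0, 0.5, size=20)))   # sparse spike train
```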
SNN-based neuromorphic computing is also utilised in non-ML tasks. Benefiting from the neuromorphic vertex–edge structure, graph theory problems can be mapped onto the hardware [58,72,73]. Additionally, it has been applied to solving NP-complete problems [74].

Neuromorphic computing often aims to replicate aspects of biological neural processing in hardware, but there is an ongoing debate over how strictly such systems must adhere to biophysical plausibility versus employing more abstract ML methods. On the one hand, SNN models, such as the Izhikevich formulation [75], focus on capturing the temporal dynamics of real neurons, which can yield insights into how biological brains encode and process information. Research has shown that such models can replicate a variety of neuronal firing patterns with computational efficiency, providing a bridge between computational neuroscience and neuromorphic engineering [76]. On the other hand, more traditional ML algorithms, such as Bayesian inference [77], support vector machines [78], or the large language models [79] dominating modern AI, tend to trade some fidelity to biological detail for mathematical tractability, scalability, and often better empirical performance on a range of industrial tasks.

Despite the proven feasibility of neuromorphic computing in many tasks, it remains largely experimental. In today’s landscape of energy-consuming AI driven by GPU clusters, bringing neuromorphic computing out of the lab and achieving performance equal to or better than GPU-based AI with low energy consumption is a significant trend [80–82]. Standardised hardware protocols and community-maintained software will be crucial. From a neuroscience research perspective, neuromorphic computing simulates brain structures to varying degrees. Leveraging these simulations could provide new insights into neural mechanisms and brain function. Neuromorphic computing has a closed-loop relationship with both AI and neuroscience, drawing inspiration from and serving both fields, tightly linking their development and advancing our understanding of intelligence.

3.4. Optical Computing

Optical computing utilises the properties of light to perform parallel computations, offering the potential to significantly exceed the speed and efficiency of electronic computing [83]. Unlike electronic computing, which relies on the movement of electrical charges, optical computing uses photons to carry and process information. Because light travels faster and experiences minimal resistance, optical computing has the potential to significantly improve processing speeds and energy efficiency. Research in optical computing can be traced back to the early 1960s [84]. Over the years, the primary focus of optical computing has been on integrating optical components for communication within computer systems or incorporating optical functions as electronic technology has advanced [84]. Although these elements remain under development and have yet to mature, the adaptation and exploration of optical computing, especially in AI, have grown rapidly in recent years due to the boom in AI and the limitations of traditional electrical architectures. To explore the emerging trends in optical computing, we first examine the different categories of optical computing systems and then discuss the potential and outlook of optical computing in AI.
Optical computing systems can be categorised into analogue, digital, and hybrid optical computing systems. Each category differs in how it processes information, balancing speed, precision, and scalability. Analogue optical computing systems (AOCS) utilise the continuous nature of light to perform computations, leveraging properties such as intensity, phase, and wavelength to represent and process data. This enables high precision and real-time processing capabilities, making AOCS suitable for signal processing and image recognition applications. On the other hand, digital optical computing systems (DOCS) operate on binary principles similar to traditional electronic computers, where light is used to represent binary data (0’s and 1’s) and perform logical operations through optical gates. DOCS can achieve exceptionally high-speed processing and parallelism, ideal for tasks requiring rapid data computation. However, scalability and integration difficulties pose significant challenges for DOCS, particularly in large-scale systems. Hybrid optical computing systems (HOCS) combine the strengths of both analogue and digital approaches, integrating continuous and discrete data representations to optimise performance across a broader range of applications. By leveraging the unique advantages of light, such as its speed and bandwidth, these hybrid systems can enhance computational efficiency and open new frontiers in fields such as telecommunications, AI, and scientific simulations. Table 1 summarises the features of these three systems.

Table 1. Comparison of optical computing systems. This table compares analogue optical computing systems (AOCS), digital optical computing systems (DOCS), and hybrid optical computing systems (HOCS) based on key characteristics such as data type, speed, error susceptibility, complexity, integration challenges, and applications.

Feature              | AOCS                           | DOCS                            | HOCS
Data type            | Continuous                     | Discrete (binary)               | Both continuous and discrete
Speed                | Very high                      | High                            | High
Error susceptibility | Higher                         | Lower                           | Balanced
Complexity           | Lower                          | Higher                          | Medium
Integration          | Challenging                    | Easier                          | Moderate
Applications         | Real-time processing, imaging  | Logic operations, data storage  | Neural networks, adaptive optics

Optical computing also leverages optical components such as microring resonators (MRRs) and Mach–Zehnder interferometers (MZIs) to design essential elements such as logic gates, switches, storage devices, routers, and photonic integrated circuits. Microring resonators act as miniature loops that guide and filter light, while Mach–Zehnder interferometers function as optical switches, enabling precise control over light-based computations (a small numerical sketch of an MZI follows below). The development of these components has been pivotal in advancing the field, leading to the creation of more compact and powerful photonic circuits. Initially, research focused on the fundamental properties of light and how it could be manipulated for computation. Over time, advancements in materials science and nanofabrication have enabled greater miniaturisation and improved integration of optical components.
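The following sketch models a single lossless MZI as a 2×2 unitary transfer matrix under one common parameterisation (ours, chosen for simplicity; real devices differ in convention and suffer loss). Meshes of such 2×2 blocks can compose larger unitary matrices, which is the basis of the optical matrix multiplication discussed next.

```python
# Illustrative sketch: an MZI as a 2x2 unitary acting on two optical modes.
# Parameterisation (phase shifter followed by a rotation) is illustrative.
import numpy as np

def mzi(theta, phi):
    """Unitary transfer matrix of a single lossless MZI."""
    rotation = np.array([[np.cos(theta), -np.sin(theta)],
                         [np.sin(theta),  np.cos(theta)]])
    phase = np.diag([np.exp(1j * phi), 1.0])   # phase on the upper arm
    return rotation @ phase                    # product of unitaries is unitary

light_in = np.array([1.0, 0.0])                # all power enters port 1
light_out = mzi(np.pi / 4, np.pi / 2) @ light_in
print(np.abs(light_out) ** 2)                  # power split: [0.5 0.5]
print(np.isclose(np.sum(np.abs(light_out) ** 2), 1.0))  # power conserved
```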
Optical computing is making significant strides across multiple domains, particularly in telecommunications, AI, and HPC. Key areas of impact include the following:

1. Telecommunications: Optical components enhance data transmission speeds and network capacity. Photonic technologies in fibre-optic networks reduce latency and increase bandwidth, making them integral to modern communication infrastructures [85].
2. AI: Optical neural networks, particularly those utilising MZIs, enable AI computations at speeds beyond conventional electronic processors. One notable example is the use of optical matrix multiplication for accelerating deep learning models, significantly reducing energy consumption in AI training [86].
3. HPC: The integration of photonic integrated circuits (PICs) and photonic–electronic co-design is advancing HPC infrastructures [87]. The adoption of optical interconnects in HPC provides high bandwidth and lower energy consumption, significantly improving data transfer efficiency for large-scale simulations and AI training [88].

Despite these advancements, optical computing still faces challenges, including manufacturing complexities, optical loss, and crosstalk, which hinder large-scale adoption [89]. However, ongoing research in photonic materials and integrated circuit design continues to address these limitations, paving the way for more scalable optical computing solutions.

Optical computing has evolved significantly with advances in component technology, growing applications, and increasing research interest. While it is not yet poised to replace electronic computing entirely, it is expected to play a complementary role, particularly in areas demanding ultra-fast, energy-efficient computations. As research progresses, breakthroughs in nanophotonics, integrated optical chips, and AI-driven photonic computing will likely drive optical computing toward mainstream adoption. With further improvements in scalability and integration, optical computing may soon redefine HPC, revolutionising fields such as AI, communications, and beyond. For readers interested in a more in-depth exploration of optical computing, the review papers [84,86,90], as well as the book [91], provide comprehensive insights into its fundamentals and applications.

4. Emerging Trends in Distributed Systems

In the rapidly evolving landscape of computing, distributed systems have become integral to handling the scale, complexity, and diversity of modern applications. By leveraging multiple interconnected computing resources, distributed systems provide scalable, resilient, and efficient solutions that traditional centralised systems cannot offer. As data volumes grow exponentially and applications demand real-time processing and decision-making, innovative approaches in distributed computing are essential. This section explores the emerging trends in distributed systems, focusing on four key areas: blockchain and distributed ledgers, serverless computing, cloud-native architectures, and distributed AI and ML systems. These advancements are redefining how data are managed, processed, and secured across various industries, enabling new possibilities while addressing critical scalability, efficiency, security, and privacy challenges.

4.1. Blockchain and Distributed Ledgers

The concept of the blockchain was introduced by Satoshi Nakamoto in 2008, in the wake of the 2007–2008 global financial crisis [92]. Though he did not formally define the blockchain, he demonstrated the blockchain concept for electronic cash (called Bitcoin) transfers where no central authority is needed to prevent double-spending. The first successful Bitcoin transaction took place in 2009, when Satoshi Nakamoto transferred 10 BTC (Bitcoin) to Hal Finney.
Nakamoto’s design uses a peer-to-peer network to timestamp transactions through a hash-based Proof-of-Work chain, which acts as an unchangeable record unless the Proof of Work is redone. However, the concept of blockchain is fundamentally based on three elements: (i) the blind signature, a cryptographic concept proposed by David Chaum in 1989 for the automation of payment [93]; (ii) timestamped documents, which secure digital documents by stamping them with the date [94]; and (iii) Reusable Proof of Work (RPoW), a mechanism for preventing double-spending and securing decentralised networks, extended into a reusable format by Hal Finney in 2004 [95]. Therefore, researchers formally defined the blockchain as a meta-technology which combines several computing techniques [96]. However, the most widely adopted definition of blockchain is a distributed digital ledger technology with a ledger of transactions, or blocks, that form a systematic, linear chain of all transactions ever made. Blockchain presents timestamped and immutable blocks of highly encrypted and anonymised data not owned or mediated by any specific person or group [97,98].

A block in a blockchain is primarily identified by its block header hash or block hash, a cryptographic hash made by hashing the block header twice through the SHA256 algorithm. In addition, a block can also be identified by the block height, which is its position in the blockchain or the number of blocks preceding it in the blockchain. The Merkle tree offers a secure and efficient way to create a digital fingerprint for the complete set of transactions. A blockchain structure is shown in Figure 6, where blocks are connected through their respective hash codes.

Figure 6. Basic structure of a blockchain block. This figure presents the fundamental components of a blockchain block, explaining how distributed ledger technology ensures security and integrity in decentralised networks.

Distributed ledger technology (DLT) is the underlying generalised concept that makes the blockchain work in a distributed platform. The concept of DLT incorporates principles from “The Byzantine Generals Problem”, described by Lamport et al. [99], which evaluates strategies for achieving consensus in distributed systems despite conflicting information in an adversarial environment. Consensus protocols, like Proof of Stake, allow participants to achieve a shared view of the ledger without intermediaries. Emerging mechanisms, such as Proof of Space and Proof of Authority [100], have gained attention for their lower energy consumption and faster transaction verification times compared to Proof of Work. These mechanisms aim to address the inefficiencies and environmental impacts associated with traditional methods, offering tailored solutions for specific use cases. Additionally, cryptographic techniques, such as the Schnorr Signature Scheme and the Merkle tree, enhance data integrity and trust within blockchain frameworks, reinforcing secure data verification processes [101]. A distributed ledger is a digital record maintained across a network of machines, known as nodes, with any updates being reflected simultaneously for all participants and authenticated through cryptographic signatures [102].
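The double-SHA256 block hash and the Merkle-tree fingerprint described above can be sketched in a few lines of Python. The header layout below is simplified for illustration and does not follow Bitcoin’s exact serialisation.

```python
# Illustrative sketch: double-SHA256 block hashing and a Merkle root over
# transaction hashes, as described in the text. Header fields simplified.
import hashlib
import json
import time

def sha256d(data: bytes) -> str:
    """Hash twice with SHA256, as used for block header hashes."""
    return hashlib.sha256(hashlib.sha256(data).digest()).hexdigest()

def merkle_root(tx_hashes):
    """Pairwise-hash transaction hashes up to a single root fingerprint."""
    layer = list(tx_hashes)
    while len(layer) > 1:
        if len(layer) % 2:                      # duplicate last hash if odd
            layer.append(layer[-1])
        layer = [sha256d((a + b).encode())
                 for a, b in zip(layer[::2], layer[1::2])]
    return layer[0]

txs = [sha256d(t.encode()) for t in ["tx1", "tx2", "tx3"]]
header = {"prev_hash": "00" * 32,               # link to the previous block
          "merkle_root": merkle_root(txs),
          "timestamp": int(time.time())}
print(sha256d(json.dumps(header, sort_keys=True).encode()))  # block hash
```

Because each header embeds the previous block’s hash, altering any transaction changes the Merkle root and every subsequent block hash, which is what makes the chain tamper-evident unless the Proof of Work is redone.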
Beyond cryptocurrencies, blockchain’s applications span a wide range of industries, including eHealth [103,104], intellectual property [105,106], education, digital identity, finance [107–109], supply chain [110–112], IoT [113–115], etc. In supply chain management, blockchain frameworks such as IBM Food Trust provide end-to-end traceability, ensuring transparency and accountability. Case studies, such as Walmart’s use of blockchain to track food provenance, have quantified significant reductions in tracing times, from days to seconds, illustrating blockchain’s potential to streamline operations and mitigate fraud. In healthcare, blockchain’s anonymity and immutability features make it unparalleled for secure information sharing among different providers, forming the foundation of modern healthcare, alternatively termed Healthcare 5.0. Numerous frameworks such as MeDShare [116], MedBlock [117], HealthBlock [118], and BLOSOM [119] have been developed to secure patient records. BCIF-EHR, an interoperable blockchain-based framework proposed in [103], facilitates seamless sharing and integration of electronic health records (EHRs) while preserving privacy and security. However, the framework requires a decentralised authentication and access control mechanism to restrict access to authorised entities only. Addressing this limitation, TrustHealth [104] integrates blockchain with a trusted execution environment, designing a secure database that ensures the confidentiality and integrity of EHRs. TrustHealth also incorporates a secure session key generation protocol, enabling secure communication channels between healthcare providers and the trusted execution environment. Such advancements exemplify blockchain’s ability to transform healthcare by improving interoperability, security, and trust.

Despite its broad applicability, blockchain faces challenges such as latency and high energy consumption, particularly in Proof-of-Work-based systems. These issues can hinder real-time applications and raise concerns about environmental sustainability. Additionally, blockchain’s reliance on distributed consensus mechanisms can lead to cold-start issues in networks with low node participation, delaying transaction validation. Overall, blockchain’s transformative potential lies in its ability to provide secure, transparent, and decentralised solutions across diverse sectors, fundamentally changing how data integrity and trust are managed.

4.2. Serverless Computing

The concept of serverless computing emerged in the mid-2000s with cloud services like Amazon S3 and EC2, which simplified infrastructure management for developers [120]. However, a major breakthrough came in 2014 with the introduction of AWS Lambda [121], which established the Function-as-a-Service (FaaS) model. This allowed developers to execute code in response to events without managing servers, providing automatic scaling and reducing operational overhead [122]. IBM OpenWhisk (2016) later expanded on this concept by offering an open-source alternative that prioritised flexibility [123]. Further advancements included Microsoft Azure Functions [124] and Google Cloud Run [125], which integrated containerised workloads to extend serverless capabilities.

Serverless computing provides automatic scalability, eliminating the need for manual resource management. A key advantage of this model is its “pay-as-you-go” pricing structure, where users pay only for the compute time they consume rather than pre-allocated resources, significantly reducing costs for variable workloads [126]. These benefits make serverless computing ideal for applications such as web services [127], IoT [128], and large-scale data processing [129].
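The FaaS programming model reduces an application to stateless functions invoked once per event, with provisioning and scaling left to the platform. The sketch below follows the common AWS Lambda handler convention; the event shape and field names are hypothetical.

```python
# Illustrative sketch: a stateless FaaS handler invoked per event.
# There is no server to provision; the platform scales invocations.
import json

def handler(event, context=None):
    """Runs once per event, e.g. an HTTP POST forwarded by the platform."""
    order = json.loads(event["body"])                      # hypothetical payload
    total = sum(item["price"] * item["qty"] for item in order["items"])
    return {"statusCode": 200, "body": json.dumps({"total": total})}

# Local simulation of a single invocation:
event = {"body": json.dumps({"items": [{"price": 9.5, "qty": 2}]})}
print(handler(event))   # {'statusCode': 200, 'body': '{"total": 19.0}'}
```

Billing under the pay-as-you-go model then covers only the milliseconds such a handler actually runs, which is what makes the model attractive for the variable workloads described next.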
Industries including finance, healthcare, and e-commerce utilise serverless computing to enable rapid scaling and resource efficiency. Major companies like Netflix and Airbnb rely on serverless architectures to handle fluctuating traffic loads, ensuring a smooth user experience during peak demand [130]. Studies indicate that serverless platforms can handle up to 10,000 concurrent function executions while maintaining response times below 500 ms, making them suitable for real-time applications [131].

Despite its advantages, serverless computing presents several challenges. One major concern is cold-start latency, which occurs when an idle function is invoked and requires initialisation. To mitigate this, techniques such as function pre-warming, optimising container configurations, and adjusting function granularity have been developed, reducing cold-start delays by up to 50% in production environments [132,133]. Another issue is vendor lock-in, where applications become dependent on proprietary cloud provider implementations. To overcome this, multi-cloud serverless frameworks like Knative and OpenFaaS have emerged, allowing developers to deploy serverless workloads across multiple providers, increasing flexibility and reducing dependency risks [134]. Furthermore, serverless architectures are not well suited for long-running processes, as they impose execution time limits. Hybrid serverless–edge computing models are increasingly being explored to process latency-sensitive workloads closer to the data source, particularly for IoT applications [135].

Ongoing advancements aim to enhance serverless computing’s flexibility and performance. AI-based function pre-warming, such as Alibaba Cloud’s Function Compute prediction models, proactively warms up instances to reduce startup delays [136]. Federated serverless architectures provide cost-effectiveness and resource efficiency [137]. Additionally, confidential computing techniques like secure enclaves are being integrated to enhance function-level security, mitigating multi-tenant isolation concerns [138]. For high-frequency workloads, the unpredictable costs of serverless computing can sometimes make traditional cloud computing a more economical option. Research into more transparent and cost-efficient serverless pricing structures is ongoing [139].

Overall, while serverless computing offers scalability, cost efficiency, and operational flexibility, its adoption requires addressing challenges related to latency, vendor dependence, and security. Continued advancements in optimisation techniques, multi-cloud interoperability, and pricing models will further enhance its impact on the future of cloud computing.

4.3. Cloud-Native Architectures

Cloud-native architectures began with distributed systems research in the 1990s [140] and the introduction of virtualisation by VMware in 1998 [141]. In 2006, AWS launched EC2 and S3, making on-demand cloud services widely available [120]. DevOps ideas took off around 2009 [142], combining development and operations to speed up software delivery. Docker emerged in 2013 as a platform for packaging applications into lightweight containers [143], followed by Google’s open-source release of Kubernetes in 2014 to orchestrate and manage containerised workloads [144].
The Cloud Native Computing Foundation (CNCF) formed in 2015 and made Kubernetes its first project [144], while AWS Lambda (launched in 2014) introduced serverless computing [121], and service meshes emerged to handle microservice communication [128].

Today, cloud-native architecture optimises cloud application performance by integrating microservices, containerisation, and continuous integration/continuous delivery (CI/CD), as shown in Figure 7 [145,146]. These techniques enable modularity, scalability, and reliability: microservices divide applications into independent, manageable services; containerisation ensures consistent deployment; and CI/CD accelerates the development life cycle, creating a robust framework for efficiently handling dynamic workloads. Tools like Docker and Kubernetes simplify container orchestration, streamline scaling, and accelerate deployment pipelines [147]. Unlike traditional monolithic structures, cloud-native applications are modular, allowing components to be managed, scaled, and updated independently. This makes cloud-native architectures highly effective in dynamic environments demanding rapid iteration and resilient deployment [148–150].

Alongside these core components, advanced communication paradigms such as Partitioned Global Address Space (PGAS) models and Remote Direct Memory Access (RDMA) further enhance cloud-native platforms [151]. PGAS models provide a shared memory abstraction across distributed systems, emphasising data locality and reducing communication overhead, making them particularly suitable for high-performance applications in cloud environments. RDMA further improves infrastructure efficiency by enabling direct memory-to-memory transfers between nodes, bypassing CPU involvement to minimise latency and maximise throughput. These technologies are critical for optimising the performance of modern distributed systems and are increasingly adopted in cloud-native platforms.

Cloud-native architectures also play a pivotal role in Industry 4.0, where real-time data processing across IoT and edge devices matters most. These architectures enable smooth integration in distributed systems, managing large-scale, latency-sensitive data efficiently [152]. By incorporating PGAS and RDMA, these architectures can handle complex data flows and resource-intensive tasks with greater efficiency, supporting the scalability demands of Industry 4.0. One prominent design element of cloud-native systems is the multi-cloud and distributed cloud model, which enables the deployment of applications across multiple cloud providers [153]. This increases availability and avoids vendor lock-in, giving enterprises flexibility and resilience by utilising unique services across cloud platforms [154].

Figure 7. Key building blocks of a cloud-native architecture. This figure illustrates the four fundamental components of cloud-native systems: containers, microservices, DevOps, and CI/CD. These elements enable scalability, automation, and continuous deployment in modern cloud computing environments.

On top of this, cloud-native architectures leverage Platform-as-a-Service (PaaS) environments to simplify infrastructure management and scaling [155].
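To ground the PGAS idea introduced above, the sketch below uses Python's multiprocessing.shared_memory module to give two processes zero-copy access to disjoint partitions of one buffer. This is only a single-node analogue: real PGAS runtimes and RDMA hardware extend the same zero-copy, locality-aware principle across machines, which this toy example does not attempt.

    # Single-node analogue of a PGAS-style shared buffer (illustrative only).
    import numpy as np
    from multiprocessing import Process, shared_memory

    def worker(shm_name, start, stop):
        # Attach to the existing segment: no data are copied between processes.
        shm = shared_memory.SharedMemory(name=shm_name)
        view = np.ndarray((1000,), dtype=np.float64, buffer=shm.buf)
        view[start:stop] += 1.0   # each worker updates only its own partition
        del view                  # release the buffer before closing the mapping
        shm.close()

    if __name__ == "__main__":
        shm = shared_memory.SharedMemory(create=True, size=1000 * 8)
        data = np.ndarray((1000,), dtype=np.float64, buffer=shm.buf)
        data[:] = 0.0
        procs = [Process(target=worker, args=(shm.name, i * 500, (i + 1) * 500))
                 for i in range(2)]
        for p in procs:
            p.start()
        for p in procs:
            p.join()
        print(data.sum())  # 1000.0: both partitions were updated in place
        del data
        shm.close()
        shm.unlink()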
Cloud federation strategies improve interoperability across providers, enabling seamless service migration and management in heterogeneous systems [156]. Infrastructure as Code (IaC) automates resource provisioning, ensuring efficient and secure application deployment [157]. By combining these methods with advanced communication paradigms, cloud-native architectures offer robust fault tolerance and high resource utilisation, supporting a range of workloads from e-commerce to scientific computing.

Despite these benefits, cloud-native approaches come with their own set of challenges. Integrating PGAS and RDMA can be complex, requiring specialised hardware and in-depth expertise, which may raise costs and limit portability across diverse platforms. Deploying microservices at scale also necessitates comprehensive observability solutions to handle complex debugging and performance monitoring tasks. In multi-cloud scenarios, while the flexibility is appealing, organisations may still encounter partial vendor lock-in due to unique service integrations. Security remains a prominent concern, as misconfigurations in container orchestration or vulnerabilities within microservices can open pathways for data breaches. Additionally, the rapid pace of innovation in the cloud-native ecosystem demands continual learning and adaptation, placing pressure on both developers and operators to stay abreast of emerging tools and best practices [144]. Balancing these challenges with the clear advantages of agility, scalability, and resilience is essential for successful adoption across various industries.

4.4. Distributed AI and ML Systems

Distributed AI and ML systems are the backbone of scalable training and deployment of complex models across decentralised networks [158]. Unlike the centralised approach, this architecture distributes computation among different nodes, reducing training latency and efficiently processing large datasets [159]. This approach can optimise learning and AI inference, particularly for resource-constrained devices such as the IoT or edge computing devices used in real-time applications [160]. It aligns with the principles of federated learning, which allow for collaborative model training without the need to share raw data, thus preserving data privacy and reducing bandwidth demands [161]. By leveraging intelligent agents in a distributed environment, these systems can significantly reduce model training time while maintaining robust fault tolerance [162]. Moreover, distributed learning algorithms applied in application areas such as 6G [163] and smart grid systems [164] illustrate how these methods can optimise resource usage and enable real-time decision-making with minimal latency. Advanced variants, such as AutoDAL, enable automatic hyperparameter tuning within distributed learning frameworks, addressing scalability and efficiency challenges in large-scale data analysis [165].

Federated learning is an emerging area in distributed AI, allowing model training across decentralised devices or servers without centralising raw data. This approach improves privacy and reduces data transfer costs: models are trained locally on edge devices and only the updated parameters are sent back to central servers, as shown in Figure 8 [166]. Federated learning is especially valuable in applications with strict privacy requirements, such as healthcare and finance, where regulatory constraints limit centralised data storage [167].
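The aggregation loop at the core of federated learning can be sketched in a few lines of NumPy. The least-squares local objective, the two clients, and the uniform (unweighted) averaging below are simplifying assumptions; production federated averaging typically samples clients per round and weights updates by local dataset size.

    # Minimal federated-averaging sketch (illustrative; uniform client weights).
    import numpy as np

    rng = np.random.default_rng(0)
    w_global = np.zeros(3)  # shared model parameters

    def local_update(w, X, y, lr=0.1, steps=5):
        # A few local SGD steps on private data; raw data never leave the client.
        w = w.copy()
        for _ in range(steps):
            grad = 2 * X.T @ (X @ w - y) / len(y)  # least-squares gradient
            w -= lr * grad
        return w

    # Two clients, each holding a private (never shared) dataset.
    clients = [(rng.normal(size=(20, 3)), rng.normal(size=20)) for _ in range(2)]

    for _ in range(10):
        # Each client trains locally; only parameters reach the server.
        updates = [local_update(w_global, X, y) for X, y in clients]
        w_global = np.mean(updates, axis=0)  # server-side averaging

    print(w_global)

The privacy-preserving refinements discussed next (differential privacy, secure aggregation) modify the exchanged updates in this loop rather than the data themselves.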
However, federated systems face significant challenges in balancing privacy preservation and model accuracy. Privacy-preserving techniques, such as differential privacy and secure multi-party computation, introduce noise or encryption that can reduce model performance [168]. To address this, privacy-aware optimisation algorithms, such as those incorporating adaptive noise levels or secure aggregation protocols, have been proposed to maintain accuracy while ensuring data security [169,170]. Another critical challenge in federated systems is communication overhead, especially in scenarios involving frequent synchronisation of model updates across devices. This overhead can significantly increase latency and reduce efficiency in large-scale systems. Potential solutions include strategies like periodic aggregation, where updates are transmitted at predefined intervals rather than continuously [171], and selective model updates, which prioritise transmitting critical updates based on gradient sparsity or importance [172]. Additionally, techniques such as gradient compression and quantised updates can minimise communication costs without sacrificing accuracy, making federated learning more scalable and efficient in distributed environments [173,174]. These advancements demonstrate that federated learning can address privacy and efficiency challenges effectively, paving the way for its widespread adoption in privacy-sensitive domains.

Distributed training systems enable simultaneous model training across multiple nodes, significantly accelerating the development of complex AI models. Techniques like data parallelism, model parallelism, and pipeline parallelism optimise resource usage, making them essential for large-scale training tasks in fields like natural language processing and computer vision, where computational demands are exceptionally high [175]. By distributing workloads across multiple nodes, these systems reduce the dependence on centralised infrastructures, promoting scalable, efficient, and resource-adaptive ML [176]. Despite these advantages, distributed training systems face several challenges that limit their efficiency and effectiveness. Communication overhead, caused by frequent synchronisation of parameters across nodes, can result in increased latency and inefficient bandwidth utilisation, particularly in large-scale systems [177]. Techniques like gradient sparsification [178], optimised collective communication protocols [31], and Asynchronous Stochastic Gradient Descent (ASGD) [179,180] aim to mitigate these issues by reducing the volume of data transmitted during updates and allowing nodes to operate more independently (a minimal sketch of top-k sparsification is given below). However, these methods often struggle to maintain model accuracy due to inconsistent parameter updates [181], requiring advanced consistency management algorithms, such as dynamic weighting of updates, to address this trade-off. Another significant challenge is managing data heterogeneity: data distributed across nodes are often non-IID (non-independent and identically distributed), leading to skewed model updates that hinder training effectiveness [182]. Solutions like adaptive loss functions, dynamic weighting of local models, and frameworks such as AdaFed [183], which dynamically adjusts the contributions of local models based on data quality, improve convergence.
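To illustrate gradient sparsification, the sketch below implements a simple top-k scheme in NumPy: only the k largest-magnitude gradient entries, plus their indices, are transmitted each step. The error-feedback buffer that practical systems add to preserve accuracy is included in minimal form; the 1% density is an arbitrary illustrative choice.

    # Top-k gradient sparsification with error feedback (illustrative sketch).
    import numpy as np

    def sparsify_topk(grad, residual, density=0.01):
        # Add previously untransmitted mass, then keep only the top-k entries.
        acc = grad + residual
        k = max(1, int(density * acc.size))
        idx = np.argpartition(np.abs(acc), -k)[-k:]  # indices of top-k magnitudes
        values = acc[idx]
        new_residual = acc.copy()
        new_residual[idx] = 0.0                      # error feedback: retain the rest
        return idx, values, new_residual

    grad = np.random.default_rng(1).normal(size=1_000_000)
    residual = np.zeros_like(grad)
    idx, values, residual = sparsify_topk(grad, residual)

    # Only (idx, values) cross the network: roughly 1% of the dense volume.
    print(f"compression ratio ~{grad.nbytes / (idx.nbytes + values.nbytes):.0f}x")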
Privacy-preserving methods, such as differential privacy and secure multi-party computation, add further complexity by introducing noise or encryption to protect sensitive data, which can degrade model accuracy [184]. Privacy-aware optimisation strategies such as PSDF [185] are being developed to balance security with performance. Resource optimisation is another critical issue [186], particularly in decentralised environments with heterogeneous hardware capabilities and variable network reliability. Adaptive resource allocation frameworks that dynamically adjust computation and communication parameters based on workload demands and node capacities [187] are essential for efficient resource utilisation, but implementing them requires robust scheduling algorithms and real-time monitoring. Addressing these challenges through innovative algorithms, resource management strategies, and privacy-aware techniques is essential for unlocking the full potential of distributed training systems.

Figure 8. Step-by-step illustration of federated ML. This figure explains the federated learning process, highlighting key stages such as local model training, aggregation, and privacy-preserving updates.

In summary, distributed AI and ML systems offer transformative potential by enabling scalable, efficient, and secure training across decentralised networks. While challenges such as communication overhead, data heterogeneity, and synchronisation remain, ongoing advancements in adaptive algorithms and privacy-preserving methods continue to address these issues, paving the way for widespread adoption in sectors like healthcare, finance, and IoT.

5. Challenges in Parallel and Distributed Systems

Parallel and distributed systems have revolutionised the way computational tasks are performed, enabling the handling of complex and large-scale applications. However, these systems face several challenges that can hinder their efficiency and effectiveness. This section delves into the key challenges in parallel and distributed systems, including scalability and performance, security and privacy, fault tolerance and reliability, interoperability and standardisation, energy efficiency, and ethical concerns.

5.1. Scalability and Performance

Achieving scalability while maintaining high performance is one of the foremost challenges in parallel and distributed systems. As the number of processors or nodes increases, bottlenecks can arise due to limitations in network bandwidth, synchronisation overhead, and resource contention [188]. These issues can significantly degrade system performance, negating the benefits of adding more computational resources. A notable example of scalability challenges is in distributed AI systems, where training large-scale models like GPT-3, with its 175 billion parameters, spreads work across thousands of GPUs [189,190]. Synchronisation overhead during gradient updates can significantly impact training efficiency, especially as the number of GPUs increases [191]. Studies have shown that communication overhead during parameter updates and gradient synchronisation can dominate the training time in large-scale distributed systems, reducing the benefits of scaling out [192]. Additionally, memory bandwidth and latency constraints exacerbate the problem, reducing the overall efficiency of these systems [175].
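A back-of-the-envelope model makes this trade-off concrete. The sketch below assumes per-step compute time shrinks linearly with GPU count while synchronisation cost grows linearly with it, as in a naive parameter-server design; both constants are illustrative assumptions, not measurements.

    # Toy scaling model: speedup saturates, then degrades, as workers are added.
    compute = 1.0   # normalised single-GPU compute time per step (assumed)
    sync = 0.0005   # per-worker synchronisation cost (assumed, naive design)

    for n in [1, 8, 64, 512, 4096]:
        step_time = compute / n + sync * n  # compute shrinks, sync grows
        print(f"{n:5d} GPUs: speedup {compute / step_time:6.1f}x")

Under these assumptions, speedup peaks near 45 GPUs and then declines; collective algorithms such as ring all-reduce and the gradient compression techniques cited above exist precisely to flatten the synchronisation term.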
Addressing these challenges requires integrated solutions that consider the distinct demands of heterogeneous, quantum, neuromorphic, and optical computing paradigms. In heterogeneous computing, task scheduling algorithms ensure efficient workload distribution among diverse processing units (e.g., CPUs, GPUs, DPUs) to prevent resource underutilisation [193]. Advanced scheduling algorithms dynamically assign tasks to appropriate processors, optimising execution while controlling energy consumption [194]. For quantum computing, modular quantum architectures and hybrid quantum–classical systems help manage the scalability of qubit systems while reducing error propagation [195]. In neuromorphic computing, innovations such as photonics integration, online learning, and 3D stacking enhance the scalability of ANNs by increasing density and reducing power consumption [196]. For optical computing, material advancements such as silicon photonics and integrated photonic circuits enable the scaling of optical interconnects while minimising crosstalk and optical loss, and hardware/software co-design innovations further enhance the performance of optical computing systems [87]. Optimised communication protocols [197], dynamic resource allocation [198], and adaptive scheduling algorithms [199] improve data transfer and task management. Gradient compression techniques in distributed AI systems reduce communication delays, while optical interconnects [85] and optical wireless communication [200] provide high-bandwidth, low-latency data transfer, enhancing overall system efficiency. Together, these advancements improve workload distribution, enabling parallel and distributed systems to scale efficiently and meet the demands of complex modern applications.

5.2. Security and Privacy

Security and privacy are paramount concerns in distributed environments where data and resources are shared across multiple nodes [201]. Threats such as unauthorised access, data breaches, and malicious attacks can compromise the integrity and confidentiality of the system. Distributed systems are particularly vulnerable due to their open and interconnected nature, which can be exploited by attackers. A notable case involved a major cloud service provider experiencing downtime across its network due to a coordinated ransomware attack, resulting in financial losses exceeding USD 1.85 million and extensive recovery efforts [202]. Similarly, in parallel systems used for HPC, side-channel attacks that exploit shared memory vulnerabilities have exposed sensitive data, highlighting the need for enhanced security measures [203].

To address these challenges, robust security solutions should span hardware, software, and cryptographic advancements [204]. Encryption methods such as AES-256 and secure communication protocols like TLS ensure data protection during storage and transmission, while authentication mechanisms, including multi-factor authentication, enhance access control [205]. Zero-trust architectures and Trusted Execution Environments limit attack surfaces by isolating sensitive computations and continuously validating user and device credentials [206]. In distributed systems like blockchain, mechanisms such as Merkle trees and Proof-of-Stake consensus algorithms maintain data integrity and ensure secure transaction validation [207].
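As a concrete illustration of how Merkle trees support integrity checking, the sketch below builds a Merkle root over a list of transactions with SHA-256; changing any single transaction changes the root, which is the property blockchains exploit. Duplicating the last node on odd levels follows a common Bitcoin-style convention, one of several possible designs.

    # Minimal Merkle root construction with SHA-256 (illustrative sketch).
    import hashlib

    def sha256(data: bytes) -> bytes:
        return hashlib.sha256(data).digest()

    def merkle_root(leaves):
        # Hash each transaction, then pair-and-hash upward to a single root.
        level = [sha256(leaf) for leaf in leaves]
        while len(level) > 1:
            if len(level) % 2 == 1:
                level.append(level[-1])  # duplicate last node on odd levels
            level = [sha256(level[i] + level[i + 1])
                     for i in range(0, len(level), 2)]
        return level[0]

    txs = [b"alice->bob:5", b"bob->carol:2", b"carol->dave:7"]
    root = merkle_root(txs)
    print(root.hex())

    # Tampering with any transaction yields a different root.
    txs[1] = b"bob->carol:200"
    assert merkle_root(txs) != root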
With the advent of quantum computing, post-quantum cryptography, including lattice-based schemes, and quantum key distribution are critical for securing communications and future-proofing systems against quantum-enabled threats [208]. These solutions, when integrated into parallel and distributed systems, provide resilience against evolving cyber threats, safeguarding user privacy and ensuring system reliability.

5.3. Fault Tolerance and Reliability

Fault tolerance and reliability are critical in ensuring that parallel and distributed systems continue to operate correctly even in the presence of component failures [209]. Hardware malfunctions, network issues, or software errors can lead to system downtime or data loss, which is unacceptable in mission-critical applications. For instance, distributed systems supporting global financial transactions need to maintain uninterrupted operation despite hardware failures or network disruptions, as downtime can result in significant financial and reputational losses [210].

Many methods have been proposed to address these challenges in various distributed computing scenarios. Redundancy and replication ensure high availability and data integrity by maintaining multiple copies of critical data across nodes [206], while checkpointing periodically saves system states, enabling recovery without restarting entire processes [211]. Self-healing algorithms and dynamic task migration mitigate the impact of hardware and software failures by redistributing workloads to healthy nodes or components [212]. Similarly, modular architectures and error-correcting codes enhance the reliability of quantum systems by addressing decoherence and qubit failures [209]. Neuromorphic systems benefit from fault-tolerant designs and techniques that accommodate various types of resistive random-access memory faults [213]. Optical interconnect systems rely on the ONOS SDN controller for dynamic provisioning of data connectivity services and advanced automatic failure recovery [214]. Middleware solutions, such as those supporting distributed frameworks (e.g., Apache Spark) or blockchain consensus algorithms (e.g., Proof of Stake), enhance robustness against node failures and maintain consistency across distributed systems [210]. By integrating these strategies, parallel and distributed systems can enhance reliability, minimise disruptions, and meet the demands of modern mission-critical applications.

5.4. Interoperability and Standardisation

In heterogeneous environments where diverse systems and technologies coexist, interoperability becomes a significant challenge [215]. Orchestrating operations across different platforms, protocols, and interfaces requires careful coordination. Without standardisation, integrating new components or scaling the system can lead to incompatibilities and increased complexity. Managing heterogeneity in UHC systems, where CPUs, GPUs, NPUs, and DPUs have to collaborate seamlessly, exacerbates these challenges.

To address these challenges, adopting standardised communication protocols and resource allocation frameworks is essential [216]. Protocols like MPI and NCCL enable efficient data exchange in parallel systems [197], while resource allocation frameworks such as Kubernetes facilitate task distribution in distributed systems [144]. Middleware solutions abstract hardware and platform differences, simplifying the integration of components in heterogeneous and distributed environments [206].
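To show what such a standardised protocol looks like in practice, the sketch below uses MPI's all-reduce collective via the mpi4py bindings, the same primitive NCCL provides for GPUs. It assumes an MPI runtime and the mpi4py package are installed and is launched with, e.g., mpiexec -n 4 python allreduce_demo.py (the file name is arbitrary).

    # Portable gradient aggregation with the standard MPI all-reduce collective.
    # Requires an MPI implementation and mpi4py; run under mpiexec/mpirun.
    import numpy as np
    from mpi4py import MPI

    comm = MPI.COMM_WORLD
    rank = comm.Get_rank()

    # Each process holds a local "gradient"; Allreduce sums them on every rank.
    local_grad = np.full(4, float(rank))
    global_grad = np.empty_like(local_grad)
    comm.Allreduce(local_grad, global_grad, op=MPI.SUM)

    global_grad /= comm.Get_size()  # average across ranks
    if rank == 0:
        print("averaged gradient:", global_grad)

Because the collective is defined by the MPI standard rather than by any vendor, the same code runs unchanged across implementations, which is precisely the portability benefit standardisation is meant to deliver.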
For quantum systems, modular architectures and standardised quantum gates ensure compatibility between quantum and classical components, enabling hybrid quantum–classical workflows [217]. Neuromorphic systems require different neural coding schemes to achieve the best performance under different design constraints [218]. In optical systems, optical gates, photonic integrated circuits, and optical architectures are still evolving, and the development of standards and related protocols is ongoing [89]. Industry standards and open architectures promote interoperability, allowing diverse systems to work together while fostering collaborative innovation. For example, distributed frameworks like Apache Hadoop and TensorFlow support heterogeneous hardware, ensuring compatibility across CPUs, GPUs, and accelerators [219]. Such standardisation efforts reduce development costs, streamline integration, and enable parallel and distributed systems to scale efficiently, incorporating emerging technologies with minimal complexity.

5.5. Energy Efficiency

As parallel and distributed systems scale up, power consumption becomes a growing concern [220]. High energy usage not only increases operational costs but also has environmental implications due to the carbon footprint associated with large data centres and computing clusters. A notable example is the training of large-scale AI models like GPT-3, which reportedly consumed approximately 1287 megawatt-hours (MWh) of electricity during its training phase, emitting over 550 metric tons of carbon dioxide if powered by non-renewable sources [221]. This substantial energy use underscores the importance of implementing energy-efficient solutions across all parallel and distributed systems.

Addressing energy-efficiency challenges in parallel and distributed systems requires a holistic approach that integrates energy-efficient hardware, intelligent algorithms, dynamic power management, and sustainable infrastructure. Hardware innovations, such as neuromorphic chips like Intel's Loihi [63] and optical network processors [222,223], significantly reduce energy consumption through specialised designs and advanced technologies. Energy-aware algorithms, such as SkipTrain in decentralised learning [224], enhance efficiency by strategically skipping certain training rounds and replacing them with synchronisation rounds. Quantum algorithms like QAOA further minimise computational overhead, improving overall energy efficiency [216]. At the infrastructure level, renewable-powered data centres [220] and dynamic workload migration support sustainable operations. Additionally, techniques such as dynamic voltage and frequency scaling (DVFS) and adaptive power gating optimise energy usage by adjusting power levels based on workload demands [225]. For blockchain systems, energy-efficient consensus protocols such as Proof of Authority reduce power consumption while maintaining security and operational viability [100], helping to mitigate their environmental impact.

5.6. Emerging Ethical Concerns

As AI becomes increasingly integrated into parallel and distributed systems, ethical concerns have emerged as a critical challenge [226]. Issues such as algorithmic bias, misuse of sensitive user data, and lack of transparency in decision-making processes can undermine trust, fairness, and accountability in AI-driven systems [227].
For example, biased AI models deployed in distributed healthcare platforms can lead to unequal treatment outcomes, disproportionately disadvantaging marginalised groups [228]. Similarly, inadequate data governance in cloud-based AI systems can result in privacy violations, exposing sensitive user information to misuse or unauthorised access [229].

Addressing these challenges requires a multi-faceted approach across governance, technology, and collaboration. Robust governance frameworks and adherence to ethical guidelines throughout the AI life cycle are essential for ensuring accountability. Explainable AI (XAI) techniques can improve transparency by providing interpretable insights into decision-making processes, reducing the risk of biased or opaque outcomes [230]. Privacy-preserving technologies, such as federated learning, allow data to remain decentralised, mitigating risks associated with data misuse or breaches [167]. Federated learning has shown promise in fields like healthcare, enabling collaborative model training without compromising data privacy [231]. Additionally, interdisciplinary collaborations among technologists, ethicists, and policymakers are vital for establishing standards and policies that promote equitable and responsible AI deployment. Standards such as fairness metrics, model validation protocols, and data auditing mechanisms ensure AI systems align with ethical principles. For instance, blockchain-based audit trails can improve accountability in distributed systems by recording data usage and decision-making processes securely and transparently [232]. By integrating these strategies, parallel and distributed systems can address emerging ethical concerns, fostering trust and ensuring sustainable, equitable development.

6. Future Directions

Building on the challenges outlined in this paper, it is evident that significant advancements are still needed to overcome scalability, energy-efficiency, and security limitations in parallel and distributed systems. As these systems evolve, several emerging technologies and research areas show promise for addressing current obstacles and driving innovation. This section discusses the future directions of each class of parallel and distributed systems.

• Heterogeneous computing: As computing moves towards UHC architectures integrating diverse processors such as CPUs, GPUs, TPUs, FPGAs, and specialised accelerators, significant advancements are required to address challenges in scalability, energy efficiency, and complexity [233]. These architectures have the potential to revolutionise computing by leveraging the unique strengths of each processor type; however, their successful implementation depends on overcoming several critical obstacles. One key research direction is the development of hybrid scheduling algorithms [234]. These algorithms should dynamically adapt to varying computational demands, both online and offline, while optimising energy efficiency and performance [235]. Additionally, designing energy-aware resource management frameworks that minimise power consumption without compromising computational throughput is crucial for meeting sustainability goals [236]. Another vital area of focus is high-bandwidth, low-latency interconnect technologies, which are essential for seamless data exchange among heterogeneous components [237].
Innovations such as photonic interconnects and 3D packaging can alleviate bandwidth bottlenecks and reduce latency, enabling efficient communication between processors [238]. To enhance developer adoption and simplify programming for heterogeneous systems, further refinement of frameworks such as CUDA, OpenCL, SYCL, and oneAPI, as well as emerging unified programming models like CodeFlow [239], is essential. These frameworks should provide robust abstractions, allowing developers to harness the full potential of diverse architectures without dealing with low-level hardware complexities. Finally, synergies among quantum computing, neuromorphic systems, optical computing, and optical interconnects present exciting opportunities for future exploration. Advancing these interdisciplinary technologies will be critical in shaping the next generation of high-performance, energy-efficient computing architectures.

• Quantum computing: The future trajectory of quantum computing is shaped by several critical technological and practical imperatives. At the hardware level, the ongoing development of diverse qubit technologies, including superconducting, silicon-based, trapped-ion, and photonic implementations, remains essential for advancing quantum computing capabilities [36,42,43]. While these platforms have demonstrated significant progress, challenges such as noise, high error rates, and decoherence must be effectively addressed to realise practical quantum advantage [37]. Current quantum error correction protocols require substantial qubit overhead, necessitating innovative approaches that can scale efficiently with system size [240]. Industry roadmaps, such as IBM's plan to develop processors with thousands of qubits [46], highlight the importance of achieving fault tolerance while maintaining quantum coherence across larger qubit arrays. The integration of quantum computing with classical computing represents a promising direction for near-term applications. Hybrid quantum–classical systems, particularly in ML and optimisation tasks, can leverage the complementary strengths of both paradigms [217]. To facilitate broader adoption, the field must address interconnected challenges, including quantum infrastructure development. Establishing robust quantum networking protocols and leveraging optical interconnects will be crucial for scaling quantum systems beyond single-processor implementations [195]. Additionally, the development of standardised quantum software frameworks and advanced error mitigation techniques will be instrumental in enhancing accessibility and usability [240]. Beyond technical advancements, the socioeconomic implications of quantum computing warrant careful consideration. The transformative potential of quantum technologies spans multiple industries, with significant applications in cryptography [47] and molecular simulation [48]. Ensuring equitable access to quantum resources and fostering a skilled quantum workforce will be critical in maximising the societal benefits of quantum computing across diverse sectors and regions.

• Neuromorphic computing: Inspired by the brain's architecture, neuromorphic computing is rapidly emerging as a promising solution for achieving energy-efficient, event-driven processing, particularly in AI and ML tasks [53].
Despite its potential, scalability remains a significant hurdle, as building larger neuromorphic systems demands advancements in technological infrastructure, development tools, and integration strategies [59]. Future progress should focus on enhancing the programmability of neuromorphic hardware to enable larger, more complex systems capable of addressing diverse AI and ML workloads [241]. This includes improving the flexibility and accessibility of programming environments to facilitate adoption by a broader range of developers and researchers. In parallel, the development of SNNs as foundational algorithms requires further exploration, particularly in areas such as backpropagation [56] and online learning [242], to enhance their adaptability, scalability, and real-time performance. The practical adoption of neuromorphic hardware faces challenges such as the lack of standardised protocols and the high costs of chip fabrication. Initiatives like Intel's Loihi 2 platform have demonstrated progress in commercialising neuromorphic computing [65], but broader collaboration among academia, industry, and policymakers will be necessary to standardise frameworks, reduce costs, and accelerate adoption. Integrating neuromorphic computing with photonics presents a promising avenue for addressing key challenges, including scalability, energy efficiency, precision, and standardised performance benchmarks [196]. As the technology evolves, addressing ethical concerns and promoting the responsible use of brain-inspired systems will be critical [243]. Ensuring equitable access, avoiding misuse, and fostering transparency in neuromorphic applications will help ensure that the technology benefits society responsibly.

• Optical computing: The future of optical computing holds transformative potential for meeting the escalating demands of modern computing systems, particularly in AI, telecommunications, and HPC [84]. Advancing this technology requires addressing several critical research challenges through innovative solutions and interdisciplinary collaboration. A key research direction is the development of next-generation photonic integrated circuits, with a particular focus on advancing core components such as MRRs and MZIs [238]. These components must evolve to meet stringent requirements for scalability, efficiency, and reliability. The advancement of all-optical processing presents promising opportunities, including the development of optical gates and logic units, high-bit-rate signal processing, and optical quantum computing [89]. High-performance optical interconnects offer significant advantages over traditional electrical interconnects, enabling efficient data transmission in large-scale systems such as data centres, supercomputers, and quantum networks [85]. Industry adoption is already underway, as demonstrated by Google's integration of photonic components in data centres and the emergence of optical neural network research prototypes [88]. In the quantum computing domain, optical components play a crucial role in facilitating high-bandwidth communication between quantum processors, addressing key challenges related to quantum network scalability and efficiency [195]. To accelerate the practical deployment of optical computing systems, research efforts should focus on three key areas: miniaturisation techniques, advanced materials development, and scalable manufacturing processes.
These technological advancements are essential for achieving cost-effective, energy-efficient solutions that can expand access to HPC capabilities. This expansion is particularly crucial for small and medium-sized enterprises and academic institutions, which stand to benefit significantly from more accessible advanced computing resources. As optical computing technologies mature, they are poised to revolutionise industries by delivering unprecedented computational power, sustainability, and accessibility. This evolution represents a major step toward meeting the growing computational demands of modern society while aligning with global sustainability goals.

• Blockchain and distributed ledgers: Blockchain and DLTs present a decentralised, tamper-resistant way to ensure security and transparency in distributed systems [102]. These technologies eliminate intermediaries and offer immutable transaction records, enabling trustless environments in applications like cloud computing, the IoT, and supply chain management. However, challenges such as latency, high energy consumption in Proof-of-Work-based systems, and cold-start delays hinder their scalability and responsiveness. Future research should prioritise the development of scalable blockchain architectures with energy-efficient consensus mechanisms [244]. Innovations such as Proof of Stake and sharding can significantly reduce energy consumption while maintaining robust security and enabling high transaction throughput [245]. These advancements are essential to ensuring blockchain's feasibility in real-time applications and resource-constrained environments. Another promising direction is the creation of tailored blockchain frameworks for specific distributed computing applications. Decentralised file systems, for example, can leverage blockchain to ensure data availability, integrity, and secure sharing [246], while decentralised cloud services can benefit from blockchain's capabilities in managing resource allocation and security [113]. Interoperability among blockchain networks is another key area, requiring standardised protocols and cross-chain communication to enable multi-platform applications. Practical use cases, such as supply chain management and the IoT, already demonstrate blockchain's potential to enhance traceability, secure resource sharing, and improve trust [110]. Efforts to minimise blockchain's environmental impact through energy-efficient mechanisms and green blockchain initiatives further align with global sustainability goals. By addressing these challenges, blockchain and DLTs can revolutionise distributed systems, transforming how data integrity, transparency, and trust are managed across industries.

• Serverless computing: Serverless computing, which abstracts infrastructure management and allows developers to focus solely on code execution, is emerging as a transformative paradigm in parallel and distributed systems. By automatically scaling based on demand, serverless architectures are particularly well suited for distributed applications with highly variable workloads, providing cost efficiency, flexibility, and ease of deployment [134]. However, serverless computing faces challenges such as cold-start latency (the delay associated with initialising idle functions) and difficulties in managing stateful, resource-intensive applications [132,135]. Future advancements should address these limitations.
Improving the latency and scalability of serverless frameworks is essential, particularly for HPC and real-time distributed systems [132]. Fine-grained resource management techniques and enhanced serverless orchestration mechanisms are needed to efficiently handle parallel tasks across distributed nodes, ensuring optimised workload distribution and responsiveness [236]. Serverless systems show significant potential in AI/ML workflows, enabling seamless deployment of ML models and distributed training pipelines [247]. Their adoption in multi-cloud environments can ensure interoperability across cloud platforms, reducing vendor lock-in and improving resource utilisation [248]. Additionally, techniques like container pre-warming, lightweight virtualisation, and predictive scaling can mitigate cold-start issues, making serverless computing viable for latency-sensitive and resource-constrained environments [132]. By overcoming these challenges, serverless computing can significantly contribute to the evolution of parallel and distributed systems, enabling more scalable, efficient, and adaptable architectures across a wide range of industries.

• Cloud-native architectures: Cloud-native architectures are transforming distributed computing by leveraging microservices, containerisation, and orchestration tools like Kubernetes to enable auto-scaling, fault tolerance, and resilience. By decomposing applications into smaller, independent components, these architectures provide flexibility and adaptability, ensuring consistent performance even under varying workload demands [146]. Future advancements should enhance the coordination and orchestration of microservices to ensure data consistency across geographically dispersed cloud resources. For instance, an optimised communication solution has been proposed to enhance inter-service communication in microservices [249]. Synergies with large generative AI models are essential to enable dynamic load balancing between cloud and edge nodes, optimising the cost of goods sold and improving resource accessibility [250]. Multi-cloud orchestration initiatives, such as the expansion of the Kubernetes ecosystem [144] and platforms like Google's Anthos [251], demonstrate the feasibility of cross-cloud collaboration for managing complex workloads. Energy efficiency is a critical challenge as cloud-native systems scale. Green computing strategies, such as intelligent container scheduling and life-cycle management, can reduce energy consumption and environmental impact [225]. Additionally, improved container orchestration algorithms that dynamically allocate resources are vital for aligning these architectures with sustainability goals [252]. Security and privacy are paramount due to the decentralised nature of microservices