Event-Driven Architectures for Ultra-Low-Latency Systems: Design Principles, Reference Patterns, and Evaluation

Authors

  • Nikhil Kokal

DOI:

https://doi.org/10.22399/ijcesen.4303

Keywords:

Event-driven architecture, ultra-low latency, RDMA, kernel bypass, tail latency

Abstract

Ultra-low-latency, event-driven systems have strict end-to-end latency demands in the range of microseconds or sub-milliseconds for applications in high-frequency trading, augmented realities, teleoperation, and real-time machine learning inference. Current applications require not only reductions in median latency performance, but also tight control on tail latency to meet service-level objectives and maintain operational safety. Recent advances in kernel-bypass networking (DPDK, AF_XDP), persistent RDMA semantics, FPGAs as fabrics, and deterministic scheduling provide powerful ways to achieve these demands, but there does not yet exist an integrated framework that covers all of these techniques. This article builds on five years of systems research and organizes the findings into a usable taxonomy that maps sources of latency, namely: network traversal, kernel overheads, serialization, scheduling delays, and distributed state management, to architectural levers and supporting systems. Three example architectures are shown: a single-node user-space architecture, a rack-scale PCIe/FPGA architecture, and a DRAM-assisted architecture. It also presents application-level implementation patterns for users to achieve similar latency outcomes, focusing on zero-copy paths, NIC offload, and placement that preserves locality. It also gives reproducible evaluation techniques associated with the implementation patterns in this development. Finally, case study examples from high-frequency trading and edge-based machine learning inference are presented in which single-digit microsecond latency is feasible while also explaining fundamental trade-offs among latency, maintainability, and hardware specialization.

References

[1] Lászlo Blázovics, et al., "Low Latency Video Streaming System for VR Teleoperation over 5G Networks," in 2023 IEEE 24th International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS), Dec. 24, 2024. Available: https://ieeexplore.ieee.org/document/10796131

[2] Maxim Susloparov, et al., "Providing High Capacity for AR/VR Traffic in 5G Systems with Multi-Band Resource Aggregation," in 2022 IEEE Global Communications Conference (GLOBECOM), Aug. 24, 2022. Available: https://ieeexplore.ieee.org/document/9858230

[3] Killian Castillon du Perron, et al., "Understanding Delays in AF_XDP-based Applications," in 2023 IEEE 10th International Conference on Network Softwarization (NetSoft), Aug. 20, 2024. Available: https://ieeexplore.ieee.org/document/10622351

[4] Zhaoyi Li, et al., "Achieving Low Latency for Multipath Transmission in RDMA-Based Data Centers," in 2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS), Feb. 13, 2024. Available: https://ieeexplore.ieee.org/document/10433770

[5] Debendra Das Sharma, "PCI Express® 6.0 Specification at 64.0 GT/s with PAM-4 Signaling," in 2020 IEEE Symposium on High-Performance Interconnects (HOTI), Sep. 09, 2020. Available: https://ieeexplore.ieee.org/document/9188289

[6] Ludwig Thomeczek, et al., "Measuring Safety Critical Latency Sources using Linux Kernel eBPF Tracing," in 2019 IEEE 22nd International Symposium on Real-Time Distributed Computing (ISORC), Sep. 16, 2019. Available: https://ieeexplore.ieee.org/document/8836200

[7] Jude Haris, et al., "SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference," in 2022 IEEE 30th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), Dec. 28, 2021. Available: https://ieeexplore.ieee.org/document/9651579

[8] Mengting Zhang, et al., "RoSR: A Novel Selective Retransmission FPGA Architecture for RDMA NICs Supporting Out-of-Order Packets," in 2023 IEEE 31st International Conference on Network Protocols (ICNP), Jul. 31, 2025. Available: https://ieeexplore.ieee.org/document/11106222

[9] Marta Andronic and George A. Constantinides, "PolyLUT: Learning Piecewise Polynomials for Ultra-Low Latency FPGA LUT Inference," in 2023 IEEE 31st International Conference on Field-Programmable Logic and Applications (FPL), Feb. 01, 2024. Available: https://ieeexplore.ieee.org/document/10416099

[10] Deep Gupta, et al., "FPGA for High-Frequency Trading: Reducing Latency in Financial Systems," in 2023 IEEE International Conference on High Performance Switching and Routing (HPSR), Jan. 17, 2025. Available: https://ieeexplore.ieee.org/document/10841781

Downloads

Published

2025-11-14

How to Cite

Nikhil Kokal. (2025). Event-Driven Architectures for Ultra-Low-Latency Systems: Design Principles, Reference Patterns, and Evaluation. International Journal of Computational and Experimental Science and Engineering, 11(4). https://doi.org/10.22399/ijcesen.4303

Issue

Section

Research Article