Enhancing Network-on-Chip Performance: Advanced MMU Techniques for Lower Latency and Higher Bandwidth

Debasis Behera; Suvendu Naraya Mishra

doi:10.22399/ijcesen.2556

Authors

Debasis Behera Convener
Suvendu Naraya Mishra

DOI:

https://doi.org/10.22399/ijcesen.2556

Keywords:

Network-on-Chip (NoC), Memory Management Unit, Latency Optimization, Bandwidth Utilization, TLB Caching

Abstract

As high-performance computing systems grow more complex, Network-on-Chip (NoC) architectures face critical performance bottlenecks caused by memory management latency and inefficient bandwidth utilization. This research presents a novel, mathematically rigorous framework for optimizing NoC performance through advanced Memory Management Unit (MMU) techniques, specifically Translation Lookaside Buffer (TLB) caching and hybrid address mapping. The study develops symbolic models that express latency and bandwidth as optimization functions, accounting for memory translation delays and dynamic workload patterns. Using discrete-event simulation based on analytically defined assumptions about traffic and MMU behavior, we evaluate performance across various configurations. Our results indicate that hybrid address mapping yields up to 30.7% lower latency and a 32% gain in bandwidth efficiency, while TLB caching provides a 26.1% latency improvement and a 27.3% increase in throughput. These findings, derived under theoretical constraints, demonstrate the potential of MMU-level optimizations to significantly enhance NoC system performance. The proposed model serves as a foundational tool for future adaptive and scalable memory management strategies in edge computing, real-time systems, and data-intensive applications.
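To make the idea of modeling translation latency concrete, the following is a minimal sketch of the textbook effective-access-time formula for a single-level TLB. It is not the symbolic model developed in the paper, which the abstract does not reproduce. The function names, parameter names, and all cycle counts are hypothetical and chosen only for illustration.

```python
def effective_translation_latency(hit_rate, tlb_hit_cycles, page_walk_cycles):
    """Average cycles per address translation for a single-level TLB.

    Textbook model, assumed here for illustration only:
    a hit costs the TLB lookup; a miss costs the lookup plus a page walk.
    """
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be in [0, 1]")
    miss_cycles = tlb_hit_cycles + page_walk_cycles
    return hit_rate * tlb_hit_cycles + (1.0 - hit_rate) * miss_cycles


def latency_reduction(baseline_cycles, improved_cycles):
    """Fractional latency reduction of `improved_cycles` relative to `baseline_cycles`."""
    if baseline_cycles <= 0:
        raise ValueError("baseline_cycles must be positive")
    return (baseline_cycles - improved_cycles) / baseline_cycles


if __name__ == "__main__":
    # Hypothetical numbers: 1-cycle TLB lookup, 100-cycle page walk.
    no_tlb = effective_translation_latency(0.0, 1, 100)    # every access walks the page table
    with_tlb = effective_translation_latency(0.5, 1, 100)  # half of the accesses hit the TLB
    print(f"without caching: {no_tlb} cycles, with caching: {with_tlb} cycles")
    print(f"reduction: {latency_reduction(no_tlb, with_tlb):.1%}")
```

Under this simple model the average translation cost falls linearly as the hit rate rises, which is why TLB caching can reduce end-to-end NoC latency when translation sits on the critical path of a memory request.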
