"This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by the authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder."
Conference/Journal Publications
- Yiltan Hassan Temucin, Whit Schonbein, Scott Levy, Amirhossein Sojoodi, Ryan Grant, and Ahmad Afsahi, "Design and Implementation of MPI-Native GPU-Initiated MPI Partitioned Communication", 12th Workshop on Extreme Scale MPI (ExaMPI), Atlanta, GA, USA, Nov. 17, 2024.
- Hamed Sharifian, Amirhossein Sojoodi, and Ahmad Afsahi, "A Topology- and Load-aware Design for Neighborhood Allgather", 26th IEEE International Conference on Cluster Computing (Cluster 2024), Kobe, Japan, Sept 24 - 27, 2024 © IEEE (acceptance rate: 26.8%, 39/145)
- Yiltan Hassan Temucin, Mahdieh Ghazimirsaeed, Ryan Grant, and Ahmad Afsahi, "ROCm-Aware Leader-based Designs for MPI Neighborhood Collectives", International Supercomputing Conference (ISC) High Performance 2024, Hamburg, Germany, May 12 - 16, 2024. © IEEE (acceptance rate: 30%, 24/80)
- Amirhossein Sojoodi, Yiltan Hassan Temucin, and Ahmad Afsahi, "Enhancing Intra-node GPU-to-GPU Performance in MPI+UCX through Multi-Path Communication", 3rd International Workshop on Extreme Heterogeneity Solutions (ExHET) 2024, Edinburgh, UK, Mar 3, 2024. © ACM Best Paper Award
- Yiltan Hassan Temucin, Scott Levy, Whit Schonbein, Ryan Grant, and Ahmad Afsahi, "A Dynamic Network-Native MPI Partitioned Aggregation over InfiniBand Verbs", 25th IEEE Cluster (Cluster 2023), Santa Fe, NM, USA, Oct 31 - Nov 1, 2023. © IEEE (acceptance rate: 24.6%, 32/130) Best Paper Award
- Pedram Alizadeh, AmirHossein Sojoodi, Yiltan Hassan Temucin, and Ahmad Afsahi, "Efficient Process Arrival Pattern Aware Collective Communication for Deep Learning", 29th EuroMPI/USA Conference, Chattanooga, TN, USA, Sept. 26-28, 2022. © ACM
- Yiltan Hassan Temucin, Ryan Grant, and Ahmad Afsahi, "Micro-benchmarking MPI Partitioned Point-to-Point Communication", 51st ACM International Conference on Parallel Processing (ICPP), Bordeaux, France, Aug 29 - Sept 1, 2022. © ACM (acceptance rate: 27%, 84/311)
- Yiltan Hassan Temucin, AmirHossein Sojoodi, Pedram Alizadeh, Benjamin Kitor, and Ahmad Afsahi, "Accelerating Deep Learning using Interconnect-Aware UCX Communication for MPI Collectives", IEEE Micro, 42(2):68-76, Mar-Apr 1, 2022. © IEEE
- Kaushal Kumar, Judicael A. Zounmevo, and Ahmad Afsahi, "SmartInterrupts: A Node-Wide Asynchronous Message Progression Technique", 29th EuroMPI Conference, Garching, Munich, Germany, Sept. 7-9, 2021. Best Paper Award Nominee
- Yiltan Hassan Temucin, AmirHossein Sojoodi, Pedram Alizadeh, and Ahmad Afsahi, "Efficient Multi-Path NVLink/PCIe-Aware UCX based Collective Communication for Deep Learning", 28th IEEE Hot Interconnects Symposium (HotI), August 18-20, 2021. © IEEE
- Mahdieh Ghazimirsaeed, Hessam Mirsadeghi, and Ahmad Afsahi, "Communication-Aware Message Matching in MPI", Concurrency and Computation: Practice and Experience, Vol. 32, Issue 3, pp. 1-17, 2020; first presented at the 5th Workshop on Exascale MPI (ExaMPI), Denver, CO, Nov. 12, 2017. © Wiley
- Mahdieh Ghazimirsaeed, Ryan Grant, and Ahmad Afsahi, "A Dynamic, Unified Design for Dedicated Message Matching Engines for Collective and Point-to-Point Communications", Parallel Computing (PARCO), Volume 89, Nov 1, 2019. © Elsevier
- Mahdieh Ghazimirsaeed, Hessam Mirsadeghi, and Ahmad Afsahi, "An Efficient Collaborative Communication Mechanism for MPI Neighborhood Collectives", 33rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rio de Janeiro, Brazil, May 20 - 24, 2019. © IEEE (acceptance rate: 27.6%, 103/372)
- Matthew Dosanjh, Whit Schonbein, Ryan Grant, Patrick Bridges, Mahdieh Ghazimirsaeed, and Ahmad Afsahi, "Fuzzy Matching: Hardware Accelerated MPI Communication Middleware", 19th Annual IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGrid 2019), Larnaca, Cyprus, May 14 - 17, 2019. © IEEE (acceptance rate: 22.7%, 47/207), Best Paper Award Nominee
- Iman Faraji and Ahmad Afsahi, "Design Considerations for GPU-Aware Collective Communications in MPI", Concurrency and Computation: Practice and Experience (CCPE), Vol. 30, Issue 17, September 10, 2018. © Wiley
- Mahdieh Ghazimirsaeed, Ryan E. Grant, and Ahmad Afsahi, "A Dedicated Message Matching Mechanism for Collective Communications", 11th International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2 2018), Eugene, OR, Aug. 13, 2018. © ACM
- Matthew G. F. Dosanjh, Mahdieh Ghazimirsaeed, Ryan E. Grant, Whit Schonbein, Michael J. Levenhagen, Patrick G. Bridges, and Ahmad Afsahi, "The Case for Semi-Permanent Cache Occupancy, Understanding the Impact of Data Locality on Network Processing", 47th International Conference on Parallel Processing (ICPP 2018), Eugene, OR, Aug. 13 - 16, 2018. © ACM (acceptance rate: 29%, 91/313)
- Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, and Ahmad Afsahi, "Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives", 24th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2017), Jaipur, India, Dec. 18-21, 2017. (acceptance rate: 23%, 42/184) © IEEE
- Iman Faraji, Hessam Mirsadeghi, and Ahmad Afsahi, "Exploiting Heterogeneity of Communication Channels for Efficient GPU Selection on Multi-GPU Nodes", Parallel Computing (PARCO), Volume 68, Oct. 2017, pp. 3-16. © Elsevier
- Mahdieh Ghazimirsaeed and Ahmad Afsahi, "Accelerating MPI Message Matching by a Data Clustering Strategy", High Performance Computing Symposium (HPCS 2017), Kingston, ON, June 6-9, 2017. Lecture Notes in Computer Science (LNCS). © Springer
- Hessam Mirsadeghi, Iman Faraji, and Ahmad Afsahi, "MAGC: A Mapping Approach for GPU Clusters", 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2016), Los Angeles, CA, Oct. 26-28, 2016. (acceptance rate: 35%, 27/77) © IEEE
- Hessam Mirsadeghi and Ahmad Afsahi, "PTRAM: A Parallel Topology- and Routing-Aware Mapping Framework for Large-Scale HPC Systems", 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS 2016), Chicago, IL, May 23, 2016. © IEEE
- Iman Faraji, Hessam Mirsadeghi, and Ahmad Afsahi, "Topology-Aware GPU Selection on Multi-GPU Nodes", Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES 2016), Chicago, IL, May 23, 2016. © IEEE Best Paper Award
- Hessam Mirsadeghi and Ahmad Afsahi, "Topology-Aware Rank Reordering for MPI Collectives", First Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware (IPDRM 2016), Chicago, IL, May 27, 2016. © IEEE
- Iman Faraji and Ahmad Afsahi, "Hyper-Q Aware Intranode MPI Collectives on the GPU", International Workshop on Extreme Scale Programming Models and Middleware (ESPM2 2015), Austin, TX, Nov. 15, 2015. © ACM/IEEE
- Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, and Ahmad Afsahi, "Scalable Connectionless RDMA over Unreliable Datagrams", Parallel Computing (PARCO), Volume 48, Oct. 2015, pp. 15-39. © Elsevier
- Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, and Ahmad Afsahi, "Scalable Network Communication using Unreliable RDMA", Book Chapter, Handbook on Data Centers, Editors: Samee U. Khan and Albert Y. Zomaya, March 17, 2015. ISBN: 978-1-4939-2091-4 (Print) 978-1-4939-2092-1 (Online) © Springer
- Judicael A. Zounmevo, Dries Kimpe, Robert Ross, and Ahmad Afsahi, "Extreme-scale Computing Services over MPI: Experiences, Observations and Feature Proposal for Next Generation Message Passing Interface", International Journal of High Performance Computing Applications (IJHPCA), Volume 28, No. 4, Nov. 2014, pp. 435-449. © SAGE
- Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, and Ahmad Afsahi, "Nonblocking Epochs in MPI one-sided Communication", 2014 International Conference for High Performance Computing, Networking, Storage and Analysis (Supercomputing 2014), New Orleans, LA, Nov. 16-21, 2014. (acceptance rate: 20.8%, 82/394) © ACM/IEEE Best Paper Award Finalist (top 6 papers)
- Iman Faraji and Ahmad Afsahi, "GPU-Aware Intranode MPI_Allreduce", 21st EuroMPI 2014, Kyoto, Japan, Sept. 9-12, 2014. © ACM
- Judicael A. Zounmevo and Ahmad Afsahi, "Intra-Epoch Message Scheduling to Exploit Unused or Residual Overlapping Potential", 21st EuroMPI 2014, Kyoto, Japan, Sept. 9-12, 2014. © ACM
- Judicael A. Zounmevo and Ahmad Afsahi, "A Fast and Resource-Conscious MPI Message Queue Mechanism for Large-Scale Jobs", Future Generation Computer Systems (FGCS), 30(1):265-290, 2014. © Elsevier
- Jerome Soumagne, Dries Kimpe, Judicael Zounmevo, Mohamad Chaarawi, Quincey Koziol, Ahmad Afsahi, and Robert Ross, "Mercury: Enabling Remote Procedure Call for High-Performance Computing", 15th IEEE International Conference on Cluster Computing (Cluster 2013), Indianapolis, IN, Sept. 23-27, 2013. © IEEE (acceptance rate: 31%, 46/147)
- Judicael A. Zounmevo, Dries Kimpe, Robert Ross, and Ahmad Afsahi, "On the use of MPI in High-Performance Computing Services", 20th EuroMPI 2013, Madrid, Spain, Sept. 15-18, 2013. © ACM
- Xin Zhao, Darius Buntinas, Judicael Zounmevo, James Dinan, David Goodell, Pavan Balaji, Rajeev Thakur, Ahmad Afsahi, and William Gropp, "Towards Generalized, Asynchronous, and MPI-Interoperable Active Messages", 13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2013), Delft, The Netherlands, May 13-16, 2013. © IEEE/ACM (acceptance rate: 22.2%, 57/257)
- Judicael A. Zounmevo and Ahmad Afsahi, "An Efficient MPI Message Queue Mechanism for Large-scale Jobs", 18th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2012), Singapore, Dec. 17-19, 2012. © IEEE (acceptance rate: 29.6%, 87/294)
- Grigori Inozemtsev and Ahmad Afsahi, "Designing an Offloaded Nonblocking MPI_Allgather Collective using CORE-Direct", 14th IEEE International Conference on Cluster Computing (Cluster 2012), Beijing, China, Sept. 24-28, 2012. © IEEE (acceptance rate: 28.9%, 58/201)
- Reza Zamani and Ahmad Afsahi, "A Study of Hardware Performance Monitoring Counter Selection in Power Modeling of Computing Systems", 2nd International Workshop on Power Measurement and Profiling (PMP 2012), San Jose, CA, June 5-8, 2012. © IEEE
- Mohammad J. Rashti and Ahmad Afsahi, "Exploiting Application Buffer Reuse to Improve MPI Small Message Transfer protocols over RDMA-enabled Networks", Cluster Computing, The Journal of Networks, Software Tools and Applications, Volume 14, Number 4, Dec. 2011, pp. 345-356. © Springer
- Judicael A. Zounmevo and Ahmad Afsahi, "Investigating Scenario-conscious Asynchronous Rendezvous over RDMA", 13th IEEE International Conference on Cluster Computing (Cluster 2011), Austin, TX, Sept. 26-30, 2011. © IEEE
- Mohammad J. Rashti, Jonathan Green, Pavan Balaji, Ahmad Afsahi and William Gropp, "Multi-core and Network Aware MPI Topology Functions", 18th EuroMPI conference, Recent Advances in the Message Passing Interface (EuroMPI 2011), Santorini, Greece, Sept. 18-21, 2011. © Springer.
- Ying Qian and Ahmad Afsahi, "Process Arrival Pattern Aware Alltoall and Allgather on InfiniBand Clusters", International Journal of Parallel Programming, Volume 39, No. 4, Aug. 2011, pp. 473-493. © Springer
- Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, and Ahmad Afsahi, "RDMA Capable iWARP over Datagrams", 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2011), Anchorage, AK, May 16-20, 2011. © IEEE (acceptance rate: 19.6%, 112/571)
- Mohammad J. Rashti, Ryan E. Grant, Pavan Balaji, and Ahmad Afsahi, "iWARP Redefined: Scalable Connectionless Communication over High-Speed Ethernet", 17th International Conference on High Performance Computing (HiPC 2010), Goa, India, Dec. 19-22, 2010. © IEEE (acceptance rate: 19.2%, 40/208)
- Reza Zamani and Ahmad Afsahi, "Adaptive Estimation and Prediction of Power and Performance in High Performance Computing", International Conference on Energy-Aware High Performance Computing, Sept. 16-17, 2010, Hamburg, Germany. Journal of Computer Science - Research and Development, Vol. 25, No. 3-4, pp. 177-186. © Springer
- Ryan E. Grant, Pavan Balaji, and Ahmad Afsahi, "A Study of Hardware Assisted IP over InfiniBand and its Impact on Data Center Performance", 2010 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2010), White Plains, NY, Mar. 28-30, 2010. © IEEE (acceptance rate: 33%, 22/66)
- Mohammad J. Rashti and Ahmad Afsahi, "Modern Interconnects for High-Performance Computing Clusters", Book Chapter, Cluster Computing and Multi-Hop Network Research, Eds: Ciceron Jimenez and Maurice Ortego, 2010, © Nova Science Publishers, Inc.
- Ryan E. Grant and Ahmad Afsahi, "Improving Energy Efficiency of Asymmetric Chip Multithreaded Multiprocessors through Reduced OS Noise Scheduling", Concurrency and Computation: Practice and Experience (CCPE), Volume 21, Issue 18, pp. 2355-2376, Dec. 25, 2009. © Wiley
- Ryan E. Grant, Ahmad Afsahi, and Pavan Balaji, "Evaluation of ConnectX Virtual Protocol Interconnect for Data Centers", 15th International Conference on Parallel and Distributed Systems (ICPADS 2009), Shenzhen, China, Dec. 8-11, 2009. © IEEE (acceptance rate: 29.8%, 91/305)
- Ying Qian and Ahmad Afsahi, "Process Arrival Pattern and Shared Memory Aware Alltoall on InfiniBand", 16th EuroPVM/MPI, Espoo, Finland, Sept. 7-10, 2009, Lecture Notes in Computer Science (LNCS 5759), pp. 250-260. © Springer
- Mohammad J. Rashti and Ahmad Afsahi, "Improving RDMA-based MPI Eager Protocol for Frequently-used Buffers", 9th Workshop on Communication Architecture for Clusters (CAC 2009), in conjunction with the 23rd International Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, May 25-29, 2009. © IEEE
- Mohammad J. Rashti and Ahmad Afsahi, "A Speculative and Adaptive MPI Rendezvous Protocol over RDMA-enabled Interconnects", International Journal of Parallel Programming, Volume 37, No. 2, Apr. 2009, pp. 223-246. © Springer
- Ying Qian and Ahmad Afsahi, "Efficient Shared Memory and RDMA based Collectives on Multi-rail QsNetII SMP Clusters", Cluster Computing, The Journal of Networks, Software Tools and Applications, Volume 11, No. 4, Dec. 2008, pp. 341-354. © Springer.
- Ying Qian, Mohammad J. Rashti, and Ahmad Afsahi, "Multi-connection and Multi-core Aware All-Gather on InfiniBand Clusters", 20th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2008), Orlando, Florida, USA, Nov. 16 – 18, 2008. © ACTA Press Best Paper Award in the area of Software Systems and tools
- Ryan E. Grant, Mohammad J. Rashti, and Ahmad Afsahi, "An Analysis of QoS Provisioning for Sockets Direct Protocol vs. IPoIB over Modern InfiniBand Networks", International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), in conjunction with the 37th International Conference on Parallel Processing (ICPP 2008), Portland, OR, Sept. 12, 2008. © IEEE (acceptance rate: 45%, 9/20)
- Mohammad J. Rashti and Ahmad Afsahi, "Improving Communication Progress and Overlap in MPI Rendezvous Protocol over RDMA-enabled Interconnects", 22nd International Symposium on High Performance Computing Systems and Applications (HPCS 2008), Quebec City, QC, June 9-11, 2008. © IEEE
- Reza Zamani, Ahmad Afsahi, Ying Qian, and Carl Hamacher, "A Feasibility Analysis of Power-Awareness and Energy Minimization in Modern Interconnects for High-Performance Computing", 9th IEEE International Conference on Cluster Computing (Cluster 2007), Austin, TX, Sept. 17-20, 2007. © IEEE
- Ryan E. Grant and Ahmad Afsahi, "Improving System Efficiency through Scheduling and Power Management", International Workshop on Green Computing (GreenCom'07), invited paper, work-in-progress session, in conjunction with the 9th IEEE International Conference on Cluster Computing (Cluster 2007), Austin, TX, Sept. 17, 2007. © IEEE
- Ying Qian and Ahmad Afsahi, "RDMA-based and SMP-aware Multi-port All-gather on Multi-rail QsNetII SMP Clusters", 36th International Conference on Parallel Processing (ICPP 2007), XiAn, China, Sept. 10-14, 2007. © IEEE
- Mohammad J. Rashti and Ahmad Afsahi, "Assessing the Ability of Computation/Communication Overlap and Communication Progress in Modern Interconnects", 15th Annual IEEE Symposium on High-Performance Interconnects (Hot Interconnects 2007), Palo Alto, CA, Aug. 22-24, 2007, pp. 117-124. © IEEE
- Ying Qian and Ahmad Afsahi, "High Performance RDMA-based Multi-port All-gather on Multi-rail QsNetII", 21st International Symposium on High Performance Computing Systems and Applications (HPCS 2007), Saskatoon, SK, May 13-16, 2007. © IEEE
- Mohammad J. Rashti and Ahmad Afsahi, "10-Gigabit iWARP Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G", 7th Workshop on Communication Architecture for Clusters (CAC 2007), in conjunction with the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, Mar. 26-30, 2007. © IEEE (acceptance rate: 32%, 10/31)
- Ryan E. Grant and Ahmad Afsahi, "A Comprehensive Analysis of Multithreaded OpenMP Applications on Dual-Core Intel Xeon SMPs", Workshop on Multithreaded Architectures and Applications (MTAAP'07), in conjunction with the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, Mar. 26-30, 2007. © IEEE
- Ying Qian, Ahmad Afsahi, Nathan R. Fredrickson, and Reza Zamani, "Performance Evaluation of the Sun Fire Link SMP Clusters", International Journal of High Performance Computing and Networking (IJHPCN), 2006, Volume 4, No. 5/6, pp. 209-221. © Inderscience
- Mohammad J. Rashti and Ahmad Afsahi, "NetEffect PCI-Express 10-Gigabit iWARP Ethernet: A Performance Study", White Paper, November, 2006. Also, available at NetEffect, Inc. (www.neteffect.com) and TechOnline (www.techonline.com)
- Ryan E. Grant and Ahmad Afsahi, "Power-Performance Efficiency of Asymmetric Multiprocessors for Multi-threaded Scientific Applications", 2nd Workshop on High-Performance, Power-Aware Computing (HP-PAC 2006), in conjunction with the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), Rhodes Island, Greece, Apr. 25-29, 2006. © IEEE (acceptance rate: 50%, 9/18)
- Ying Qian and Ahmad Afsahi, "Efficient RDMA-based Multi-port Collectives on Multi-rail QsNetII Clusters", 6th Workshop on Communication Architecture for Clusters (CAC 2006), in conjunction with the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), Rhodes Island, Greece, Apr. 25-29, 2006. © IEEE
- Reza Zamani and Ahmad Afsahi, "Communication Characteristics of Message-Passing Scientific and Engineering Applications", 17th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2005), Phoenix, AZ, Nov. 14-16, 2005, pp. 644-649. © ACTA Press
- Ryan E. Grant and Ahmad Afsahi, "Characterization of Multithreaded Scientific Workloads on Simultaneous Multithreading Intel Processors", Workshop on Interaction between Operating System and Computer Architecture (IOSCA 2005), in conjunction with 2005 IEEE International Symposium on Workload Characterization (IISWC 2005), Austin, TX, Oct. 6-8, 2005, pp. 13-19.
- Reza Zamani, Ying Qian, and Ahmad Afsahi, "An Evaluation of the Myrinet/GM2 Two-Port Networks", 3rd IEEE Workshop on High-Speed Local Networks (HSLN 2004), In Proceedings of the 29th Annual IEEE Conference on Local Computer Networks (LCN 2004), Tampa, FL, Nov. 16-18, 2004, pp. 734-742. © IEEE
- Ying Qian, Ahmad Afsahi, and Reza Zamani, "Myrinet Networks: A Performance Study", 3rd IEEE International Symposium on Network Computing and Applications (NCA04), Cambridge, MA, Aug. 30 - Sept. 1, 2004, pp. 323-328. © IEEE
- Ying Qian, Ahmad Afsahi, Nathan R. Fredrickson, and Reza Zamani, "Performance Evaluation of the Sun Fire Link SMP Clusters", 18th International Symposium on High Performance Computing Systems and Applications (HPCS 2004), Winnipeg, MB, May 16-19, 2004, pp. 145-156.
- Ahmad Afsahi and Ying Qian, "Remote Shared Memory over Sun Fire Link Interconnect", 15th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2003), Marina del Rey, CA, Nov. 3-5, 2003, pp. 381-386. © ACTA Press
- Nathan R. Fredrickson, Ahmad Afsahi, and Ying Qian, "Performance Characteristics of OpenMP Constructs, and Application Benchmarks on a Large Symmetric Multiprocessor", 17th Annual ACM International Conference on Supercomputing (ICS 2003), San Francisco, CA, June 23-26, 2003, pp. 140-149. © ACM (acceptance rate: 21.1%, 36/171)
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Efficient Communication Using Message Prediction for Clusters of Multiprocessors", Concurrency and Computation: Practice and Experience (CCPE 2002), Volume 14, Issue 10, 2002, pp. 859-883. © Wiley
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Analysis of a Latency Hiding Broadcasting Algorithm on a Reconfigurable Optical Interconnect", Parallel Processing Letters (PPL 2002), Volume 12, No. 1, 2002, pp. 41-50. © World Scientific Publishing Company
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Architectural Extensions to Support Efficient Communication Using Message Prediction", 16th Annual International Symposium on High Performance Computing Systems and Applications (HPCS 2002), Moncton, NB, June, 2002, pp. 18-25. © IEEE.
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Communication Prediction in Message-Passing Multiprocessors", 14th Annual International Symposium on High Performance Computing Systems and Applications (HPCS 2000), Victoria, BC, June, 2000. High Performance Computing Systems and Applications, 2002, Chapter 18, pp. 253-271. © Kluwer Academic Publishers.
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Efficient Communication Using Message Prediction for Cluster of Multiprocessors", 4th Workshop on Communication, Architecture, and Applications for Network-based Parallel Computing (CANPC 2000), Toulouse, France, held in conjunction with the 6th International Symposium on High-Performance Computer Architecture (HPCA-6), Jan. 2000, Lecture Notes in Computer Science, Vol. 1797, pp. 162-178. © Springer Verlag.
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Communication Latency Hiding in Reconfigurable Message-Passing Environments: Quantitative Studies", 13th Annual International Symposium on High Performance Computing Systems and Applications (HPCS 99), Kingston, ON, June, 1999. High Performance Computing Systems and Applications, 2000, Chapter 19, pp. 137-152. © Kluwer Academic Publishers.
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Hiding Communication Latency in Reconfigurable Message-Passing Environments", 2nd Merged IEEE Symposium IPPS/SPDP 99: 13th International Parallel Processing Symposium & 10th Symposium on Parallel and Distributed Processing, San Juan, Puerto Rico, Apr., 1999, pp. 55-60, © IEEE. (acceptance rate: 43.5%, 113/260)
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Communications Latency Hiding Techniques for a Reconfigurable Optical Interconnect: Benchmark Studies", 4th International Workshop on Applied Parallel Computing, Large Scale Scientific and Industrial Problems (PARA 98), Umeå, Sweden, June, 1998, Lecture Notes in Computer Science, Vol. 1541, pp. 1-6. © Springer Verlag.
- Ahmad Afsahi and Nikitas J. Dimopoulos, "Collective Communications on a Reconfigurable Optical Interconnect", International Conference on Principles of Distributed Systems (OPODIS 97), Chantilly, France, Dec., 1997, pp. 167-181. © Hermes.
Technical Reports
- Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, and Ahmad Afsahi, "Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives", Technical Report ECE-0630, Parallel Processing Research Laboratory, Department of Electrical and Computer Engineering, Queen’s University, Kingston, ON, June 2017.
- Nathan R. Fredrickson, Ahmad Afsahi, and Ying Qian, "Performance Characteristics of OpenMP Constructs, and Applications Benchmarks on a Large Symmetric Multiprocessor", Technical Report ECE-0302, Parallel Processing Research Laboratory, Department of Electrical and Computer Engineering, Queen's University, February 2003.