Categories
Publications

Building blocks for network-accelerated distributed file systems

Research paper by our partner ETHZ, accepted at the SC22 Conference that took place online from 14 to 18 November 2022 in Dallas, TX, USA. This paper was produced in collaboration with DEEP-SEA and was a BEST PAPER FINALIST! Abstract: High-performance clusters and datacenters pose increasingly demanding requirements on storage systems. If these systems do not operate at scale, […]

NeVerMore: Exploiting RDMA Mistakes in NVMe-oF Storage Applications

Research paper by our partner ETHZ, accepted at the ACM Conference on Computer and Communications Security (CCS) that took place from 7 to 11 November 2022 in Los Angeles, CA, USA. Abstract: This paper presents a security analysis of the InfiniBand architecture, a prevalent RDMA standard, and NVMe-over-Fabrics (NVMe-oF), a prominent protocol for industrial disaggregated storage […]

Lifting C semantics for dataflow optimization

Research paper by our partner ETHZ, accepted at the ICS ’22 International Conference on Supercomputing that took place online from 28 to 30 June 2022. Abstract: C is the lingua franca of programming and almost any device can be programmed using C. However, programming modern heterogeneous architectures such as multi-core CPUs and GPUs requires explicitly […]

KafkaDirect: Zero-copy Data Access for Apache Kafka over RDMA Networks

Research paper presented at the ACM SIGMOD/PODS Conference that took place in Philadelphia from 12 to 17 June 2022. Abstract: Apache Kafka is an open-source distributed publish-subscribe system, which is widely used in data centers for messaging between applications, log aggregation, and stream processing. The existing Kafka implementation uses TCP/IP for communication, which has various […]

Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching

This paper, prepared by ETH Zürich, was accepted at IPDPS 2022, the 36th IEEE International Parallel and Distributed Processing Symposium, held from 30 May to 3 June 2022 in Lyon, France. Abstract: Triangle count and local clustering coefficient are two core metrics for graph analysis. They find broad application in analyses […]

Optimized Page Fault Handling During RDMA

This research paper was accepted by the journal IEEE Transactions on Parallel and Distributed Systems and will be published in vol. 33, no. 12, pp. 3990-4005, in December 2022. Abstract: Remote Direct Memory Access (RDMA) is widely used in High-Performance Computing (HPC) while making inroads in datacenters and accelerators. State-of-the-art RDMA engines typically do not […]

A RDMA Interface for Ultra-Fast Ultrasound Data-Streaming over an Optical Link

Research paper presented at the DATE 2022 conference (Design, Automation and Test in Europe) that took place online from 14 to 23 March 2022. Abstract: Digital ultrasound (US) probes integrate the analog-to-digital conversion directly on the probe and can be conveniently connected to commodity devices. Existing digital probes are, however, limited to a relatively […]

Flare: flexible in-network allreduce

Research paper presented at the SC21 conference (St. Louis, USA and online). Abstract: The allreduce operation is one of the most commonly used communication routines in distributed applications. To improve its bandwidth and to reduce network traffic, this operation can be accelerated by offloading it to network switches, which aggregate the data received from the hosts, and […]

A RISC-V in-network accelerator for flexible high-performance low-power packet processing

Research paper presented at the ISCA 2021 conference (online). Abstract: The capacity of offloading data and control tasks to the network is becoming increasingly important, especially if we consider the faster growth of network speed when compared to CPU frequencies. In-network compute alleviates the host CPU load by running tasks directly in the network, enabling […]