Skip to content

Terabit-Ethernet/NetChannel

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NetChannel: Disaggregating the Host Network Stack

NetChannel is a new disaggregated network stack architecture that enables resources allocated to each layer in the packet processing pipeline to be scaled and scheduled independently. Evaluation of an end-to-end realization of NetChannel within the Linux network stack demonstrates that NetChannel enables new operating points that were previously unachievable:

  • Independent scaling of data copy processing allows a single application thread to saturate 100Gbps access link bandwidth.
  • Independent scaling of network connections allows short flows to increase throughput almost linearly with cores.
  • Dynamic scheduling of packets between application threads and network connections at fine-grained timescales allows latency-sensitive applications to achieve µs-scale tail latency, even when competing with bandwidth-intensive applications operating at near-line rate.

1. Overview

Repository overview

  • kernel_patch/ includes some modifications in the kernel code.
  • module/ includes NetChannel kernel modules.
  • scripts/ includes scripts for getting started instructions.
  • sigcomm22_artifact/ includes scripts for SIGCOMM 2022 artifact evaluation.
  • util/ includes sample applications.

System overview

For simplicity, we assume that users have two physical servers (Client and Server) connected with each other over networks using the following configuration:

  • Client: 192.168.10.116 (interface: ens2f0)
  • Server: 192.168.10.117 (interface: ens2f0)

Getting Started Guide

Through the following sections, we provide getting started instructions to install NetChannel and to run experiments.

  • Build NetChannel
    • NetChannel Kernel: (10 human-mins + 30 compute-mins + 5 reboot-mins):
      NetChannel requires some modifications in the Linux kernel, so it requires kernel compilation and system reboot into the NetChannel kernel. This section covers how to build the Linux kernel with the NetChannel patch.
    • NetChannel Kernel Module: (5 human-mins):
      This section covers how to build the NetChannel kernel modules.
    • NetChannel Applications: (5 human-mins):
      This section covers how to build the NetChannel test applications.
  • Run a Toy Experiment (5-10 compute-mins):
    This section covers how to setup the servers and run experiments with the NetChannel kernel modules.
  • SIGCOMM 2022 Artifact Evaluation (45 compute-mins, and optionally 20 compute-mins + 5 reboot-mins):
    This section provides the detailed instructions to reproduce all individual results presented in our SIGCOMM 2022 paper.

2. Build NetChannel

NetChannel has been successfully tested on Ubuntu 20.04 LTS with Linux kernel 5.6. Building the NetChannel kernel and kernel modules should be done on both Client and Server machines. Note: if you are using our machines, you can skip this part and directly jump into SIGCOMM 2022 Artifact Evaluation.

Install Prerequisites

We need to install prerequisites to compile the kernel. On Ubuntu 20.04, this can be done with

sudo apt-get install libncurses-dev gawk flex bison openssl libssl-dev dkms dwarves  \
                  libelf-dev libudev-dev libpci-dev libiberty-dev autoconf sysstat iperf

NetChannel Kernel

Note: Don't build the kernel if you use our machine; please jump into SIGCOMM 2022 Artifact Evaluation.

  1. Download Linux kernel source tree:

    cd ~
    wget https://mirrors.edge.kernel.org/pub/linux/kernel/v5.x/linux-5.6.tar.gz
    tar xzvf linux-5.6.tar.gz
    
  2. Download and apply the NetChannel kernel patch to the kernel source:

    git clone https://github.com/Terabit-Ethernet/NetChannel.git
    cd ~/linux-5.6/
    git apply ../NetChannel/kernel_patch/netchannel-kernel.patch
    
  3. Update kernel configuration (with root):

    sudo -s
    cp /boot/config-`uname -r` ./.config
    make olddefconfig
    scripts/config --set-str SYSTEM_TRUSTED_KEYS ""
    
  4. Compile and install:

    make -j32 bzImage
    make -j32 modules
    make modules_install
    make install
    

    The number 32 means the number of threads created for compilation. Set it to be the total number of cores of your system to reduce the compilation time. Type lscpu | grep 'CPU(s)' to see the total number of cores:

    CPU(s):                32
    On-line CPU(s) list:   0-31
    

    In case, the image is too large, instead of using make modules_install, you can use make INSTALL_MOD_STRIP=1 modules_install. More details can be found here: https://01.org/linuxgraphics/gfx-docs/drm/kbuild/kbuild.html#install-mod-strip.

  5. Edit /etc/default/grub to boot with your new kernel by default. For example:

    GRUB_DEFAULT="1>Ubuntu, with Linux 5.6.0-netchannel"
    
  6. Update the grub configuration and reboot into the new kernel.

    update-grub && reboot
    
  7. When system is rebooted, check the kernel version, type uname -r in the command-line. It should be 5.6.0-netchannel.

NetChannel Kernel Modules

  1. Go to the kernel module directory:

    cd ~/NetChannel/module/
    
  2. Edit nd_plumbing.c (line 281) to change the local IP, remote IP address and the number of remote hosts:

    params->local_ip = "192.168.10.116";
    
    /* set the number of remote hosts */
    params->num_remote_hosts = 2;
    params->remote_ips[0] = "192.168.10.116";
    params->remote_ips[1] = "192.168.10.117";
    

    [NOTE] Use params->local_ip = "192.168.10.117" on the Server-side.

  3. Compile and load the NetChannel kernel module:

    make
    sudo insmod nd_module.ko
    

NetChannel Applications

  1. Add IPPROTO_VIRTUAL_SOCK in /usr/include/netinet/in.h:

    We need to define IPPROTO_VIRTUAL_SOCK for NetChannel applications. Add the two lines with sudo vi /usr/include/netinet/in.h (line 58):

    ...
       IPPROTO_VIRTUAL_SOCK = 19,      /* Virtual Socket.  */
    #define IPPROTO_VIRTUAL_SOCK     IPPROTO_VIRTUAL_SOCK
    ...
    
  2. Build io_uring library (liburing):

    cd ~
    git clone https://github.com/axboe/liburing
    cd liburing
    make
    cd ~/NetChannel/util/
    
  3. Edit Makefile to set the liburing-path (line 1):

    liburing-path = /home/(account name)/liburing
    
  4. Edit netdriver_test.cc to change the host IP adddress (line 761):

    addr_in.sin_addr.s_addr = inet_addr("192.168.10.116");
    

    [NOTE] Use inet_addr("192.168.10.117") on the Server-side.

  5. Compile the applications:

    make
    
  6. Build Redis (for artifact evaluation):

    cd ~
    git clone https://github.com/qizhe/redis.git
    cd redis/
    make
    

3. Run a Toy Experiment

[NOTE] You should confirm that NetChannel kernel module is loaded in both machines before activating the NetChannel module.

  1. On both sides:

    Load the NetChannel kernel module with network configuration scripts:

    sudo ~/NetChannel/scripts/network_setup.sh ens2f0
    sudo ~/NetChannel/scripts/enable_arfs.sh ens2f0
    sudo insmod ~/NetChannel/module/nd_module.ko
    
  2. On the Server side:

    Activate the NetChannel kernel module and run a test server application with the Server IP address (e.g., 192.168.10.117):

    sudo ~/NetChannel/scripts/run_module.sh
    cd ~/NetChannel/util/
    ./run_single_server.sh 192.168.10.117 1
    
  3. On the Client side:

    Activate the NetChannel kernel module and run a test client application with the Server IP address (e.g., 192.168.10.117):

    sudo ~/NetChannel/scripts/run_module.sh
    cd ~/NetChannel/util/
    ./run_client.sh 192.168.10.117 1 nd
    

Type sudo killall server on the Server machine to stop the server application.

4.Run your own applications with NetChannel without any changes of applications

  1. build socket redirect library,
cd custom_socket/
sudo ./compile.sh
  1. Run your applications
sudo LD_PRELOAD=~/NetChannel/custom_socket/nd_socket.so ./program

SIGCOMM 2022 Artifact Evaluation

Hardware/Software Configuration

We have used the follwing hardware and software configurations for running the experiments shown in the paper.

  • CPU: 4-Socket Intel Xeon Gold 6234 3.3 GHz with 8 cores per socket (with hyperthreading disabled)
  • RAM: 384 GB
  • NIC: Mellanox ConnectX-5 Ex VPI (100 Gbps)
  • OS: Ubuntu 20.04 with Linux 5.6 (patched)

Caveats of Our Work

Our work has been evaluated with two servers with 4-socket multi-core CPUs and 100 Gbps NICs directly connected with a DAC cable. While we generally focus on trends rather than individual data points, other combinations of end-host network stacks and hardware may exhibit different performance characteristics. All our scripts use network_setup.sh to configure the NIC to allow a specific benchmark to be performed. Some of these configurations may be specific to Mellanox NICs (e.g., enabling aRFS).

NetChannel Configurations

On both sides:

  1. Load NetChannel module:

    sudo ~/NetChannel/scripts/network_setup.sh ens2f0
    sudo ~/NetChannel/scripts/enable_arfs.sh ens2f0
    sudo insmod ~/NetChannel/module/nd_module.ko
    

    [NOTE] Step 1 should be done on both sides before Step 2.

  2. Activate NetChannel module:

    sudo ~/NetChannel/scripts/run_module.sh
    cd ~/NetChannel/sigcomm22_artifact/
    
  3. Edit param.sh to change the IP addresses and interface name:

    client_ip=192.168.10.116
    server_ip=192.168.10.117
    iface=ens2f0
    
  4. (Skip if configured) Configure sysstat and ssh:

    We use sar/sysstat and ssh to measure CPU utilization in the artifact evaluation scripts. Please refer to Installing sar/sysstat (both sides) and SSH login without password (Client-side only). Once the configrations are done correctly, you should be able to measure Server-side CPU utilization via the following command:

    Client: ssh 192.168.10.117 'sar -u 3 1'

    Linux 5.6.0-netchannel (xxxx)         xx/xx/2022      _x86_64_        (32 CPU)
    
    10:48:53 AM     CPU     %user     %nice   %system   %iowait    %steal     %idle
    10:48:58 AM     all      0.01      0.00      0.03      0.00      0.00     99.96
    Average:        all      0.01      0.00      0.03      0.00      0.00     99.96
    

NetChannel Experiments

  • Figure 6a, 6b (Data copy processing parallelism) (~2 minutes):

    For read/write syscalls:

    • Server: ./fig6a6b-nc-server.sh
    • Client: ./fig6a6b-nc-client.sh

    (Server: sudo killall server to stop the server application.)

    For io_uring:

    • Server: ./fig6a6b-nc-uring-server.sh
    • Client: ./fig6a6b-nc-uring-client.sh

    [NOTE] You can ignore these error messages: failed cqe: -107 recv_longflow failed

  • Figure 6c (Network processing parallelism) (~8 minutes):

    [NOTE] Repeat the following experiments with varying <#channels>: 1, 2, 3, 4 to get the entire Figure 6c results.

    For read/write syscalls:

    • Server: ./fig6c-nc-server.sh <#channels>
    • Client: ./fig6c-nc-client.sh <#channels>

    (Server: sudo killall server to change <#channels>.)

    For io_uring:

    • Server: ./fig6c-nc-uring-server.sh <#channels>
    • Client: ./fig6c-nc-uring-client.sh <#channels>
  • Figure 6d (Performance isolation) (~2 minutes):

    For the isolated case:

    • Server: ./fig6d-nc-isol-server.sh
    • Client: ./fig6d-nc-isol-client.sh

    (Server: sudo killall pingpong_server to stop the server application.)

    For the interference case:

    • Server: ./fig6d-nc-intf-server.sh
    • Client: ./fig6d-nc-intf-client.sh

    (Server: sudo killall server pingpong_server to stop the server application.)

  • Figure 7 (Redis performance) (~6 minutes):

    • Server: ./fig7-nc-server.sh
    • Client: ./fig7-nc-client.sh

    (Server: Ctrl+C to stop the server application.)

  • Figure 8a (Overheads of emulating Linux network stack) (~1 minute):

    • Server: ./fig8a-nc-server.sh
    • Client: ./fig8a-nc-client.sh

    (Server: sudo killall server to stop the server application.)

  • Figure 8b (Overheads of scaling data copy processing) (~1 minute):

    • Server: ./fig8b-nc-server.sh
    • Client: ./fig8b-nc-client.sh

    (Server: sudo killall server to stop the server application.)

  • Figure 8c (Overheads of scaling packet processing) (~4 minutes):

    [NOTE] Repeat the following experiment with varying <#channels>: 1, 2, 3, 4 to get the entire Figure 8c results.

    • Server: ./fig8c-nc-server.sh <#channels>
    • Client: ./fig8c-nc-client.sh <#channels>

    (Server: sudo killall server to change <#channels>.)

  • Figure 8d (Overheads of achieving performance isolation) (~2 minutes):

    For the isolated case:

    • Server: ./fig8d-nc-isol-server.sh
    • Client: ./fig8d-nc-isol-client.sh

    (Server: sudo killall server to stop the server application.)

    For the interference case:

    • Server: ./fig8d-nc-intf-server.sh
    • Client: ./fig8d-nc-intf-client.sh

    (Server: sudo killall server pingpong_server to stop the server application.)

Default Linux TCP Experiments

  • Figure 6a, 6b (data copy processing parallelism) (~2 minutes)

    For read/write syscalls:

    • Server: ./fig6a6b-tcp-server.sh
    • Client: ./fig6a6b-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

    For io_uring:

    • Server: ./fig6a6b-tcp-uring-server.sh
    • Client: ./fig6a6b-tcp-uring-client.sh
  • Figure 6c (network processing parallelism) (~2 minutes)

    For read/write syscalls:

    • Server: ./fig6c-tcp-server.sh
    • Client: ./fig6c-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

    For io_uring:

    • Server: ./fig6c-tcp-uring-server.sh
    • Client: ./fig6c-tcp-uring-client.sh

    [NOTE] This result generates the "Without NetChannel" case in Figure 6c.

  • Figure 6d (performance isolation) (~2 minutes)

    For the isolated case:

    • Server: ./fig6d-tcp-isol-server.sh
    • Client: ./fig6d-tcp-isol-client.sh

    (Server: sudo killall pingpong_server to stop the server application.)

    For the interference case:

    • Server: ./fig6d-tcp-intf-server.sh
    • Client: ./fig6d-tcp-intf-client.sh

    (Server: sudo killall server pingpong_server to stop the server application.)

  • Figure 7 (Redis performance) (~6 minutes):

    • Server: ./fig7-tcp-server.sh
    • Client: ./fig7-tcp-client.sh

    (Server: Ctrl+C to stop the server application.)

  • Figure 8a (Overheads of emulating Linux network stack) (~1 minute):

    • Server: ./fig8a-tcp-server.sh
    • Client: ./fig8a-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

  • Figure 8b (Overheads of scaling data copy processing) (~1 minute):

    • Server: ./fig8b-tcp-server.sh
    • Client: ./fig8b-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

  • Figure 8d (Overheads of achieving performance isolation) (~2 minutes):

    For the isolated case:

    • Server: ./fig8d-tcp-isol-server.sh
    • Client: ./fig8d-tcp-isol-client.sh

    (Server: sudo killall server to stop the server application.)

    For the interference case:

    • Server: ./fig8d-tcp-intf-server.sh
    • Client: ./fig8d-tcp-intf-client.sh

    (Server: sudo killall server pingpong_server to stop the server application.)

(Optional) Section 2 Motivation Experiments

  • Figure 2a, 2b (Lack of scalability for long flows) (~4 minutes):

    For TCP read/write syscalls with aRFS:

    • Server: ./fig2-arfs-tcp-server.sh
    • Client: ./fig2-arfs-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

    For TCP read/write syscalls with manual IRQ steering:

    • Server: ./fig2-irq-tcp-server.sh
    • Client: ./fig2-irq-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

    For TCP io_uring with aRFS:

    • Server: ./fig2-arfs-tcp-uring-server.sh
    • Client: ./fig2-arfs-tcp-uring-client.sh

    For TCP io_uring with manual IRQ steering:

    • Server: ./fig2-irq-tcp-uring-server.sh
    • Client: ./fig2-irq-tcp-uring-client.sh

    For MPTCP:

    [NOTE] Please go to "fig2_mptcp/" to run the MPTCP scripts. We however recommend you complete the following experiments first and finally come back to the MPTCP experiments as MPTCP requires a system reboot to change the kernel.

  • Figure 3 (Lack of scalability for short flows) (~2 minutes):

    For read/write syscalls:

    • Server: ./fig3-tcp-server.sh
    • Client: ./fig3-tcp-client.sh

    (Server: sudo killall server to stop the server application.)

    For io_uring:

    • Server: ./fig3-tcp-uring-server.sh
    • Client: ./fig3-tcp-uring-client.sh
  • Figure 4 (Lack of performance isolation) (~6 minutes):

    For the isolated case:

    • Server: ./fig4-tcp-isol-server.sh
    • Client: ./fig4-tcp-isol-client.sh for Linux
    • Client: ./fig4-tcp-prio-isol-client.sh for Linux+prioritization

    (Server: sudo killall server pingpong_server to stop the server application.)

    For the interference case:

    • Server: ./fig4-tcp-intf-server.sh
    • Client: ./fig4-tcp-intf-client.sh for Linux
    • Client: ./fig4-tcp-prio-intf-client.sh for Linux+prioritization

    (Server: sudo killall server pingpong_server to stop the server application.)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published