.. Copyright (c) <2019>, Intel Corporation All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: - Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. - Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. - Neither the name of Intel Corporation nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission. THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. =============== CBDMA test plan =============== Sample Application of packet copying using Intel Quick Data Technology ====================================================================== Overview -------- This sample is intended as a demonstration of the basic components of a DPDK forwarding application and example of how to use IOAT driver API to make packets copies. Also, this application can be used to compare performance of memory copy with different packet size between CPU and DMA engine. The application will print out statistics each second. The stats shows received/send packets and packets dropped or failed to copy. The application can be launched in various configurations depending on provided parameters. Each port can use up to 2 lcores: one of lcore receives incoming traffic and makes a copy of each packet, the second lcore then updates MAC address and sends the copy. If one lcore per port is used, both operations are done sequentially. For each configuration an additional lcore is needed since master lcore in use which is responsible for configuration, statistics printing and safe deinitialization of all ports and devices. Also, the application can use 8 ports in maximum. Running the Application ----------------------- In order to run the hardware copy application, the copying device needs to be bound to user-space IO driver. Refer to the *IOAT Rawdev Driver for Intel QuickData Technology* guide for information on using the driver. The application requires a number of command line options: .. code-block:: console ./build/ioatfwd [EAL options] -- -p MASK [-q NQ] [-s RS] [-c <sw|hw>] [--[no-]mac-updating] where, * p MASK: A hexadecimal bitmask of the ports to configure * q NQ: Number of Rx queues used per port equivalent to CBDMA channels per port * c CT: Performed packet copy type: software (sw) or hardware using DMA (hw) * s RS: Size of IOAT rawdev ring for hardware copy mode or rte_ring for software copy mode * --[no-]mac-updating: Whether MAC address of packets should be changed or not Packet pipeline: ================ NIC RX -> copy packet -> free original -> update mac addresses -> NIC TX Test Case1: CBDMA basic test with differnet size packets ======================================================== 1.Bind one cbdma port and one nic port to igb_uio driver. 2.Launch ioatfwd app:: ./build/ioatfwd -l 0-1 -n 2 -- -p 0x1 -c hw --mac-updating 3.Send different size packets (64B, 256B, 512B, 1024B, IMIX) from TG to NIC. 4.Check performance from “Total packets Tx†and check log includes "Worker Threads = 1, Copy Mode = hw". Test Case2: CBDMA test with multi-threads ========================================= 1.Bind one cbdma port and one nic port to igb_uio driver. 2.Launch ioatfwd app with three cores:: ./build/ioatfwd -l 0-2 -n 2 -- -p 0x1 -c hw 3. Send different size packets from TG to NIC. 4.Check performance from “Total packets Tx†and check log includes "Worker Threads = 2, Copy Mode = hw". Test Case3: CBDMA test with multi nic ports =========================================== 1.Bind two cbdma ports and two nic ports to igb_uio driver. 2.Launch ioatfwd app with multi-ports:: ./build/ioatfwd -l 0-4 -n 2 -- -p 0x3 -q 1 -c hw 3.Send different size packets (64B, 256B, 512B, 1024B, IMIX) from TG to two NIC ports. 4.Check stats of two ports, each port's performance shows in “Total packets Tx†and each port's log includes "Worker Threads = 2, Copy Mode = hw". Test Case4: CBDMA test with multi-queues ======================================== 1.Bind two cbdma ports and one nic port to igb_uio driver. 2.Launch ioatfwd app with multi-queues:: ./build/ioatfwd -l 0-2 -n 2 -- -p 0x1 -q 2 -c hw 3. Send random ip packets (64B, 256B, 512B, 1024B, IMIX) from TG to NIC port. 4. Check stats of ioat app, "Worker Threads = 2, Copy Mode = hw, Rx Queues = 2" and each ioat channel can enqueue packets. 5. Repeat step1 to step4 with queue number 4 and qemu number 8, also bind same number cbdma ports. Check performance gains status when queue numbers added. Test Case5: CBDMA performance cmparison between mac-updating and no-mac-updating ================================================================================ 1.Bind one cbdma ports and one nic port to igb_uio driver. 2.Launch ioatfwd app:: ./build/ioatfwd -l 0-1 -n 2 -- -p 0x1 -q 2 --no-mac-updating -c hw 3. Send random ip 64B packets from TG. 4. Check performance from ioat app:: Total packets Tx: xxx [pps] 5.Launch ioatfwd app:: ./build/ioatfwd -l 0-1 -n 2 -- -p 0x1 -q 2 --mac-updating -c hw 6. Send random ip 64B packets from TG. 7. Check performance from ioat app:: Total packets Tx: xxx [pps] Test Case6: CBDMA performance cmparison between HW copies and SW copies using different packet size =================================================================================================== 1.Bind four cbdma pors and one nic port to igb_uio driver. 2.Launch ioatfwd app with three cores:: ./build/ioatfwd -l 0-2 -n 2 -- -p 0x1 -q 4 -c hw 3. Send random ip packets from TG. 4. Check performance from ioat app:: Total packets Tx: xxx [pps] 5.Launch ioatfwd app with three cores:: ./build/ioatfwd -l 0-2 -n 2 -- -p 0x1 -q 4 -c sw 6. Send random ip packets from TG. 7. Check performance from ioat app and compare with hw copy test:: Total packets Tx: xxx [pps]