UCCL-EP: DeepEP-style expert parallelism on any NIC, no GPU-initiated comms
How UCCL reimplements the DeepEP kernels for arbitrary hardware by swapping the transport out from under them.
Read full article →