iOptimizeThings/dlmserve: OpenAI-compatible HTTP serving for diffusion language models. Continuous batching + LocalLeap acceleration.

OpenAI-compatible HTTP serving for diffusion language models. Continuous batching + LocalLeap acceleration.

Read Original

Related