Submissions from lmsys.org

		DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles (lmsys.org)
		80 points by mji 42 days ago \| past \| 10 comments
		Pipeline Parallelism in SGLang: Scaling to Million-Token Contexts (lmsys.org)
		3 points by roody_wurlitzer 3 months ago \| past
		Pipeline Parallelism in SGLang: Scaling to Million-Token Contexts and Beyond (lmsys.org)
		1 point by gmays 4 months ago \| past
		Production-Ready Speculative Decoding Models and Framework (lmsys.org)
		1 point by gmays 5 months ago \| past
		Mini-SGLang: Efficient Inference Engine in a Nutshell (lmsys.org)
		2 points by matt_d 5 months ago \| past
		Power Up FSDP2 as a Flexible Training Back End for Miles (lmsys.org)
		1 point by gmays 5 months ago \| past
		NVIDIA DGX Spark In-Depth Review: A New Standard for Local AI Inference (lmsys.org)
		115 points by yvbbrjdr 7 months ago \| past \| 93 comments
		Deploying DeepSeek on 96 H100 GPUs (lmsys.org)
		285 points by GabrielBianconi 9 months ago \| past \| 80 comments
		Deploying DeepSeek on GB200 NVL72 with PD and Large Scale EP: 2.7x Throughput (lmsys.org)
		1 point by gmays 11 months ago \| past
		Match DeepSeek's inference system performance with SGLang (lmsys.org)
		1 point by echaozh on May 6, 2025 \| past
		Does style matter? Disentangling style and substance in Chatbot Arena (lmsys.org)
		2 points by ZeljkoS on Feb 9, 2025 \| past
		Faster JSON Decoding for LLMs (lmsys.org)
		1 point by gaocegege on Dec 18, 2024 \| past
		Does style matter? Disentangling style and substance in Chatbot Arena (lmsys.org)
		1 point by scottfr on Aug 29, 2024 \| past
		LLM Lookahead Decoding (lmsys.org)
		2 points by mr-ai on Aug 20, 2024 \| past
		From Live Data to High-Quality Benchmarks: The Arena-Hard Pipeline – Lmsys Org (lmsys.org)
		2 points by swyx on Aug 3, 2024 \| past
		Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, VLLM) (lmsys.org)
		4 points by yvbbrjdr on July 25, 2024 \| past
		RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing (lmsys.org)
		4 points by adr1an on July 5, 2024 \| past
		RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing (lmsys.org)
		4 points by not-chatgpt on July 1, 2024 \| past \| 1 comment
		Introducing Hard Prompts Category in Chatbot Arena (lmsys.org)
		1 point by JumpCrisscross on June 21, 2024 \| past
		Introducing Hard Prompts Category in Chatbot Arena (lmsys.org)
		1 point by CharlesW on May 20, 2024 \| past
		Hard Prompts Category in Chatbot Arena (lmsys.org)
		1 point by imjonse on May 17, 2024 \| past
		Whats up with Llama-3? LMSYS leaderboard analysis (lmsys.org)
		2 points by aadillpickle on May 15, 2024 \| past
		What's Up with Llama 3? (lmsys.org)
		4 points by tosh on May 9, 2024 \| past
		Gpt2-Chatbot Removed from Lmsys (lmsys.org)
		39 points by synthwave on April 30, 2024 \| past \| 11 comments
		Lmsys Chatbot Arena: Benchmarking LLMs in the Wild (lmsys.org)
		2 points by EvgeniyZh on April 10, 2024 \| past
		Fast JSON Decoding for Local LLMs with Compressed Finite State Machine (lmsys.org)
		1 point by yeesian on March 7, 2024 \| past
		Mistral AI launches Mixtral-Next (lmsys.org)
		204 points by varunvummadi on Feb 17, 2024 \| past \| 49 comments
		Mistral releases their latest prototype model, Next, to Chatbot Arena (lmsys.org)
		2 points by vagabund on Feb 16, 2024 \| past
		Fastest JSON Decoding for Local LLMs with Compressed Finite State Machine (lmsys.org)
		2 points by MMMercy2 on Feb 5, 2024 \| past
		5x LLM Throughput with SGLang and RadixAttention (lmsys.org)
		2 points by DreamGen on Jan 19, 2024 \| past
		More

HN For You