For the best experience on desktop, install the
Chrome extension
to track your reading on news.ycombinator.com
×
Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
history
|
from
register
DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
(
lmsys.org
)
80 points
by
mji
42 days ago
|
past
|
10 comments
Pipeline Parallelism in SGLang: Scaling to Million-Token Contexts
(
lmsys.org
)
3 points
by
roody_wurlitzer
3 months ago
|
past
Pipeline Parallelism in SGLang: Scaling to Million-Token Contexts and Beyond
(
lmsys.org
)
1 point
by
gmays
4 months ago
|
past
Production-Ready Speculative Decoding Models and Framework
(
lmsys.org
)
1 point
by
gmays
5 months ago
|
past
Mini-SGLang: Efficient Inference Engine in a Nutshell
(
lmsys.org
)
2 points
by
matt_d
5 months ago
|
past
Power Up FSDP2 as a Flexible Training Back End for Miles
(
lmsys.org
)
1 point
by
gmays
5 months ago
|
past
NVIDIA DGX Spark In-Depth Review: A New Standard for Local AI Inference
(
lmsys.org
)
115 points
by
yvbbrjdr
7 months ago
|
past
|
93 comments
Deploying DeepSeek on 96 H100 GPUs
(
lmsys.org
)
285 points
by
GabrielBianconi
9 months ago
|
past
|
80 comments
Deploying DeepSeek on GB200 NVL72 with PD and Large Scale EP: 2.7x Throughput
(
lmsys.org
)
1 point
by
gmays
11 months ago
|
past
Match DeepSeek's inference system performance with SGLang
(
lmsys.org
)
1 point
by
echaozh
on May 6, 2025
|
past
Does style matter? Disentangling style and substance in Chatbot Arena
(
lmsys.org
)
2 points
by
ZeljkoS
on Feb 9, 2025
|
past
Faster JSON Decoding for LLMs
(
lmsys.org
)
1 point
by
gaocegege
on Dec 18, 2024
|
past
Does style matter? Disentangling style and substance in Chatbot Arena
(
lmsys.org
)
1 point
by
scottfr
on Aug 29, 2024
|
past
LLM Lookahead Decoding
(
lmsys.org
)
2 points
by
mr-ai
on Aug 20, 2024
|
past
From Live Data to High-Quality Benchmarks: The Arena-Hard Pipeline – Lmsys Org
(
lmsys.org
)
2 points
by
swyx
on Aug 3, 2024
|
past
Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, VLLM)
(
lmsys.org
)
4 points
by
yvbbrjdr
on July 25, 2024
|
past
RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing
(
lmsys.org
)
4 points
by
adr1an
on July 5, 2024
|
past
RouteLLM: An Open-Source Framework for Cost-Effective LLM Routing
(
lmsys.org
)
4 points
by
not-chatgpt
on July 1, 2024
|
past
|
1 comment
Introducing Hard Prompts Category in Chatbot Arena
(
lmsys.org
)
1 point
by
JumpCrisscross
on June 21, 2024
|
past
Introducing Hard Prompts Category in Chatbot Arena
(
lmsys.org
)
1 point
by
CharlesW
on May 20, 2024
|
past
Hard Prompts Category in Chatbot Arena
(
lmsys.org
)
1 point
by
imjonse
on May 17, 2024
|
past
Whats up with Llama-3? LMSYS leaderboard analysis
(
lmsys.org
)
2 points
by
aadillpickle
on May 15, 2024
|
past
What's Up with Llama 3?
(
lmsys.org
)
4 points
by
tosh
on May 9, 2024
|
past
Gpt2-Chatbot Removed from Lmsys
(
lmsys.org
)
39 points
by
synthwave
on April 30, 2024
|
past
|
11 comments
Lmsys Chatbot Arena: Benchmarking LLMs in the Wild
(
lmsys.org
)
2 points
by
EvgeniyZh
on April 10, 2024
|
past
Fast JSON Decoding for Local LLMs with Compressed Finite State Machine
(
lmsys.org
)
1 point
by
yeesian
on March 7, 2024
|
past
Mistral AI launches Mixtral-Next
(
lmsys.org
)
204 points
by
varunvummadi
on Feb 17, 2024
|
past
|
49 comments
Mistral releases their latest prototype model, Next, to Chatbot Arena
(
lmsys.org
)
2 points
by
vagabund
on Feb 16, 2024
|
past
Fastest JSON Decoding for Local LLMs with Compressed Finite State Machine
(
lmsys.org
)
2 points
by
MMMercy2
on Feb 5, 2024
|
past
5x LLM Throughput with SGLang and RadixAttention
(
lmsys.org
)
2 points
by
DreamGen
on Jan 19, 2024
|
past
More
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
×
HN For You
Display Mode
Highlight
Top
Only
Debug mode
Sign Out
API Key:
Connect
Create an account
to get your API key.