For the best experience on desktop, install the
Chrome extension
to track your reading on news.ycombinator.com
×
Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
history
|
from
register
Ggwave: Tiny Data-over-Sound Library
(
github.com/ggerganov
)
284 points
by
LorenDB
on Feb 24, 2025
|
past
|
72 comments
Whisper.cpp: Looking for Maintainers
(
github.com/ggerganov
)
3 points
by
tech234a
on Feb 5, 2025
|
past
Llama.cpp now supports tool calling (OpenAI-compatible)
(
github.com/ggerganov
)
3 points
by
ochafik
on Feb 1, 2025
|
past
|
1 comment
Ollama are 'try[ing to] achieve vendor lock-in'
(
github.com/ggerganov
)
17 points
by
alexmorley
on Jan 31, 2025
|
past
|
5 comments
Ggml: X2 speed for WASM by optimizing SIMD
(
github.com/ggerganov
)
1 point
by
btilly
on Jan 28, 2025
|
past
|
2 comments
DeepSeek-R1 speeds up llama.cpp code by x2
(
github.com/ggerganov
)
6 points
by
roboboffin
on Jan 28, 2025
|
past
|
3 comments
Llama.cpp PR with 99% of code written by DeepSeek-R1
(
github.com/ggerganov
)
4 points
by
zelag
on Jan 28, 2025
|
past
Ggml 2x WASM Speed with SIMD Optimization Using 99% DeekSeek-R1-Generated Code
(
github.com/ggerganov
)
7 points
by
bratao
on Jan 27, 2025
|
past
Train a Mnist VAE with C and CUDA
(
github.com/ggerganov
)
54 points
by
bssrdf
on Dec 21, 2024
|
past
|
2 comments
Llama.cpp Now Supports Qwen2-VL (Vision Language Model)
(
github.com/ggerganov
)
155 points
by
BUFU
on Dec 14, 2024
|
past
|
50 comments
Llama.vim: Plugin for Neovim
(
github.com/ggerganov
)
2 points
by
mariuz
on Oct 22, 2024
|
past
Llama.vim: Plugin for Neovim
(
github.com/ggerganov
)
2 points
by
ibobev
on Oct 21, 2024
|
past
Attention and final logit soft-capping, update scaling factor to Gemma2
(
github.com/ggerganov
)
2 points
by
tosh
on July 1, 2024
|
past
Distributed LLM Inference with Llama.cpp
(
github.com/ggerganov
)
3 points
by
tosh
on May 24, 2024
|
past
New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy
(
github.com/ggerganov
)
382 points
by
weinzierl
on May 15, 2024
|
past
|
72 comments
ggml: Add Flash Attention
(
github.com/ggerganov
)
2 points
by
tosh
on May 13, 2024
|
past
Acoustic Keyboard Eavesdropping
(
github.com/ggerganov
)
1 point
by
behnamoh
on May 11, 2024
|
past
llama.cpp bfloat16 support
(
github.com/ggerganov
)
2 points
by
indigodaddy
on April 30, 2024
|
past
GGML Flash Attention support merged into llama.cpp
(
github.com/ggerganov
)
3 points
by
smcleod
on April 30, 2024
|
past
|
1 comment
Llama.cpp Working on Support for Llama3
(
github.com/ggerganov
)
7 points
by
theolivenbaum
on April 18, 2024
|
past
Llama.cpp: Improve CPU prompt eval speed
(
github.com/ggerganov
)
1 point
by
tosh
on April 17, 2024
|
past
Llama.cpp: Mac Prebuilds
(
github.com/ggerganov
)
2 points
by
tosh
on March 22, 2024
|
past
Grok-1 Support for Llama.cpp
(
github.com/ggerganov
)
11 points
by
schappim
on March 22, 2024
|
past
|
2 comments
Control Vectors have been added to llama.cpp
(
github.com/ggerganov
)
3 points
by
Der_Einzige
on March 16, 2024
|
past
Gemma Is Added to Llama.cpp
(
github.com/ggerganov
)
17 points
by
behnamoh
on Feb 21, 2024
|
past
Llama.cpp supports distributed inference across machines on a local network
(
github.com/ggerganov
)
3 points
by
behnamoh
on Jan 27, 2024
|
past
Llama.cpp incoming backends: Vulkan, Kompute, SYCL
(
github.com/ggerganov
)
2 points
by
irusensei
on Jan 27, 2024
|
past
Llama.cpp: Self-Extend Support
(
github.com/ggerganov
)
2 points
by
tosh
on Jan 9, 2024
|
past
Llama.cpp: SOTA 2-bit quants
(
github.com/ggerganov
)
5 points
by
tosh
on Jan 7, 2024
|
past
GGUF File Format
(
github.com/ggerganov
)
2 points
by
warkanlock
on Dec 31, 2023
|
past
More
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
×
HN For You
Display Mode
Highlight
Top
Only
Debug mode
Sign Out
API Key:
Connect
Create an account
to get your API key.