For the best experience on desktop, install the
Chrome extension
to track your reading on news.ycombinator.com
×
Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
history
|
from
register
Low-Latency Inference with Speculative Decoding on D-Matrix Corsair and GPU
(
gimletlabs.ai
)
1 point
by
nserrino
40 days ago
|
past
The emerging role of SRAM-centric chips in AI inference
(
gimletlabs.ai
)
7 points
by
gmays
43 days ago
|
past
The emerging role of SRAM-centric chips in AI inference
(
gimletlabs.ai
)
3 points
by
nserrino
46 days ago
|
past
Speeding up PyTorch inference on Apple devices with AI-generated Metal kernels
(
gimletlabs.ai
)
187 points
by
nserrino
7 months ago
|
past
|
30 comments
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
×
HN For You
Display Mode
Highlight
Top
Only
Debug mode
Sign Out
API Key:
Connect
Create an account
to get your API key.