
DennisL123 · 2025-05-13T06:48:11 1747118891

A BUILD.md could be useful.

mdaniel · 2025-05-13T15:43:29 1747151009

The readme is literally 21 lines long, about 25% of them blank. Its 2nd URL points to "how to contribute", and that page has a dedicated "how to build and run" section: https://firefox-source-docs.mozilla.org/contributing/contrib...

The bad news is that their build system is extremely hand-rolled, so if it works for you, count yourself lucky, because when it doesn't work you're in for 4 hours of Python hell.

DennisL123 · 2025-05-06T06:18:24 1746512304

Interesting stuff. Not sure if I read this right that it’s 16- and 32-bit integer values that get sorted. If yes, I’d love to see if the GPU implementation can beat a competitive radix sort implementation on a CPU.

winwang · 2025-05-06T07:05:46 1746515146

It's 32 32-bit values which get sorted. I don't think a GPU sort would beat a CPU sort at this scale, even if you don't take kernel launch time into account. CPUs are simply too fast for (super-)small data, especially with AVX-512. But if we're talking about a larger amount of data, that would be a different story, e.g. as part of a normal GPU merge sort.

maeln · 2025-05-06T07:24:15 1746516255

It is also useful if your data already lives in GPU memory. For example, when you need to z-sort a bunch of particles in a 3D renderer's particle system.

exDM69 · 2025-05-06T08:55:18 1746521718

A 32-way GPU sorting algorithm might be just what I need for sorting and deduplicating triangle IDs in a visibility buffer renderer I am working on.

Thanks for sharing.

winwang · 2025-05-06T18:36:32 1746556592

As someone who doesn't know very much about graphics (ironically), you're welcome and hope it helps!

DennisL123 · 2025-04-24T05:12:22 1745471542

OSRM lead dev here. Love to see an instance this large being solved.

DennisL123 · on March 18, 2025

It precomputes partial routes that are combined at run time. :)

DennisL123 · on Feb 28, 2025

tl;dr: doesn’t work as expected and we sank a ton of money into it too.

DennisL123 · on Feb 18, 2025

Shitty tools lead to wasted time. True for programming, and apparently also for appliance installations.

Good job on pulling through, though.

DennisL123 · on Jan 4, 2025

tl;dr: it’s a fast and simple (~5 LOCs) PRNG that has gained some traction in the computer chess community.

DennisL123 · on Dec 14, 2024

Love it

DennisL123 · on Nov 29, 2024

It’s meltdown proof in principle since the process goes into an equilibrium rather than into a runaway process.

DennisL123 · on Oct 26, 2024

Without knowing what they actually use, I feel comfortable stating that the industry has moved on from Contraction Hierarchies to somewhat more flexible techniques. These allow you to take traffic information, road closures, user preferences, and whatnot into account without requiring a full re-processing of the input data with each traffic update. The state of the art is a two-step preprocessing that first decomposes the road network into cells, and then processes these cells independently. Sometimes it goes by the name of customisable route planning, sometimes it is called multi-level Dijkstra.
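
To make the two-step idea concrete, here is a minimal single-level sketch in Python. It is an illustration only, not the code of any particular routing engine: the toy graph, the hand-made cell assignment, and the function names `dijkstra`, `customize`, and `query` are all assumptions. Real systems use several nested levels and a proper graph partitioner.

```python
import heapq
from collections import defaultdict

def dijkstra(source, neighbors):
    """Plain Dijkstra; neighbors(u) returns an iterable of (v, weight)."""
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in neighbors(u):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

def customize(graph, cell):
    """Build an overlay: per-cell shortcuts between boundary nodes plus
    the original edges that cross from one cell into another."""
    boundary = set()
    for u, edges in graph.items():
        for v in edges:
            if cell[u] != cell[v]:
                boundary.update((u, v))
    overlay = defaultdict(dict)
    for b in boundary:
        # Search restricted to the cell that b belongs to.
        def inside(u, c=cell[b]):
            return [(v, w) for v, w in graph.get(u, {}).items() if cell[v] == c]
        for v, d in dijkstra(b, inside).items():
            if v in boundary and v != b:
                overlay[b][v] = d  # shortcut across the cell
        for v, w in graph.get(b, {}).items():
            if cell[v] != cell[b]:
                overlay[b][v] = w  # edge crossing into another cell
    return overlay

def query(graph, cell, overlay, s, t):
    """Use original edges inside the source and target cells,
    and only overlay edges everywhere else."""
    local = {cell[s], cell[t]}
    def neighbors(u):
        edges = graph.get(u, {}) if cell[u] in local else overlay.get(u, {})
        return edges.items()
    return dijkstra(s, neighbors).get(t, float("inf"))

# Hypothetical toy network: three cells A, B, C connected in a line.
graph = defaultdict(dict)
for u, v, w in [("a1", "a2", 1), ("a2", "b1", 1), ("b1", "b2", 2),
                ("b2", "b3", 2), ("b1", "b3", 5), ("b3", "c1", 1),
                ("c1", "c2", 1)]:
    graph[u][v] = w
    graph[v][u] = w
cell = {n: n[0].upper() for n in graph}

overlay = customize(graph, cell)
print(query(graph, cell, overlay, "a1", "c2"))  # 8
```

When a weight changes (say, traffic on an edge inside cell B), only the shortcuts of that cell need recomputing. For brevity this sketch simply reruns `customize` over all cells, but each cell's shortcuts depend only on edges inside that cell, which is what makes the per-cell step independent and cheap to repeat.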
