sito42's comments

sito42 · 2026-03-04T14:53:48 1772636028

> Now people barely bring it up at all. It’s like a lion has escaped the zoo and it’s gulping down schoolchildren, but when people suggest zoo improvements, all the agenda items are like, “We should add another Dippin’ Dots kiosk”. If you bring up the loose tiger, everyone gets annoyed at you, like “Of course, no one likes the tiger”.

saghm · 2026-03-04T15:40:19 1772638819

Is the switch from lion to tiger in that paragraph intentional? If only the quote from "everyone" was switched I might think it's intended to convey people trying to derisively dismiss the issue or something but it does specifically reference the tiger rather than the lion being brought up, so I was confused when I got to this part of the article.

sito42 · 2025-10-06T15:33:58 1759764838

> As much as people like to use NumPyro and sometimes even PyMC to generate JAX code, I think it may be easier in the end to just write JAX directly. That way, nothing gets between you and JAX and you don’t have to figure out how to filter JAX through middleware. When you do that, the models can be organized very much like in Stan.

^much truth. Nascent libraries like distreqx make it much easier to work at a lower level while maintaining some of the log density affordances that PPLs provide.

https://github.com/lockwo/distreqx

sito42 · 2025-09-08T19:52:46 1757361166

> The defining - and best - feature of preprint servers is that they do not try to litigate the rigor of the science in a paper, or try to decide whether it will ultimately prove important, before posting it. They just post it. We should embrace this, and fight the temptation to reinstate gatekeeping criteria and systems that disempower authors without providing any value.

jleyank · 2025-09-08T19:59:50 1757361590

S/N reduction. Without this, it will be impossible to wade through the flood of papers. Too easy to hemmorage them out anymore.

sito42 · 2025-09-08T22:19:35 1757369975

Author reputation and citation patterns provide plenty(?) of signal without journal/reviewer/editor endorsement. But you could still imagine introducing “badges” that provide similar additional signal to what publication in a top journal provides today. Academic societies issue a fixed number of badges to top preprints each year. But ditch the song and dance around peer review.

gus_massa · 2025-09-09T01:12:47 1757380367

> citation patterns provide plenty(?) of signal

There are self citations and citation rings. The signal is very week, even with the curren model.

> introducing “badges”

You reinvented journals.

sito42 · 2025-09-09T01:35:12 1757381712

> There are self citations and citation rings.

https://en.wikipedia.org/wiki/Eigenfactor and other such indices are much less game-able

Peer review (as currently practiced) and closed access are the problems, not the quality signal provided by journal publication.

sito42 · 2025-08-20T18:02:31 1755712951

astral bags another one

RS-232 · 2025-08-20T18:14:59 1755713699

Is ty more mature than pyright or mypy?

I'm currently using pyright, but I'm going to migrate once ty and its vscode extension are given the "production ready" greenlight.

sito42 · 2025-08-20T18:56:58 1755716218

at this stage I get very few false positives and it's so much easier to configure and use than pyright

marcyb5st · 2025-08-21T07:36:21 1755761781

Personal experience: if you use injector[1] with NewType so that you can give your primitive types a meaning and add them to your injection stack it completely fails. For example:

```python

ModelName = NewType("ModelName", str)

# You bind your string within a module: binder.bind(ModelName, to=ModelName(parsed_args.model_name))

# When you need it: model_name = injector.get(ModelName) # Here it fails, saying that you need "concrete" types or something similar

```

So while it is great already it definitely still has many rough edges still. But it is to be expected from alpha releases

[1] https://pypi.org/project/injector/

denis- · 2025-08-20T19:57:31 1755719851

ty still doesn't understand match + typing.assert_never pattern, last barrier for me to switching.

sito42 · 2025-07-31T03:12:38 1753931558

Most scientific "products" are not bound books or pdf. The software, datasets, proofs, algorithms, etc contained within them (or in the dreaded "replication materials") are usually more valuable than the text that scientists write around them. We use the text merely to communicate the contribution and its value to other humans. I suspect AI will outperform the best humans at this communication task very soon. Is the purpose of a dissertation really to demonstrate that a human has the capacity to write effective prose? There are many scientists who are brilliant but are terrible at this task. Do we really wish to shun them and their ideas?

In the future, the best science will be produced by those that wield AI tools most effectively. Academics need to figure out how to assess scientific work within this context. This is not a good solution.

sito42 · 2025-07-30T02:15:18 1753841718

I don't understand why this framework has become so popular. You can't instantiate or program around `FlowSpec`s. The type safety is terrible (you're supposed to just haphazardly assign to `self` whenever you need to persist something across steps). `step`s just feel like glorified jupyter cells.

There's been so much great innovation in the ML ops space in the last few years and metaflow puts you in a straightjacket that prevents you from tapping into any of it.

constGard · 2025-07-30T05:28:52 1753853332

Off the top of your head, could you describe a good/great ml ops infra setup? I've been reading what I can about state of the art ml ops after not being in the space for a few years and metaflow comes up a lot.

knowsuchagency · 2025-07-30T21:33:34 1753911214

Nothing touches https://windmill.dev. There are some rare instances where you still may want Airflow, but even so, windmill and airflow complement each other well. If you need notebooks, I recommend https://docs.marimo.io/

sito42 · 2025-05-07T01:24:09 1746581049

Closing in on alpha https://github.com/astral-sh/ruff/milestone/22

sito42 · on Feb 10, 2024

numpyro is an underrated library that runs on jax which makes it easy to put on the gpu. They have a nice suite of examples.

https://github.com/pyro-ppl/numpyro

canyon289 · on Feb 10, 2024

Numpyro is fantastic, great syntax and great contributors as well.

PyMC now supports jax and numba as well, and can use the numpyro samplers. https://www.pymc.io/projects/examples/en/latest/samplers/fas...

Between all these tools we hope the modern bayesian can find the one that works best for them

HN For You