
XYen0n · 2026-06-09T17:11:25 1781025085

After the implementation of SIM card real-name registration in China, scam calls can accurately state your personal information.

XYen0n · 2026-06-04T05:48:31 1780552111

DNS is merely one implementation of service discovery; even without DNS, some other form of service discovery would still be needed.

louwrentius · 2026-06-04T06:12:44 1780553564

Why would some form of service discovery be required? No need to discover things if you can push said information in configuration updates using tools like Ansible, pyinfra, and so on?

protocolture · 2026-06-04T06:31:25 1780554685

How does your convoluted Ansible system know which systems and services to maintain?

If it's a list of IP addresses, then having a list of IP addresses is a crude service discovery protocol.

Tasking developers (because let's be absolutely clear, the idea of removing DNS from production environments is something only a developer could come up with; no competent engineer would ever raise it) with maintaining ordered lists of servers to keep updated is only going to overcomplicate things.

And yes your hosts file is another example of a list.

trumpdong · 2026-06-05T11:58:22 1780660702

How does your DNS system know? There's always a list of all systems somewhere.

XYen0n · 2026-06-09T03:03:24 1780974204

I think the essential difference between these two approaches lies in whether the complexity is placed at update time or query time. When updates are infrequent, or when the number of machines that need to apply the update is small, this is certainly reasonable.
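To make the update-time versus query-time distinction concrete, here is a minimal sketch. The `registry` dict, the service names, and the addresses are all made up for illustration; the point is only that both approaches start from the same list and differ in when it is consulted.

```python
# Hypothetical inventory: the "list of all systems" that exists either way.
registry = {
    "db": ["10.0.0.5"],
    "cache": ["10.0.0.7", "10.0.0.8"],
}


def render_hosts_file(registry):
    """Update-time approach: render a static file that is pushed to every machine."""
    lines = []
    for name in sorted(registry):
        for addr in registry[name]:
            lines.append(f"{addr} {name}")
    return "\n".join(lines) + "\n"


def resolve(registry, name):
    """Query-time approach: look the name up when a client asks for it."""
    return list(registry[name])


print(render_hosts_file(registry))
print(resolve(registry, "cache"))
```

With the first function, every change to `registry` means re-rendering and redistributing the file; with the second, only the registry itself changes and clients see the new answer on their next lookup.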

throwway120385 · 2026-06-04T20:11:32 1780603892

What you're asking is akin to "why do people sometimes need a boat to get from point A to point B? All I've used are cars and I think that should be fine."

kube-system · 2026-06-04T20:10:33 1780603833

You probably don't need service discovery if your entire infrastructure is small enough that the whole thing is deployed with Ansible.

XYen0n · 2026-05-05T19:10:12 1778008212

> Linux is GPL'ed and the name Linux is also trademarked. But if I decided to port it to run on a lava lamp, what would be wrong with my calling the project "Linux for Lava Lamp"?

You can do this not because Linux is GPL, but because Linus Torvalds has authorized certain uses of this trademark in some form; I could not find specific information for Linux, but the Linux Foundation provides a reference: https://www.linuxfoundation.org/brand-guidelines

kelvinjps10 · 2026-05-05T19:56:22 1778010982

Arch Linux mentions this on their website. I think that distros that use the Linux name asked for permission to use it.

XYen0n · 2026-05-05T18:51:20 1778007080

The only thing a model can output is tokens; to achieve this, a tool that converts tokens into operational transformations is required. For example, I have an ast-grep skill that instructs the model to generate ast-grep rules and run ast-grep to perform file modifications.

basch · 2026-05-05T20:27:46 1778012866

I am saying to directly output the operational transformation instructions as the tokens. You’re essentially telling it to “write the diff” and then applying the patch.

[retain(8), delete(6), insert("very very"), retain(10)]
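A rough sketch of how an operation list like the one above could be applied to a string. The `(kind, arg)` tuple encoding and the `apply_ops` name are assumptions made for this example, not the format of any particular model or editor.

```python
def apply_ops(text, ops):
    """Apply retain/delete/insert operations to text, left to right.

    Each op is a (kind, arg) tuple: retain and delete take a character
    count, insert takes the string to add at the current position.
    """
    out = []
    pos = 0  # cursor into the original text
    for kind, arg in ops:
        if kind == "retain":
            out.append(text[pos:pos + arg])
            pos += arg
        elif kind == "delete":
            pos += arg
        elif kind == "insert":
            out.append(arg)
        else:
            raise ValueError(f"unknown op: {kind}")
    if pos != len(text):
        # The ops must account for every character of the input.
        raise ValueError("ops do not cover the whole input")
    return "".join(out)


ops = [("retain", 8), ("delete", 6), ("insert", "very very"), ("retain", 10)]
print(apply_ops("It is a really good idea", ops))
```

The length check at the end is a cheap guard against a model emitting counts that do not add up to the input length, which is the main failure mode of position-based edits.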

mike_hearn · 2026-05-06T09:01:04 1778058064

OpenAI models emit a format similar to a regular diff, but without the line numbers. Look at apply_patch

ritonlajoie · 2026-05-05T23:48:33 1778024913

There is a model on OpenRouter doing exactly this: it generates diffs. I forgot the name, though.

XYen0n · 2026-05-05T18:31:10 1778005870

GLM-5.1 does not support image input.

XYen0n · 2026-05-05T18:25:22 1778005522

The OCI manifest references the hashes of these compressed layers, and re-compressing them does not guarantee obtaining the same hash.

flakes · 2026-05-05T18:52:55 1778007175

Recompressing should be guaranteed deterministic. It’s the packing/unpacking of tar archives to/from directories on disk that leads to the non-determinism (such as timestamps and ownership metadata). If the tar is left intact, both zstd and gzip should produce byte for byte identical outputs given the same compression parameters.

XYen0n · 2026-05-05T19:56:33 1778010993

You are correct; I confused archiving with compression. However, even considering only the compression process, the same compression parameters cannot be guaranteed, as it is unknown which compression parameters the image publisher used.

flakes · 2026-05-05T21:14:48 1778015688

That's true. And regardless of compressed vs regular tar, I think the OCI format working with opaque archives is extremely limiting. I hope the industry will eventually redesign to use content addressable storage per file and have metadata to describe the layer/disk layout instead. That would allow per file deduplication, and we can use tar for just bulk transfer over the wire, rather than using tar for the data at rest.

cpuguy83 · 2026-05-05T21:36:52 1778017012

containerd 2.3 has support for erofs which does a direct import of the layer. It can even convert the tar based layers to erofs, faster than extracting the tar normally.

Also looking at block-based content store so that blocks can be deduped across images.

cpuguy83 · 2026-05-05T21:32:37 1778016757

That is not correct. You would have to use the same compression tool (and likely version) for this to match.

Old Docker discarded the compressed bits but kept some metadata about the tar so it could at least recreate it.

It also recreated the manifest on push.

flakes · 2026-05-05T21:41:57 1778017317

Thanks for the correction. I did mean given the same tooling version/parameters, but (as you and others pointed out) preserving and recreating that state is not at all straightforward.

mort96 · 2026-05-05T19:43:51 1778010231

If that's the purpose, couldn't you store the hash and throw away the compressed image?

(As others said, compression is deterministic for the same algorithm, parameters and input data)

a_t48 · 2026-05-05T20:05:34 1778011534

Zstd for example only promises determinism on the same version of the library. I've personally seen the hashes mutate between pull and export. Things like tar padding also make a difference. Really, the thing to do is to hash on the _uncompressed_ data and let compression be a transport/registry detail. That's what I've done, at least.
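A small illustration of that point using gzip from the Python standard library rather than zstd. The `layer` bytes are a made-up stand-in for a layer tar, and the `digest` helper is just for this example: the same input compressed at two levels gives different compressed digests, while the digest of the uncompressed bytes stays the same.

```python
import gzip
import hashlib


def digest(data: bytes) -> str:
    """SHA-256 digest in the usual sha256:<hex> notation."""
    return "sha256:" + hashlib.sha256(data).hexdigest()


# Stand-in for an uncompressed layer tar (hypothetical contents).
layer = b"example layer contents\n" * 1000

# Same input, same tool, different compression level.
# mtime=0 keeps the gzip header timestamp out of the comparison.
fast = gzip.compress(layer, compresslevel=1, mtime=0)
small = gzip.compress(layer, compresslevel=9, mtime=0)

print(digest(fast) == digest(small))
print(digest(gzip.decompress(fast)) == digest(gzip.decompress(small)))
```

The first comparison is over the compressed bytes, which is what a manifest that references compressed layers has to match; the second is over the uncompressed content, which does not depend on how the publisher compressed it.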

mort96 · 2026-05-05T20:26:56 1778012816

I didn't know that about zstd, that's a bit unfortunate.

Tar isn't related here though, we're talking about compression not archival formats

thaJeztah · 2026-05-06T00:16:06 1778026566

Yes, compression being part of the OCI image's digest was (in hindsight) a poor decision. _Technically_ OCI images allow uncompressed layers, and the layers could be included without compression (and transport compression to be used); this would allow layers to be fully reproducible. We explored some options to do this (and made some preparations; https://github.com/containerd/containerd/pull/8166), but also discovered that various implementations of registry clients didn't handle transport-compression correctly (https://github.com/distribution/distribution/pull/3754), which could result in client either pulling the full, uncompressed, content, or image validation failing.

a_t48 · 2026-05-06T01:29:06 1778030946

For my registry fork/custom pull client I hash on the uncompressed content and store as compressed under the uncompressed digest. This lets me have my cake and eat it, too - compression free digests, smaller storage costs, be able to set consistent compression settings, have the ability to spend extra CPU to recompress on the backend without breaking hashes, etc. I control both pull client and registry, so it works.
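A toy sketch of that idea, not the actual registry code: the class name, its methods, and the use of zlib are all assumptions for illustration. Blobs are keyed by the digest of the uncompressed bytes, so the stored representation can be recompressed without the key changing.

```python
import hashlib
import zlib


class UncompressedDigestStore:
    """In-memory blob store keyed by the SHA-256 of the uncompressed bytes."""

    def __init__(self):
        self._blobs = {}  # hex digest -> compressed bytes

    def put(self, data: bytes, level: int = 6) -> str:
        key = hashlib.sha256(data).hexdigest()
        self._blobs[key] = zlib.compress(data, level)
        return key

    def get(self, key: str) -> bytes:
        data = zlib.decompress(self._blobs[key])
        # Verify on read: the key is a digest of the content itself.
        if hashlib.sha256(data).hexdigest() != key:
            raise ValueError("digest mismatch")
        return data

    def recompress(self, key: str, level: int) -> None:
        """Change the stored compression level; the key stays the same."""
        self._blobs[key] = zlib.compress(self.get(key), level)


store = UncompressedDigestStore()
key = store.put(b"layer bytes " * 500, level=1)
store.recompress(key, level=9)
print(store.get(key) == b"layer bytes " * 500)
```

As the comment notes, this only works when the same party controls both the client and the registry, since standard clients expect the digest to cover the compressed blob.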

cpuguy83 · 2026-05-05T21:38:04 1778017084

The whole reason is that compression is not deterministic across tooling.

XYen0n · 2026-04-30T04:39:11 1777523951

Unfortunately, it appeared too late, and the relevant support is now far less complete than that for `X-Forwarded-*`.

XYen0n · 2026-04-28T03:17:36 1777346256

Amp Free provides $10 free credits every day. Unfortunately, new applications have now been closed.

XYen0n · 2026-04-11T08:35:31 1775896531

Even human developers are unlikely to have only ever seen GPL-2.0-only code.

tmalsburg2 · 2026-04-11T10:18:24 1775902704

Humans will not regurgitate longer segments of code verbatim. Even if we wanted to, we couldn’t do it because our memory doesn’t work that way. LLMs, on the other hand, can totally do that, and there’s nothing you can do to prevent it.

johanyc · 2026-04-11T14:35:46 1775918146

LLMs can, but do they? Is there any evidence that they spit out a piece of code verbatim without being explicitly prompted to do so? In NYT v. OpenAI, for example, NYT intentionally prompted to circumvent OpenAI's guardrails to show NYT articles.

XYen0n · 2026-03-31T04:39:15 1774931955

If everyone avoids using packages released within the last 7 days, malicious code is more likely to remain dormant for 7 days.

otterley · 2026-03-31T04:42:20 1774932140

What do you base that on? Threat researchers (and their automated agents) will still keep analyzing new releases as soon as they’re published.

mike_hearn · 2026-03-31T08:33:55 1774946035

Their analysis was triggered by open source projects upgrading en-masse and revealing a new anomalous endpoint, so, it does require some pioneers to take the arrows. They didn't spot the problem entirely via static analysis, although with hindsight they could have done (missing GitHub attestation).

narrator · 2026-03-31T09:14:43 1774948483

A security company could set up a honeypot machine that installs new releases of everything automatically and have a separate machine scan its network traffic for suspicious outbound connections.

mike_hearn · 2026-03-31T14:16:33 1774966593

The problem is what counts as suspicious. StepSecurity are quite clear in their post that they decide what counts as anomalous by comparing lots of open source runs against prior data, so they can't figure it out on their own.

PunchyHamster · 2026-03-31T12:25:09 1774959909

The fact that threat researchers, and especially their automated agents, are not all that good at their jobs.

zwily · 2026-03-31T13:19:52 1774963192

Those threat researchers and their autonomous agents caught this axios release.

staticassertion · 2026-03-31T11:01:23 1774954883

> What do you base that on?

The entire history of malware lol

otterley · 2026-03-31T14:04:23 1774965863

Can you elaborate? Why do you believe that motivated threat hunters won’t continue to analyze and find threats in new versions of open source software in the first week after release?

staticassertion · 2026-03-31T14:08:17 1774966097

Attackers going "low and slow" when they know they're being monitored is just standard practice.

> Why do you believe that motivated threat hunters won’t continue to analyze and find threats in new versions of open source software in the first week after release?

I'm sure they will, but attackers will adapt. And I'm really unconvinced that these delays are really going to help in the real world. Imagine you rely on `popular-dependency` and it gets compromised. You have a cooldown, but I, the attacker, issue "CVE-1234" for `popular-dependency`. If you're at a company you now likely have a compliance obligation to patch that CVE within a strict timeline. I can very, very easily pressure you into this sort of thing.

I'm just unconvinced by the whole idea. It's fine, more time is nice, but it's not a good solution imo.

otterley · 2026-03-31T14:23:48 1774967028

What, in your view, is a better solution?

staticassertion · 2026-03-31T15:12:17 1774969937

There are many options. Here's a post just briefly listing a few of the ones that would be handled by package managers and registries, but there are also many things that would be best done in CI pipelines as well.

https://news.ycombinator.com/item?id=47586241

cozzyd · 2026-03-31T04:41:37 1774932097

that's why people are telling others to use 7 days but using 8 days themselves :)

wongarsu · 2026-03-31T10:33:46 1774953226

brb, switching everything to 9 days

johnisgood · 2026-03-31T13:18:13 1774963093

That is 3D chess level type shit. xD

MetaWhirledPeas · 2026-03-31T15:18:54 1774970334

You don't have to be faster than the bear, you just have to be faster than the other guy.

porridgeraisin · 2026-03-31T07:34:25 1774942465

Genius

jmward01 · 2026-03-31T04:43:35 1774932215

I suspect most packages will keep a mix of people at 7 days and those with no limit. That being said, adding jitter by default to these features would be good.
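One way jitter could look, as a sketch only. The function names, the `client_id` parameter, and the 7-day base with up to 3 extra days are assumptions for this example, not how any package manager implements its cooldown. The offset is derived from a hash of the client identifier so that each client gets a stable but different cooldown.

```python
import hashlib
from datetime import datetime, timedelta, timezone


def cooldown_for(client_id: str, base_days: int = 7, jitter_days: int = 3) -> timedelta:
    """Base cooldown plus a stable per-client offset of less than jitter_days."""
    h = hashlib.sha256(client_id.encode()).digest()
    fraction = int.from_bytes(h[:8], "big") / 2**64  # value in [0, 1)
    return timedelta(days=base_days + fraction * jitter_days)


def is_installable(published_at: datetime, now: datetime, client_id: str) -> bool:
    """True once the release is older than this client's cooldown."""
    return now - published_at >= cooldown_for(client_id)


now = datetime(2026, 3, 31, tzinfo=timezone.utc)
print(is_installable(now - timedelta(days=6), now, "build-host-1"))
print(is_installable(now - timedelta(days=10), now, "build-host-1"))
```

A hash-based offset is a design choice here; drawing a fresh random offset per release would spread adoption in the same way but would make a given client's behavior harder to reproduce.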

Barbing · 2026-03-31T06:16:04 1774937764

>adding jitter by default would be good

This became evident, what, perhaps a few years ago? Probably since childhood for some users here but just wondering what the holdup is. Lots of bad press could be avoided, or at least a little.

DimmieMan · 2026-03-31T04:41:16 1774932076

They’re usually picked up by scanners by then.

bakugo · 2026-03-31T04:46:55 1774932415

> If everyone avoids using packages released within the last 7 days

Which will never even come close to happening, unless npm decides to make it the default, which they won't.

Aurornis · 2026-03-31T04:55:37 1774932937

Most people won’t.

7 days gives ample time for security scanning, too.

3abiton · 2026-03-31T05:27:04 1774934824

This highly depends on the detection mechanism.

shreyssh · 2026-03-31T09:31:01 1774949461

[flagged]

sersi · 2026-03-31T09:38:51 1774949931

But wouldn't the type of people who notice anomalous network activity be exactly the type of people who add a 7 day delay because they're security conscious?

DrewADesign · 2026-03-31T13:57:52 1774965472

And I’ll bet a chunk of already-compromised vibe coders are feeling really on-top-of-shit because they just put that in their config, locking in that compromised version for a week.
