
jpalomaki · 2026-06-13T09:19:41 1781342381

Isn't training material the biggest problem for truly open source LLMs (ones that could compete with top-tier models)? The computation part can be solved with money, but compiling a comprehensive training set that could be freely shared and is free of copyright issues is pretty much impossible.

dorfsmay · 2026-06-13T18:39:29 1781375969

Didn't the courts decide that if it's just for learning everything is fair game?

ajdegol · 2026-06-13T09:29:05 1781342945

I wonder if we could gamify and democratise it somehow, like fold-at-home and wikipedia...

I've been training a teeny specialised model to run in a browser on a phone to detect harmonium notes played in a song (harmonium, it turns out, is a pita; another story for another day). Getting good labelled data is _all_ of the hard work.

That being said, maybe for cheap inference, using a big model to train something ultra-suited for the task at hand might be how we could handle local inference; thinking language specific models.

reedciccio · 2026-06-13T09:41:04 1781343664

You don't need to have fully copyright-unencumbered datasets to build Open Source AI, as that (as you say) would be impossible. https://opensource.org/ai

jpalomaki · 2026-06-11T07:56:25 1781164585

Do we need to bring Keybase[1] "back"? The original idea, mapping your social media presence to certain encryption keys.

In the future it will be increasingly difficult to prove in an online context that you are not a bot. Being able to show that your social media (HN, GitHub, etc.) presence goes way back would be an option.

[1] https://en.wikipedia.org/wiki/Keybase

account42 · 2026-06-11T10:13:37 1781172817

But the AI actions are already associated with a "real" pre-existing account in TFA, that didn't stop anything.

jpalomaki · 2026-06-06T14:39:46 1780756786

"yes, there were regressions in some use cases of rsync in the 3.4.3 release. I quite deliberately tried to err on the side of fixing security issues for that release, and there were some valid (but unusual) use cases that got caught up in the changes"

jpalomaki · 2026-06-05T16:30:31 1780677031

It’s sometimes convenient if the database is the only "stateful" component in the architecture.

Also, if all the "state" is in one database, then you have a better chance of getting consistent backups.

jpalomaki · 2026-06-04T22:21:41 1780611701

Not sure if this is the future I want, but I've always thought the main idea of smart glasses is to automatically bring up information that is relevant in your current context. One part of this is to recognize who you are staring at.

dylan604 · 2026-06-04T22:23:23 1780611803

It worked well for the Terminator, so why shouldn't it work for us too? Being able to identify target/foe is of course how this will be used.

However, I'd be much more inclined toward the Black Mirror use: being able to block someone literally, not just a number in your phone.

rightbyte · 2026-06-05T15:04:06 1780671846

I have some vague recollection of a sketch where the user walks around and gets popups and ads in his glasses and later removes them to discover there is a calm city life around him. Was that Black Mirror?

dylan604 · 2026-06-05T20:39:28 1780691968

That sounds like the Chappelle sketch "The Internet in Real Life"

https://www.youtube.com/watch?v=x4WuNU_0e5c

ncr100 · 2026-06-05T13:30:46 1780666246

Yep, there's an anime, Eden of the East I think, which explores this as a key motivating technology in the story.

asdff · 2026-06-05T02:18:19 1780625899

ICE contract coming soon!

jpalomaki · 2026-05-30T17:27:23 1780162043

"By integrating Ookla’s data products, including Speedtest®, Downdetector®, Ekahau®, and RootMetrics®, Accenture will help Communications Service Providers (CSPs), hyperscalers, and enterprises optimize the mission-critical Wi-Fi and 5G networks that power their digital core. [...] Ookla’s data platform is anchored by more than 250 million consumer-initiated tests per month, complemented by controlled drive, walk, and embedded testing options"[1]

[1] https://newsroom.accenture.com/news/2026/accenture-to-acquir...

simonw · 2026-05-30T17:57:53 1780163873

Is there some legal reason to scatter announcements with that many ® symbols, or do they just do it for style reasons / because they think it makes the announcement look more impressive?

kube-system · 2026-05-30T18:18:52 1780165132

Using the symbol allows them greater protection under the Lanham Act, because it counts as “notice” that the mark is registered.

Omitting it limits your ability to recover damages for infringement.

trollbridge · 2026-05-30T23:20:40 1780183240

When you are making the absurd case you’ve trademarked “speed test”, yes, you have to take pains to mark it.

notatoad · 2026-05-30T23:39:50 1780184390

i'm guessing that part of accenture's consulting business is helping people navigate the trademark registration process. so they've got to hype up the ®.

conception · 2026-05-30T18:16:20 1780164980

Legal. Gotta protect your trademarks.

Suppafly · 2026-05-31T01:28:04 1780190884

To a degree, but I've never seen anything requiring you to show a mark literally every time it's mentioned.

jpalomaki · 2026-05-25T10:43:42 1779705822

How about payments: what is the easiest way from a corporate customer's perspective in DACH? Let's say for smaller subscriptions, <1000€/month?

fakedang · 2026-05-26T01:00:21 1779757221

You missed the point of the article, which is that DACH places more importance on compliance, security and stability. Those are the first questions asked, and because they are expensive questions to answer, you have to charge more than €1000/month.

jpalomaki · 2026-05-22T07:50:15 1779436215

"Only $99 to add 5G Backup to any UniFI network" and "Fully unlocked for any compatible carrier with SIM and eSIM support". Wonder if there's some catch?

jpalomaki · 2026-05-15T00:09:42 1778803782

Tell it to create HTML summaries with diagrams and a sidebar for navigation.

Or ask Codex to create an image that explains xyz.

jpalomaki · 2026-05-07T05:24:09 1778131449

It’s likely because the quick thought is that auth is just a user table with hashed passwords.

Then when you really start thinking about it, the list of requirements grows.

Of course it’s still totally doable for an average developer, but it takes time, and mistakes can be catastrophic. And maybe the time is better spent developing stuff that differentiates you from others.
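
For illustration, the "quick thought" version might look something like this rough Python sketch. Everything in it is an assumption made for the example: the in-memory `users` dict standing in for the user table, the `register` and `verify` helper names, and the PBKDF2 iteration count, which is illustrative rather than a recommendation.

```python
import hashlib
import hmac
import os

# Hypothetical stand-in for the "user table": username -> (salt, password_hash).
users = {}

# Illustrative work factor only; pick a value based on current guidance.
ITERATIONS = 200_000


def hash_password(password: str, salt: bytes) -> bytes:
    """Derive a hash from the password with a per-user salt (PBKDF2-HMAC-SHA256)."""
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, ITERATIONS)


def register(username: str, password: str) -> None:
    """Store a new user with a random salt and the derived hash, never the password."""
    if username in users:
        raise ValueError("username already taken")
    salt = os.urandom(16)
    users[username] = (salt, hash_password(password, salt))


def verify(username: str, password: str) -> bool:
    """Return True only if the user exists and the password matches."""
    record = users.get(username)
    if record is None:
        return False
    salt, expected = record
    # Constant-time comparison avoids leaking how many bytes matched.
    return hmac.compare_digest(hash_password(password, salt), expected)
```

Even this much leaves out sessions, password resets, rate limiting and account lockout, which is roughly where the list of requirements starts to grow.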
