For the best experience on desktop, install the Chrome extension to track your reading on news.ycombinator.com
Hacker Newsnew | past | comments | ask | show | jobs | submit | history | jpalomaki's commentsregister

Isn't training material the biggest problem for truly open source LLMs (such that could compete with top tier models)? The computation part can be solved with money, but compiling a comprehensive training set that could be freely shared and free of copyright issues is pretty much impossible.

Didn't the courts decide that if it's just for learning everything is fair game?

I wonder if we could gamify and democratise it somehow, like fold-at-home and wikipedia...

I've been training a teeny specialised model to run in a browser on a phone to detect harmonium notes played in a song (harmonium turns out is a pita, another story for another day), getting good labelled data is _all_ of the hard work.

That being said, maybe for cheap inference, using a big model to train something ultra-suited for the task at hand might be how we could handle local inference; thinking language specific models.


You don't need to have fully copyright-unencumbered datasets to build Open Source AI, as that (as you say) would be impossible. https://opensource.org/ai

Do we need to bring Keybase[1] "back"? The original idea, mapping your social media presence to certain encryption keys.

In the future it will be increasingly difficult to prove in online context that you are not a bot. Being able to show that your social media (HN, GitHub, etc) presence goes way back would be an option.

[1] https://en.wikipedia.org/wiki/Keybase


But the AI actions are already associated with a "real" pre-existing account in TFA, that didn't stop anything.

"yes, there were regressions in some use cases of rsync in the 3.4.3 release. I quite deliberately tried to err on the side of fixing security issues for that release, and there were some valid (but unusual) use cases that got caught up in the changes"

It’s sometimes convenient if database is the only ”stateful” component in architecture.

Also if all the "state" is in one database, then you have better chance of getting consistent backups.


Not sure if this is the future I want, but I've always thought the main idea of smart glasses is to automatically bring up information that is relevant in your current context. One part of this is to recognize who you are staring at.

It worked well for the Terminator, so why shouldn't it work for us too? Being able to identify target/foe is of course how this will be used.

However, I'd be much more inclined for the Black Mirror use of being able to block someone literally not just a number in your phone.


I have some vague recollection of a sketch where the user walks around and gets popups and ads in his glasses and later removes them to discover there is a calm city life around him. Was that Black Mirror?

That sounds like the Chappelle sketch "The Internet in Real Life"

https://www.youtube.com/watch?v=x4WuNU_0e5c


Yep, there's a anime called I think it's the Eden of the East, which explores this as a key motivating technology in the story.

ICE contract coming soon!

"By integrating Ookla’s data products, including Speedtest®, Downdetector®, Ekahau®, and RootMetrics®, Accenture will help Communications Service Providers (CSPs), hyperscalers, and enterprises optimize the mission-critical Wi-Fi and 5G networks that power their digital core. [...] Ookla’s data platform is anchored by more than 250 million consumer-initiated tests per month, complemented by controlled drive, walk, and embedded testing options"[1]

[1] https://newsroom.accenture.com/news/2026/accenture-to-acquir...


Is there some legal reason to scatter announcements with that many ® symbols, or do they just do it for style reasons / because they think it makes the announcement look more impressive?


Using the symbol allows them greater protection under the Lanham act, because it counts as “notice” that the mark is registered.

Without it, it limits your ability to recover damages from infringement.


When you are making the absurd case you’ve trademarked “speed test”, yes, you have to take pains to mark it.

i'm guessing that part of accenture's consulting business is helping people navigate the trademark registration process. so they've got to hype up the ®.

Legal. Gotta protect your trademarks.


To a degree, but I've never seen anything requiring you to show a mark literally every time it's mentioned.

How about the payments, what is the easiest way from corporate customer perspective in DACH? Let's say for smaller subscriptions, <1000€/month?


You missed the point of the article, which is that DACH places more importance on Compliance, Security and Stability. Those are the first questions first and foremost, and because they are expensive questions, you have to charge more than <€1000/month.


"Only $99 to add 5G Backup to any UniFI network" and "Fully unlocked for any compatible carrier with SIM and eSIM support". Wonder if there's some catch?


Tell it to create html summaries with diagrams and sidebar for navigation.

Or ask Codex to create image that explains xyz.


It’s likely because the quick thought is that auth is just user table with hashed password.

Then when you really start thinking about it, the list of requirements grows.

Of course it’s still totally doable for an average developer, but takes time and mistakes can be catastrophic. And maybe the time is better spent developing stuff that differentiates you from others.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search:

HN For You