Not complaining too loudly because the improvement is magical, but trying to stay on top of model cards and knowing which one to use for specific cases is a bit tedious.
I think the end game is a decent local model that does 80% of the work, and that also knows when to call the cloud, and which models to call.
Yeah, mapping Chinese characters into the linear UTF-8 byte space throws a lot of information away. Each language brings some ideas for text processing: the SentencePiece inventor is Japanese, for example, and Japanese doesn't have explicit word delimiters.
It's not throwing any information away, because the text can be faithfully reconstructed (via an admittedly arduous process); no entropy has been lost if you consider the sum of both the "input bytes" and "knowledge of UTF-8 encoding/decoding".
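A quick round trip in Python makes the point (standard library only):

    text = "漢字"                          # two Chinese characters
    data = text.encode("utf-8")            # each becomes 3 bytes, 6 bytes total
    print(list(data))                      # [230, 188, 162, 229, 173, 151]
    assert data.decode("utf-8") == text    # decoding recovers the original exactly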
I've seen many comments that they are great for OCR stuff, and in my use case of receipt photo processing it does do better than ChatGPT, Claude, or Grok.
Maybe that's an apt analogy in more ways than one, given the recent research out of MIT on AI's impact on the brain, and previous findings about GPS use deteriorating navigation skills:
> The narrative synthesis presented negative associations between GPS use and performance in environmental knowledge and self-reported sense of direction measures and a positive association with wayfinding. When considering quantitative data, results revealed a negative effect of GPS use on environmental knowledge (r = −.18 [95% CI: −.28, −.08]) and sense of direction (r = −.25 [95% CI: −.39, −.12]) and a positive yet not significant effect on wayfinding (r = .07 [95% CI: −.28, .41]).
The OpenAI cookbook says LLMs understand XML better than Markdown text, so maybe that's a factor too? Although XML is more specified and structured, which HTML isn't.
> OpenAI cookbook says LLMs understand XML better than Markdown text.
Yes, for prompts. Given how little XML is out on the public internet, it'd be surprising if it also applied to data ingestion from web-scraping functions. It'd be odd if Markdown worked better than HTML, to be honest, but maybe Markdown also changes the content being served, e.g. there's no menu, header, or footer sent with the body content.
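For context, the prompt-side advice is about wrapping sections in explicit XML-ish tags. A minimal Python sketch (the tag names here are arbitrary, not anything OpenAI prescribes):

    # Made-up tag names; the point is explicit, unambiguous section boundaries.
    page_text = "Example body text extracted by a scraper."
    prompt = (
        "<instructions>Summarize the document in one sentence.</instructions>\n"
        f"<document>{page_text}</document>"
    )
    print(prompt)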
I also had a compiler-related description come to mind after using Copilot. It lets you partially generate imperative code declaratively, by writing a comment like
    // now I will get rows X, Y, Z from ContentsProvider
then tab-tab to complete. You can then even tweak the generated code, very useful!
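Something like this, as a self-contained Python sketch; the ContentsProvider stub and its data are made up just so it runs, Copilot completes against whatever API your codebase actually has:

    class ContentsProvider:
        # Hypothetical stand-in for the real data source, only here so the sketch runs.
        def __init__(self):
            self._rows = [{"X": 1, "Y": 2, "Z": 3}, {"X": 4, "Y": 5, "Z": 6}]

        def query(self, columns):
            return [{c: row[c] for c in columns} for row in self._rows]

    provider = ContentsProvider()

    # now I will get rows X, Y, Z from ContentsProvider
    rows = provider.query(columns=["X", "Y", "Z"])  # the kind of line tab-tab fills in
    print(rows)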
You need proof of layoff (離職票) to collect your unemployment benefits. It is illegal not to issue one, but the company can still cause you some pain in issuing it.
Here we have an interesting pattern where childhood is actually a brittle thing:
If you have a good childhood, you grow up into a well-integrated adult with no issues. But if you have a bad childhood, not only is the process disrupted for you, it also cascades into your family, in this case siblings. That's because childhood is so social, and often the bad kind of social.
Many non-neurotypical (or even physically disabled) children could grow up into integrated adults; it would just take somewhat more time. However, they often don't get that time, as they are isolated and sidelined by the social structure.
Adults don't care about each other or meddle in each other's affairs, but children do. That's often deleterious.
It's probably more training-compute intensive, but they can do dropout, right?
That's the strategy they used for ImageNet recognition, back when they were using supervised learning and training data was scarce.
Dropout is one strategy for regularization but doesn't guarantee avoiding overfitting, especially now that modern AI models generalize much better than they did during the ImageNet days. Many of the big LLMs use a dropout of 0.1 though.
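For reference, this is roughly how dropout shows up in a PyTorch model (a minimal sketch; the layer sizes are arbitrary):

    import torch
    import torch.nn as nn

    # Tiny MLP with dropout between layers; p=0.1 matches the rate mentioned above.
    model = nn.Sequential(
        nn.Linear(64, 128),
        nn.ReLU(),
        nn.Dropout(p=0.1),  # randomly zeroes 10% of activations during training
        nn.Linear(128, 10),
    )

    x = torch.randn(8, 64)
    model.train()        # dropout active: some activations are zeroed
    y_train = model(x)
    model.eval()         # dropout becomes a no-op at inference time
    y_eval = model(x)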