It is a bit roundabout, since it involves converting Maia models to ONNX before loading them into PyTorch, and it relies on some outdated library versions (Maia/lc0 are a little old). We were using this for transfer learning in a competition, so we needed flexibility that we didn't know how to get quickly or easily in TF.
Hope this helps.
------------------
Personal note: given your interest in chess AI and your StarCraft username, I think we would have a lot of shared interests. Feel free to reach out (info is in my profile).
If you are interested in this field (exploring the limits of neural models using formal language theory), I help run a weekly seminar on it, Formal Languages and Neural Networks:
I also found the color scheme difficult to interpret. A friend suggested interpolating between red and green depending on how close you are to the correct key. This is not too hard: red is (255, 0, 0) and green is (0, 255, 0), so you can compute a distance d (normalized to [0, 1]) and output (255 x d, 255 x (1 - d), 0) to get the interpolated color.
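A minimal sketch of that interpolation, assuming the distance d has already been normalized to [0, 1] (the function name here is just illustrative):

```python
def interpolate_color(d):
    """Map a normalized distance d in [0, 1] to an RGB triple.

    d = 0 (guess matches the key) -> pure green (0, 255, 0)
    d = 1 (farthest possible)     -> pure red   (255, 0, 0)
    """
    if not 0.0 <= d <= 1.0:
        raise ValueError("d must be in [0, 1]")
    # Red channel grows with distance, green channel shrinks; blue stays 0.
    return (round(255 * d), round(255 * (1 - d)), 0)

# Endpoints:
# interpolate_color(0.0) -> (0, 255, 0)  green, i.e. correct
# interpolate_color(1.0) -> (255, 0, 0)  red, i.e. far away
```

Intermediate distances land on muddy yellow-greens (e.g. d = 0.5 gives (128, 128, 0)), which is part of why a perceptual scheme like a heatmap can read better than a straight RGB lerp.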
Hey, I just released a new update that adds multiple color schemes to help with colorblindness! At some point I may add a colorblind mode that does away with the need for colors altogether, but for now this should make things a bit better. Also, there is now a gauge next to the board so you can more easily see the range of colors (unfortunately, it doesn't yet work on Firefox).
Check it out!
I also skipped forward a day, so if you've already played today, you can play again with the next word to try it out.
Ah, the other color schemes look really good (I especially like the heatmap one). However, the default stoplight one is nearly impossible for me to disambiguate. For example, in [1], I can't tell at all which of my guesses are closer (I don't think I am colorblind, but maybe I'm just in for a surprise today).
Anyway, overall, a very fun variant. Thanks for sharing!
They mean transduction in the sense of sequence-to-sequence models (transducing one sequence into another).
Attention-based models don't necessarily need to be sequence-to-sequence. They can be classifiers, decoder-only, etc. Attention is just one tool in the ML architecture toolkit.