More

nalgeon · on April 30, 2024

I like to explain things by example, and interactive examples are even better. So I built a sandbox server that lets you run virtually any software. And a widget to easily add interactive code examples to any kind of technical documentation.

You can use it to build your own sandboxes, or use existing ones to write interactive guides like these: https://github.com/nalgeon/tryxinyminutes

mtlynch · on April 30, 2024

Cool to see this here, Anton!

If you can still edit your title, I suggest prefixing this post with "Show HN: " so that it's obvious that you're the author sharing your project.

nalgeon · on April 30, 2024

Thanks, Michael! I think it's pretty clear even without it :)

mtlynch · on April 30, 2024

But you also get to show up in the special category:

https://news.ycombinator.com/show

refset · on April 30, 2024

Codapi looks very slick! It's great that you kept everything generic enough to support many different backend use cases.

Coincidentally my colleagues recently built a custom version of almost this exact same interactive UX for writing SQL tutorials: https://docs.xtdb.com/tutorials/immutability-walkthrough/par...

If only you had launched sooner :)

nalgeon · on April 26, 2024

Sure! I really like your writing.

nalgeon · on April 14, 2024

Great tips, thank you! The thing is, getting maximum throughput is not the goal of the project (at least not at this stage). I'm using reasonable SQLite defaults (including WAL), but that's it for now.

nalgeon · on April 14, 2024

Thank you! I also think that the relational model can get you pretty far if you don't need to squeeze every last bit of performance out of the program. And the added benefit of using a battle-tested SQL engine is far fewer storage-related bugs.

nalgeon · on April 14, 2024

Yeah, it's explained in the code[1]

SQLite only allows one writer at a time, so concurrent writes will fail with a "database is locked" (SQLITE_BUSY) error.

There are two ways to enforce the single writer rule:

1. Use a mutex for write operations.

2. Set the maximum number of DB connections to 1.

Intuitively, the mutex approach seems better, because it does not limit the number of concurrent read operations. The benchmarks show the following results:

- GET: 2% better rps and 25% better p50 response time with mutex

- SET: 2% better rps and 60% worse p50 response time with mutex

Due to the significant p50 response time mutex penalty for SET, I've decided to use the max connections approach for now.

[1]: https://github.com/nalgeon/redka/blob/main/internal/sqlx/db....

Sytten · on April 14, 2024

This is really not true in WAL mode with synchronous NORMAL, this was only true with the default journal mode and a lot of people are misusing sqlite because of that. You still have one writer at a time but you wont get the SQLITE_BUSY error.

You can check the documentation [1], only some rare edge cases return this error in WAL. We abuse our sqlite and I never saw it happen with a WAL db.

[1] https://www.sqlite.org/wal.html#sometimes_queries_return_sql...

nasretdinov · on April 14, 2024

How about having two pools, one for writes only, and the other one for reads? SQLite allows you to open the DB more than in one thread per application, so you can have a read pool and a write pool with SetMaxConnections(1) for better performance. This of course also means that reads should be handled separately from writes in the API layer too.

nalgeon · on April 14, 2024

Thought about it, decided to start with simpler and good enough option. The goal here is not to beat Redis anyway.

nasretdinov · on April 14, 2024

Well I agree, that's a good starting point. You probably won't be able to beat Redis with SQLite anyway :), although given that WAL mode allows for concurrent reads it might give it a large enough performance boost to match Redis in terms of QPS if the concurrency is high enough.

acrispino · on April 15, 2024

for what it's worth, the two pool approach is suggested here by a collaborator to github.com/mattn/go-sqlite3: https://github.com/mattn/go-sqlite3/issues/1179#issuecomment...

avinassh · on April 15, 2024

Have you tried setting BUSY_TIMEOUT and benched it? I believe it would be better too.

https://www.sqlite.org/pragma.html#pragma_busy_timeout

kiitos · on April 14, 2024

> The benchmarks show the following results

Where are the benchmarks?

nalgeon · on April 14, 2024

Can't fit everything in 1.0, has to start with something. If the community is interested in the project, there will be more.

nalgeon · on April 14, 2024

I'm a big fan of both Redis and SQLite, so I decided to combine the two. SQLite is specifically designed for many small queries[1], and it's probably as close as relational engines can get to Redis, so I think it might be a good fit.

[1]: https://sqlite.org/np1queryprob.html

b33j0r · on April 14, 2024

I love this, it’s the solution that makes sense for 90% of the times I have used redis with python.

I’ve made several versions of this, and to be honest, it ended up being so straightforward that I assumed it was a trivial solution.

This is pretty well-planned. This is 100% the way to go.

Heh. I took a detour into making my idea of “streams” also solve event sourcing in native python; dumb idea, if interesting. Mission creep probably killed my effort!

Nice work

sesm · on April 14, 2024

What are the project goals? I assume it's a drop-in replacement for Redis that is supposed to be better in certain cases? If yes, then what cases do you have in mind?

nalgeon · on April 14, 2024

The goal is to have a convenient API to work with common data structures, with an SQL backend and all the benefits it provides. Such as:

— Small memory footprint even for large datasets.

— ACID transactions.

— SQL interface for introspection and reporting.

sesm · on April 15, 2024

So the goal is to have a Redis-like API but not actually be an in-memory data store and reduce memory consumption this way? For example, for a project/service that started with Redis, but then priorities shifted and small memory footprint became more important than performance? Did I get it right?

grobbyy · on April 15, 2024

I can tell you my use case. I have a lot of systems which I'd like to use on a single computer AND at scale.

That's sometimes even just for development work.

A lot of these use a common API to a more complex distributed store, as well as to something simple like files on disk, in memory, or SQLite.

I'm most cases, it's one user at a time, so performance doesn't matter, but simplicity does.

It can also be for the project which has a 1 percent chance of going viral.

Etc. But I find relatively few cases between truly small scale and large scale.

sesm · on April 15, 2024

Local dev drop-in replacement for Redis seems like a really good usecase, thanks!

justinclift · on April 15, 2024

That's pretty cool. Reckon it would work with existing code that calls Redis over the wire for RQ?

  https://python-rq.org

This RQ stuff has been a pain with a recent project because only Python seems to use it, so once an RQ job has been submitted only Python based things can do anything with it. :(

If Redka works as a backend replacement, we could potentially have non-Python things check the SQLite database instead.

nalgeon · on April 15, 2024

It works with redis-py (which python-rq uses), but I doubt it will be any good in this case. python-rq seems to use Lua scripting in Redis, which is not planned for 1.0. I'd rather not add it at all, but we'll see.

justinclift · on April 15, 2024

No worries. :)

Spivak · on April 15, 2024

Highly recommend Faktory (by the folks who brought you Sidekiq) as a language agnostic job server.

justinclift · on April 15, 2024

Thanks, looks potentially interesting: https://github.com/contribsys/faktory

And it's written in Go. :)

yawaramin · on April 15, 2024

Any plans to support Redis' memory-based eviction policy eg `volatile-lru`? To me a great benefit of Redis is its guarantee of stable memory usage.

nalgeon · on April 14, 2024

Did I?

> Both in-process (Go API) and standalone (RESP) servers.

In-process means that the database is "embedded / clientside" in your terms.

jhatemyjob · on April 14, 2024

It's a server.

orthecreedence · on April 15, 2024

Do you usually embed redis?

jhatemyjob · on April 18, 2024

I would like to.

orthecreedence · on April 25, 2024

You want to use an in-mem database like SQLite or DuckDB or even replicate the data structures in-code that redis uses. Redis doesn't really add a whole lot to the table over a general programming language with a good standard library other than being shared over a network.

jhatemyjob · on April 25, 2024

Exactly, the standard library is what I'm after. I am currently using SQLite but I prefer the redis interface

nalgeon · on April 1, 2024

There's also an interactive "Git by Example" guide I recently wrote on a similar topic. It has a playground for each of the commands mentioned in Martin's article (and many others):

— switch

— restore

— sparse-checkout

— worktree

— bisect

https://antonz.org/git-by-example/

nalgeon · on March 7, 2024

You are literally living the dream!

HN For You