While most people may not see breaking the speed limit as the primary purpose of their car, the way cars are designed, especially marketed and used in everyday life normalizes and even encourages exceeding posted speeds. This makes speeding not an edge case, but a central, majority use case in practice.
Ok, that's not actually what I believe, I don't even know if you could make this argument. This is just for the arguments sake, sorry.
I think that uh they are shipping a platform through which crowdfunding can happen for AI models, which is an inherently cool idea imo.
Man, I understand the implementation might have some rough edges but that's besides the point because the idea is cool, not sure why people are almost picking up on this guy.
Maybe I am wrong, I usually am, but I have been on hackernews for almost an year and HN is usually not like this. Most comments here feel like bully comments, literally being too harsh is not necessary and just reflects our personality back imo.
I can only speak for myself and here it hits a nerve. Yes, the idea is fine, but the idea is only "crowdfund training", and not how this can be actually and practically implemented. It shows this exact ignorance of people who have no idea about anything but are sure that just with enough funding they can change the world, people just need to see how smart they are. I am not saying that this fully applies to OP, more that this is a regular occurance and can get people rather annoyed, at least it does for me, and thus lead to such harsh and blunt responses.
He "ships" a website with a gmail address and nothing substantial. I could do the same, likely better, in 20 minutes. How could I even tell this from a scam, there is nothing of substance. And the great idea is just obvious and all the painful details to make it work are completely ignored
well a great point, I have no skin in the game and genuinely just want to discuss, but you don't really have to add anything substantial in such cases.
Like, What do you expect him to add, create a distributed training system, well that is orders of magnitude inefficient than normal training where people donate their money
If you want him to get some reputation, that's fair but I have always believed in building in the public kind approach. Maybe I am wrong, but yes the website definitely might be made better and honestly I might create some checkpoints from this website like never ever use some other persons trademarks,
make the website pleasant to see
just use some mail service, its not worth it showing the gmail sign. The people mocking this is wild
These are things that are easy to do imo.
As I said in the other project, transparency feels like the key to such problem.
And honestly the fact that you could ship it in 20 minutes might be valid but I mean :/ cmon man.
What are your thoughts? I also thought of such idea and wanted to build something like this but gave up, Might build it in a year or two but what would you suggest him to do? Instead of giving him harsh responses, lets be productive since I don't care who implements my idea. I just want a place where people crowdfund models. I don't care if some patrick person builds it or I build or you build it. It should be good though
As I said in a different reply, I would not know how to address the engineering and management aspect, let alone the legal aspect, which are the biggest blockers and likely the reason this is not already done.
It's fine to develop "in the open" but this is a pitch for several million dollars, and handing that out without any credentials or track record is just not happening.
At least provide rough estimates for what is needed to get this done. What architecture? What training data? Where does the training happen? Who manages and administrates the cluster? Volunteers who try this the first time or paid experts? What solutions to failure recovery, to storage, to tracking and monitoring? Who has the last word on fundamental decisions? How will the legal component be handled? Do they already have a good law firm, how much would that cost? Will the first training be successful right away or how many iterations will be required? Can you even get access to the required GPUs at a reasonable price point? Train on older architecture? How much effort is required and planned to save cost by making training more efficient, ........
Training a model is not only expensive, but also technically challenging on a pure engineering level. Cluster management, storage, backups, access, fault recovery, and so forth. While crowdfunding training of a LLM is a nice idea, personally I would not invest in something this "uncooked". Why do you believe you are able to properly manage $5M+ and the infrastructure necessary?
I've met several people that believe if you just get the funding the rest is super easy, and it's slightly infuriating.
Apart from that, I agree with the other comments that the website looks unprofessional and llama is a bad name to use for this.
If I would want to give this a shot, I would first get engineers committed with a plan to start as soon as there's funding, set up a non-profit to handle the operation, and make sure that potential investors get the impression I knew what I was doing by providing a full plan and timeline, including addressing the legal challenges (among those, make it clear that the resulting model will be commercially usable and not sued to death. Are you planning on guaranteeing indemnification or do you want to release the model as-is? Etc)
Why do you compress the executable? I mean this is a fun part for size limit competitions and malicious activities (upx often gets flagged as suspicious by a lot of anti virus, or at least it used to), but otherwise I do not see any advantage other than added complexity.
Also interesting that "ultra lightweight" here means no error reporting, barely checking, hardcoding, and magic values. At least using tty color escape codes, but checking if the terminalm supports them probably would have added too much complexity......
Yes, it is fun to create small but mighty executables. I intentionally kept everything barebones and hardcoded, because I assumed if you are interested in using Agent-C, you will fork it an make it your own, add whatever is important to you.
This is a demonstration that AI agents can be 4KB and fun.
You should still not compromise on error reporting, for example. The user would not know if a failure occurs because it can't create the /tmp file, or the URL is wrong, or DNS failed, or the response was unexpected etc. These are things you can lose hours to troubleshooting and thus I would not fork it and make my own if I have to add all these things.
I also disagree that it's small but mighty, you popen curl that does the core task. I am not sure, but a bash script might come out even smaller (in particular if you compress it and make it self expanding)
From what I've seen he hasn't actually addressed any bugs or features in xorg... ever. I'm not sure if that even grants the title of "maintainer". Shuffling code around is fun and easy, but not necessarily productive.
a) noone looks at the code
b) they make random user-facing changes
c) random churn that makes the code better (preferably with as few user-facing modifications as possible)
IMO the best option is c). Guy is literally doing the single best thing possible.
I was consistently tempted to give emacs a fair try but after now more than 20 years of vim it got more and more difficult to even imagine betraying it. Thankfully, now with neovim I can spend hours in my config files, and with fennel I can even lisp.
I had the same idea, but no, I tried with a second, untampered one and I also got a working shell. So it does not seem to be dependent on the tamper state.
> Was my guess as well, maybe it's even possible to use it to flash new keys so the device can be used again?
What keys would you flash them with? Anything encrypted with your "new" keys can't be decrypted on the other end of the transaction anyway, so what would be the point?
I understand what you are saying, but how am I supposed to know the reasoning behind it if for certain reasons I am not supposed to know it?
In this case, I do not care why he had changed his name, I read the beginning of the article and it did not explain why this is something that needs discussion, and stopped reading when it just stated his birth name.
Speaking as someone who has made up myriad "screen names" and email addresses and forum usernames, and someone who has struggled with the validity of his own given name, I considered why it is called a "given name" and how arrogant it may be to unilaterally make one up instead. These are commonly known as "assumed names" or "aliases".
I decided that not only was it futile, after a lifetime of going by one name, to convince people to call me something else, or know me by another name, but it is also arrogant to try and manipulate those names which have been given to me by people who love me. And furthermore, how insulting and appalling it was, for me to say that I didn't like the name I was given, in fact being baptized with that name, would be a repudiation of not only my parents, but my mother church and her authority to accept the name which my parents proposed at that time.
Huh? The platinum rule is treat an individual the way they wish to treat others. So, if someone wishes to dismiss calling others by their preferred names then others can treat that individual in that manner. If it's good for the gander, it's good for the goose.