When chatGPT first came out, my first thought was to replicate it myself. But then, I have too many missing skills or lack the time for backend, frontend and deployment. So I found LAION started an initiative for open-assistant.
https://github.com/LAION-AI/Open-Assistant
So I just join them, so far its quite an active community working on pushing out the initial 0.1 version.
You can use sin as activation function, but that would require careful initialization to avoid gradient explosion as you would ended up with a lot of points where gradient is simply zero. You can refer to Implicit Neural Representations with Periodic Activation Functions for more details.
What I ended up doing was crawling news from major news services and do aggregation myself. My algorithm was simple : cluster news that was reported by multple different services or multiple reports within 36 hours. By showing multiple news titles in a same box and order them by cluster size, I could just skipped those which isn't important at all.
I agree having a good vector is important to start with. However this is not very hard to make it work, you only need to finetune some of the clip models[1] to run it well.
Disclose: I have built a vector search engine to proof this idea[2]
Do anyone know any existing effort on converting these scanned image to text corpus ( probably a new OCR model needed to be developed on these old text ) ? I think it would be more usable if they are in text form in terms of search and research purpose.
I created edgedict [0] a year ago part of my side projects. At that time this is the only open source STT with streaming capabilities. If anyone is interested the pretrained weights for english and chinese is available.
I am surprised to see Comma Two in a consumer report not to mention out perform all other automative company including Tesla (probably the older version?). I remember in one of Geohot twitch stream bragging about their driver monitoring being really hard to duped.
Same in chinese search results for programming related question, most of the time I get was some bad translation from stackoverflow or github issue. Now I had to rely on browser plugins to automatically remove these domains from search results.
For reference, Boeing 747 average speed is 909 km/h (565 mph) with a top speed 986 km/h (613 mph). However maglev train saves time on landing and coordination makes it faster compare to air travel.
And frequency/throughput. At peak hours, the Shinkansen can depart about every two minutes(!). I would regularly step into the train station five minutes before my departure. Or just buy a ticket at the station when I get there, with no reservation, since I know the trains come so often I won't have to wait long.
How long do you have to wait if you miss your flight? How certain are you of an on-time arrival? How early do you arrive at the airport, to make sure you'll catch your plane?
At one time (before 9/11) you could walk into the airport in Minneapolis and be in the air to Chicago in 20 minutes (airplanes left every 15 minutes), buy your ticket at the gate. Of course this is a busy route and the airline decided it was worth running empty seats to ensure it could happen.
There is no technical reason airplanes couldn't do this for any city pair. There are reasons that airplanes don't operate this way, but they are not because it is technically impossible. Politically you can't get rid of the security theater, and yield management probably is the most profitable way to run airplanes, but both could change if we wanted to.
Not sure it has a huge advantage if you take into account the infrastructure: the track is very difficult to build, you need land, you need to take into account every hill or mountain, build bridges and tunnels while a plane can go anywhere without any additional stuff.
Also, a new station at the center of a major city requires a tremendous amount of investment and work, and some new stations may end up outside or far from the center so you'll need time to get to the place you go anyway.
With existing stations, you'll still need new tracks, and likely larger stations anyway as traffic will go up with new lines.
Such projects are likely easier in China where the government has very effective power, not that easy to build in city centers in many other countries.
If anyone is interested in building 10Gb router on a budget you can buy Mellanox OCP NIC with an adapter from OCP to PCIe for around 5~30 USD ( depend on your location )[0]. I recently build one 10G router with 4x10Gb, 2x1Gb ports for around 200 USD using second hand PC.
So I just join them, so far its quite an active community working on pushing out the initial 0.1 version.