More

doppenhe · 2026-04-01T04:25:33 1775017533

very cool , i wish it had multi screen mode

doppenhe · 2025-05-01T14:08:34 1746108514

Very cool, does `npx hyperparam dataset.parquet` phone home?

platypii · 2025-05-01T14:10:23 1746108623

Zero telemetry, fully local. It spawns `http-server` on port 2048 and opens your browser at `localhost`. Similar pattern as Jupyter Notebooks. Feel free to audit the code... the server is <200 LOC.

doppenhe · on June 25, 2024

Could HyParquet's approach be extended to other data formats beyond Parquet?

platypii · on June 26, 2024

I definitely think that UX is an underappreciated area for machine learning data. I want to make a set of libraries and tools that make it easier for people to work with ML data in the browser. The first step of good data science is to become one with your data.

I started with parquet because most datasets for modern LLMs are in parquet format. But there are other formats like JSONL which are common too.

doppenhe · on June 25, 2024

nice thanks for sharing

doppenhe · on Oct 10, 2023

The rapid advancement of large language models (LLMs) like ChatGPT has captured headlines and imaginations. AI systems can now generate amazingly human-like text on any topic with just a few prompts.These behemoths, with their unparalleled capabilities, have necessitated a reevaluation of governance models. As organizations explore integrating LLMs into business operations, it’s crucial to implement governance measures enabling innovation while managing risks. As executives, understanding the transition from traditional machine learning governance to LLM-centric AI governance is crucial.

doppenhe · on Oct 10, 2023

this makes me happy , I was the PM for that feature :)

doppenhe · on March 16, 2023

This is great, thanks for sharing. Key component in evolving FM based applications is making them feel as deterministic as possible vs probabilistic. Framework like this would enable generating trust in the outputs of these FMs.. exciting.

doppenhe · on Feb 9, 2023

Author here would love to discuss with the community

doppenhe · on Aug 26, 2020

hi all creator here. We built this version of our product focused on dynamic data science teams that just wanted to be able to deploy, scale and run their models without worrying about ops. Some more details:

https://algorithmia.com/developers/teams

doppenhe · on July 7, 2020

Deployment, inference and management can participate in this as well!

Here is the missing part for a total e2e solution: https://github.com/marketplace/actions/algorithmia-ci-cd

{disclaimer, we built this Github action}

davidortega · on July 14, 2020

Hi doppenhe, we have that part already implemented using cml-send-github-check and dvc metrics diff. You can compare the metric that you prefer with dvc and then just set the status of the github check uploading your full report. Of course, you can also fail the workflow as your Github action does, but I think is more useful to see it as a report in the check.

disclaimer: I'm work with CML

rhythmvertigo · on July 7, 2020

coooool! going to try this out :)

HN For You