
ironchef · 2025-06-23T22:37:46 1750718266

"Here’s why: There is absolutely no benefit for you to gain by talking in an exit interview, and plenty of negative consequences to come out of it. At best you’ll be remembered as a complainer, and you may make enemies."

I guess I would counter with if I have friends there, I would like their lives to be better. If my exit interview is able to do that, then I would take that as a net positive.

jonstewart · 2025-06-23T22:43:26 1750718606

The only possible way to help is by providing positive reinforcement. “I loved working with X. Y is really killing it on her KPIs.” I am otherwise in agreement with TFA.

jmye · 2025-06-24T03:39:57 1750736397

This, for sure. I pump up good people doing good things. No one cares that I think our new time tracking sucks, or that an HR policy sent me over the edge, and they definitely don’t care that I think an SVP in a different department is going to tank the company because their metric strategy is all about finding fractions of fractions that make them look good.

But they’ll, even subconsciously, remember that I said Joe and Jane were absolute rock stars.

NBJack · 2025-06-23T23:06:19 1750719979

Then count the costs. If it is worth more to you to leave such feedback and improve their world, it's always your choice.

However, you should also be either convinced that HR gives a crap, or that any potential outcomes are acceptable, including but not limited to being moved into "unregulated attrition" status, losing the ability to be hired by the same company in the future, having your words potentially turned into a lawsuit against you, etc. Unless you have actual, legal, signed documentation in place giving you such assurances, these are all on the table.

andelink · 2025-06-23T23:35:09 1750721709

It is this sort of fear that holds society back. Individualistic thinking and a belief that one cannot make a difference anyways allows so much bad behavior to take place. With everything in life, you should always try to leave a place better than how you found it.

throwawayq3423 · 2025-06-23T23:36:44 1750721804

>I guess I would counter with if I have friends there, I would like their lives to be better. If my exit interview is able to do that, then I would take that as a net positive.

If you had any confidence your feedback would be listened to and actioned on, why would you be leaving?

iwontberude · 2025-06-23T23:01:02 1750719662

One time I got two months severance for complaining in my exit interview.

ironchef · 2025-05-23T02:52:49 1747968769

Folks often use things like LoRAs for that.

ironchef · on March 27, 2025

It's at least CUI. At _least_. I would suggest material that is operational in those regards would be secret or top secret. It had:

* timelines of kinetic resource missions
* conops including order of battle, etc.

Per normal ODNI bits, it's pretty clearly within bounds for classification.

ironchef · on Dec 5, 2024

Lots of folks out there use evidence.dev. It's a simple way to get some BI up and running without needing to deal with licensing / corporate IT, etc.

Remember all it takes is 1 employee to put that claim up there (although I do like evidence.dev).

zvr · on Dec 7, 2024

To be precise, it takes 1 employee to say "used in X". It takes corporate decision to say "used by X". And it takes a written agreement to be able to use the trademarked logo of X on your page. (I know, because I have collected more than 60 such agreements to show logos on a page).

ironchef · on June 5, 2024

You appear to be describing a classic type 2 or type 4 slowly changing dimension. (https://en.wikipedia.org/wiki/Slowly_changing_dimension )
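As a rough illustration of the type 2 idea (keep every version of a row, with validity dates, rather than overwriting), here is a minimal in-memory sketch. The `DimRow` fields, the `apply_scd2` helper, and the customer/city data are all hypothetical names made up for this example; a type 4 design would instead keep only the current row in the main table and move older versions to a separate history table.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class DimRow:
    customer_id: int                  # natural key (hypothetical)
    city: str                         # the attribute whose history we track
    valid_from: date
    valid_to: Optional[date] = None   # None marks the current version


def apply_scd2(history: list[DimRow], customer_id: int, city: str, as_of: date) -> None:
    """Close the current row and append a new one when the attribute changes."""
    current = next(
        (r for r in history if r.customer_id == customer_id and r.valid_to is None),
        None,
    )
    if current is not None and current.city == city:
        return  # nothing changed, so the current version stays open
    if current is not None:
        current.valid_to = as_of  # end-date the old version instead of overwriting it
    history.append(DimRow(customer_id, city, valid_from=as_of))


history: list[DimRow] = []
apply_scd2(history, 1, "Austin", date(2024, 1, 1))
apply_scd2(history, 1, "Denver", date(2024, 6, 1))
apply_scd2(history, 1, "Denver", date(2024, 7, 1))  # same value again: no new row
```

After these calls `history` holds two rows for the customer: a closed "Austin" version and an open "Denver" version.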

ironchef · on Nov 25, 2023

The libraries out there lack the breadth and maturity of some of the other ecosystems (as a simple example).

Or at least they did for some of my corners of the world.

ironchef · on Aug 21, 2023

Here was my situation. Occasional queries. Over a couple of petabytes of data. Customer facing so response in seconds per SLA but > 95 percent of the time the warehouse isn’t running. Cached queries from within 24 hours which don’t require the warehouse to even spin up. Our Snowflake costs were significantly less than an FTE.

Would that potentially be a situation which “running your own” doesn’t make sense?

ramesh31 · on Aug 21, 2023

>Would that potentially be a situation which “running your own” doesn’t make sense?

Look into datalake architectures. RDBMS based data warehousing is obviously not economical at the petabyte scale. But storing all that data in S3 with Delta Lake/Iceberg format and querying with Spark changes things entirely. You only pay for object storage, and S3 read costs are trivial.

ironchef · on Aug 22, 2023

> Look into datalake architectures.

Yup .. comfy with iceberg/delta/hudi

> RDBMS based data warehousing is obviously not economical at the petabyte scale.

I never said it was .. I'm simply responding to "I simply cannot understand how anyone chooses this over running your own Spark clusters with Jupyterlab". I'm trying to help you understand why folks would choose a SaaS over run your own.

> But storing all that data in S3 with Delta Lake/Iceberg format and querying with Spark changes things entirely. You only pay for object storage, and S3 read costs are trivial.

No. You don't just pay for object storage + minor S3 read costs.

You pay for operations
You pay for someone setting up conventions
You pay to not have to optimize data layouts for streaming writes
You pay to not have to discover race conditions in s3 when running multiple spark clusters writing to same delta tables
You pay to not have to discover that your partitions/clustering needs have changed based on new data or query patterns

But look .. I get it. You have chosen to optimize for cost structures in one way .. and I've chosen to optimize in a different way. In the past I've done exactly as you've said as well. I think seeking to see _why_ folks may have chosen a different path may help you understand other areas to consider in operations.

agent281 · on Aug 21, 2023

If you have petabytes of data, I don't think this article is talking about your use case.

sanderjd · on Aug 21, 2023

I think it is?

Or I guess, what data size do you think it's talking about? If you only have gigabytes of data, none of this matters, you can use anything pretty cheaply and easily. So is this article just for "terabytes" or does it go up to "hundreds of terabytes" but not "petabytes"?

agent281 · on Aug 22, 2023

Hmm, I suppose it's a bit challenging to say. I initially thought that it wasn't for the 80% smallest companies and petabytes of data probably puts you in the top 20%. (Most businesses are small businesses after all.)

However, I now realize that the biggest companies probably should manage their own data. If you're Google why would you use Snowflake?

So I don't know if you are the target audience for this blog post. It's pretty ambiguous.

sanderjd · on Aug 22, 2023

I guess I'll say what I think. I do think it is targeted at that smallest 80% of companies with some digital footprint, and also at most of the top 20%. Or more specifically, I think maybe it's targeted at like the 5th percentile to the 99th percentile. That bottom 5% probably just needs Excel, and that top 1% is probably writing or heavily modifying all their own tools.

But I'm not sure the advice is very good from the 5th percentile up to ... maybe that top 20%? A lot of the stuff in the article assumes the availability of sophisticated data architects and mature infrastructure groups that I really don't think the median company has.

agent281 · on Aug 22, 2023

I agree. Really seasoned data people are not common enough. Small companies need to buy services to lighten the load.

We both seem to have a sense of the size of companies at different percentiles. At what percentile would you put your company with petabytes of data?

sanderjd · on Aug 22, 2023

Super hard to say, so ... 80th or 90th? With very low confidence.

But I do have very high confidence that the 99th percentile is much larger than petabytes (think: what's next after "exa"), and I believe that many companies these days crack into "peta" territory.

But as I saw another comment mention, I think another, probably more important, consideration besides size in bytes is cardinality and structure. So maybe this whole classification we're doing is kind of beside the point :)

agent281 · on Aug 23, 2023

Yeah, it's hard to say with any certainty. I agree that the far end of the curve probably looks nothing like the "neighborhood" a couple percent away, relatively speaking.

I also agree that the variety of data plays a big part in its complexity. If you have a few petabytes of data, but it's really only a handful of tables, you can really hone in on the relationships. If it's a wide array of sources with many tables between them then you have some nasty problems like entity resolution.

All happy data sets are alike; each unhappy data set is unhappy in its own way.

sanderjd · on Aug 23, 2023

> All happy data sets are alike; each unhappy data set is unhappy in its own way.

Ha, gonna steal that for some doc I write someday :)

agent281 · on Aug 23, 2023

That's only fair: I stole it from Anna Karenina. :]

https://en.m.wikipedia.org/wiki/Anna_Karenina_principle#:~:t....

sanderjd · on Aug 23, 2023

Ha I know, I love that opener, despite it being super cliche to love it. Things are usually cliches for a good reason :)

ironchef · on July 16, 2023

“I can’t imagine being a traditional teacher, and having to teach kids that don’t care.”

What I’ve seen is that igniting that spark and moving folks from not caring to caring (or to realizing they can effect change) can be even more rewarding than the other mentoring being discussed.

ironchef · on May 7, 2023

There are some huge deployments out there. Snowflake (the database) has a big dependency on it for how they do metadata (for example)

johnhenry · on May 7, 2023

Also, notably it backs Deno's new KV Store. https://deno.com/blog/kv#run-locally-or-managed

eternalban · on May 7, 2023

Exactly. I saw that thing tucked away in the diagram for Snowflake when surveying the so-called lakes - by all accounts FDB is an excellent piece of tech.

ironchef · on May 1, 2023

Let’s say you work for a SaaS doing analytics. Your boss says “hey! We need to start reporting on new logos. Can you snag those from the DB?”

But what counts as a new logo? Does a pro serve engagement that doesn’t use the product count? What about a business using the SaaS but still in a trial period? Etc.

A semantic layer helps provide common, agreed-upon definitions to the business. So anyone looking for common data entities can just look those things up… and can come to published definitions (which are backed by queries to databases, data lakes, etc).

Does that help? Another example of this would be dbt.
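To make the "published definitions" idea a bit more concrete, here is a toy sketch of one shared definition that every report reuses. Everything in it is hypothetical: the `Account` fields, the `METRICS` registry, and especially the rule itself (excluding trials and pro-serve-only engagements), which is just one possible answer to the questions above, not a recommended one. A real semantic layer would back the definition with queries rather than a Python lambda.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Account:
    name: str
    uses_product: bool
    in_trial: bool


# Hypothetical published definitions: one name, one agreed-upon rule.
# The business decides the rule once; every report looks it up here.
METRICS = {
    "new_logo": lambda a: a.uses_product and not a.in_trial,
}


def count(metric: str, accounts: list[Account]) -> int:
    """Count the accounts that satisfy the published definition of `metric`."""
    rule = METRICS[metric]
    return sum(1 for a in accounts if rule(a))


accounts = [
    Account("Acme", uses_product=True, in_trial=False),
    Account("Globex", uses_product=True, in_trial=True),     # still in trial
    Account("Initech", uses_product=False, in_trial=False),  # pro serve only
]
new_logos = count("new_logo", accounts)
```

Under this assumed rule only the first account counts as a new logo; changing the definition in one place changes it for everyone who reports on it.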
