For the best experience on desktop, install the Chrome extension to track your reading on news.ycombinator.com
Hacker Newsnew | past | comments | ask | show | jobs | submit | history | staringmonkey's commentsregister

Enigma Technologies | Director, Data Acquisition & Strategy | Open to all US cities / Remote | Full-Time

Have you parsed a long-forgotten mainframe format? Scraped an ASP.net site? Visited null island? Managed an amazing team? Enigma is searching for a talented Director of Data Acquisition & Strategy to help us grow our data acquisition efforts. This individual will be establishing our first office outside of NYC and staffing it with an entirely new team!

Read more about the role and apply on our website: https://www.enigma.com/careers/director-data-acquisition-str...

Send questions to: chris.groskopf@enigma.com

Enigma, a New York-based data and technology startup, partners with global companies to transform the way the world uses data. Enigma’s technology connects data across many internal and external sources, empowering organizations to unlock new opportunities and solve their most complex challenges, from combating money laundering to enhancing drug safety.


Journalists have some problems that tend to be somewhat peculiar to their jobs. Some examples:

* A mix of heterogeneous and often internally inconsistent data.

* A lot of data that is categorical, free text or otherwise non-numerical.

* A need to be robust that is not always accompanied by the time necessary to become an expert programmer.

I'm sure some other folks have these problems too, though I can't think of any other industry where folks would touch as diverse a range of data as we do.

If works for other niches, great! But I'm a journalist and I had journalism problems in mind when I built it. I can't speak to the needs of folks in science, finance or what have you.


I'd throw history into the ring for something that would be similar in variety of sources, types of data, etc.

I'm looking forward to checking out the library!


Yes, good point! There's a lot of crossover there and also with digital humanities folks.


+1 for digital humanities folks. Your emphasis on well written documentation is a strong argument for agate over more powerful, but more confusing, data processing libraries. I'm already thinking about using agate in my digital humanities workshops!


Hello! Author of the library here. Just want to point out that I am a journalist and very active in the data journalism community. (6+ years) Both the sites you name-check have journalists who do production online graphics that don't go through the traditional Illustrator workflow and news organizations are increasingly discarding that antiquated pattern. (I've made a hundred graphics for NPR and I don't even have a copy of Illustrator installed.)

To your second point, that's fine. The most common feedback I've gotten is "I don't see what purpose this serves that X doesn't already fulfill!" Well okay then, you don't gotta use it. But given the fact I've done this job for years, working with the very folks who it's targeted at, I think it's probably safe to assume I've got some reasons. (Which you will find enumerated in the blog post and documentation.)


hey man, good work on agate. I like it.

I actually used to live right near the NPR offices in Crystal City.

You might know a pythonista I know who used to work there and is now doing a machine learning start-up. (his name is Greg)


Hey, (I'm the original author of copytext at NPR) this looks like a really nice lib. FWIW, we specifically don't do it this way, because during development we're probably refreshing our page thousands of times and we don't want the delay of hitting the Google servers for every request.

Incidentally, that's also why we don't request the spreadsheet in Javascript--the lag time to render the page is too high.

We've found our approach gives us a nice balance of "real time" and "practical".

Thanks for the comment.


Thank you for open sourcing it!


@all I'm the author of the post (@onyxfish). I'm not even remotely interested in debating its merits here. You all seem well-equipped to form your own judgements. However, for the sake of clarity, I wrote a __blog__ post that got __aggregated__ by The Atlantic. The original post is at http://hacktyler.com. It wasn't written for anyone but those who know me, but people liked it so its become public. Think what you will.


What did the Atlantic offer you. Just a platform to republish and reach a wider audience?


Yes.


I'm the hacker in question and its the former ad I replied to. And yes, I'm quite gainfully and meaningfully employed with the Tribune. And no, I'm not full of shit (at least I wasn't on that occasion.) Glad folks are enjoying/inspired by it.


The ad used the work `grok'. So there was a pretty high chance that the tone of your cover letter would be well received. Good perception!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search:

HN For You