@_Blef Profile picture

Christophe

@_Blef

Co-founder https://t.co/IX0iSSkVYV — I write the best data newsletter according to me → https://t.co/692IOQ0mxq

Similar User
Anna Geller photo

@anna__geller

Lightdash photo

@lightdash_devs

sbalnojan photo

@sbalnojan

Simon Späti 🦋 photo

@sspaeti

Metaplane photo

@metaplane

Sarah Krasnik Bedell photo

@sarahkb125

ABC photo

@Ubunta

mariah does data photo

@mariahjrogers

Ananth Packkildurai photo

@ananthdurai

john kutay photo

@JohnKutay

Kelly Burdine photo

@KellyJBurdine

David Jayatillake photo

@DSJayatillake

Salma Bakouk photo

@SalmaBakouk

Kaxil Naik photo

@kaxil

Brian Au photo

@brianau

Not that I was posting a lot here lately. But decided to switch to other blue thing. My tag there is blef.fr


Christophe Reposted

Vous avez vu aussi arriver dans nos équipes des data scientists, data analysts, data engineers ... Les connaissons nous vraiment ? Avec Christophe Blefari (@_Blef), Data Engineer ▶️ ifttd.io/episodes/data-… #dev #DATA #DataScience #dataengineer #dataanalyst


Then the same companies in 1 year: "we can't find any good junior, they are inexperienced, what a waste of time"

From a company update. AI bubble or not, seeing this. A lot. It’s coming.

Tweet Image 1


Christophe Reposted

The Awesome DuckDB repository is short of 15 stars to hit 1k! ⭐️ github.com/davidgasquez/a… I'ts amazing to see all the things being built on top of the awesome DuckDB.


Not sure I spoke about it here, I'm co-organizing a conference on Nov 25th in Paris this year. We have a CfP still open until mid-July and we sold half of the tickets. (The conf gonna be French-first but English-friendly) forward-data-conference.com


Takeaways about Databricks, Snowflake and Iceberg. Why this is just a natural evolution of both platforms and why it only makes Databricks a data warehouse in kit bridging the gap with Snowflake. blef.fr/databricks-sno…


Christophe Reposted

@duckdb is getting everywhere, even in your browser! But what can you do with DuckDB Wasm? Read an easy-to-follow overview by @mehd_io, not only do we look at some practical use cases from companies using DuckDB Wasm, but there's also a fun example 👇 👨‍💻Mehdi crafted a Firefox…


who tried unity catalog with pyiceberg? can't make it work


Christophe Reposted

Here is another batch of great Real Life Data Engineering™ projects to learn from. - Mozilla: github.com/mozilla/bigque… - Our World in Data: github.com/owid/etl - Ibis: github.com/ibis-project/i… - Open Source Observer: github.com/opensource-obs…

If you want to see how Real Life Data Engineering™ projects look like, these are some of my favs! - GitLab: gitlab.com/gitlab-data/an… - Mattermost: github.com/mattermost/mat… - MIT Open Learning: github.com/mitodl/ol-data… - Catalyst Cooperative: github.com/catalyst-coope… Share yours!🙏



what if you could search into youtube videos?


Christophe Reposted

The reality of LLM training data collection. This is a reason more and more sites w large amounts of content will attempt to block these bots, rate limit them, and put up signup walls or paywalls. 2023 might have been “peak open internet” (at least for user-generated content)

Almost 500 Claude guests within 7 min activity. Over 2 million page requests in all, before we stopped it.

Tweet Image 1


Christophe Reposted

It’s hard to keep up with changes in the data world. I found these 3 newsletters to be a great weekly overview of the happenings. 1. @ananthdurai’s @data_weekly 2. @_Blef blef.fr newsletter 3. @criccomini’s @getmaterialized


Christophe Reposted

Here's a little write-up about how accessing (for free) a training set for bike sharing forecasting, using data I'm scrapping from >50 cities every 15min: maxhalford.github.io/blog/bike-shar…


bah github


A Keycloak expert out there than can help me?


great batch of links this week

🚀 Data Eng Weekly #165 is out: Dive into Intuit's GenAI for faster SQL, PySpark's 2023 review, Uber & Netflix's scalability solutions, AWS's real-time AI streaming, and more! dataengineeringweekly.com/p/data-enginee… 💡📊 #DataEngineering #AI #BigData #TechTrends



I'll skip the Data News this week, it was not possible to put words on the paper this week


What comes to your mind when you think about data formats?

Tweet Image 1

I think we've found the best interface to write Selenium tests.

Introducing the 01 Developer Preview. Order or build your own today: openinterpreter.com/01 The 01 Light is a portable voice interface that controls your home computer. It can see your screen, use your apps, and learn new skills. This is only the beginning for 01— the…



Loading...

Something went wrong.


Something went wrong.