Romain Lesur's Avatar

Romain Lesur

@rlesur.bsky.social

Head of data science lab @ insee.fr | Interested in data stuff

264 Followers  |  1,830 Following  |  7 Posts  |  Joined: 16.11.2024  |  2.1553

Latest posts by rlesur.bsky.social on Bluesky

Preview
The Majority AI View - Anil Dash A blog about making culture. Since 1999.

Okay, for the folks who asked: here's the majority AI view, writing up the reasonable, thoughtful view on AI that the vast majority of people in tech hold, that gets overshadowed by the bluster and hype of the tycoons trying to shill their nonsense. anildash.com/2025/10/17/t... Please share!

17.10.2025 19:29 — 👍 1118    🔁 486    💬 37    📌 145
Preview
Epic: are we production yet · Issue #63 · quarto-dev/quarto-markdown We need to check against many large sites to get a good sense for the impact of this new syntax in practice. autogenerated qmd quartodoc-generated sites (tbd meet with @machow) sites quarto.org Shi...

Quarto friends! I need your help:

We are implementing a new QMD parser in Quarto. It will be super nice. But it's a big change, and we want to minimize the impact.

1. Can you share a link to your Quarto project at github.com/quarto-dev/q...
2. repost this and let your Quarto friends know too?

16.10.2025 14:13 — 👍 27    🔁 32    💬 5    📌 0
Abstract: Under the banner of progress, products have been uncritically adopted or
even imposed on users — in past centuries with tobacco and combustion engines, and in
the 21st with social media. For these collective blunders, we now regret our involvement or
apathy as scientists, and society struggles to put the genie back in the bottle. Currently, we
are similarly entangled with artificial intelligence (AI) technology. For example, software updates are rolled out seamlessly and non-consensually, Microsoft Office is bundled with chatbots, and we, our students, and our employers have had no say, as it is not
considered a valid position to reject AI technologies in our teaching and research. This
is why in June 2025, we co-authored an Open Letter calling on our employers to reverse
and rethink their stance on uncritically adopting AI technologies. In this position piece,
we expound on why universities must take their role seriously toa) counter the technology
industry’s marketing, hype, and harm; and to b) safeguard higher education, critical
thinking, expertise, academic freedom, and scientific integrity. We include pointers to
relevant work to further inform our colleagues.

Abstract: Under the banner of progress, products have been uncritically adopted or even imposed on users — in past centuries with tobacco and combustion engines, and in the 21st with social media. For these collective blunders, we now regret our involvement or apathy as scientists, and society struggles to put the genie back in the bottle. Currently, we are similarly entangled with artificial intelligence (AI) technology. For example, software updates are rolled out seamlessly and non-consensually, Microsoft Office is bundled with chatbots, and we, our students, and our employers have had no say, as it is not considered a valid position to reject AI technologies in our teaching and research. This is why in June 2025, we co-authored an Open Letter calling on our employers to reverse and rethink their stance on uncritically adopting AI technologies. In this position piece, we expound on why universities must take their role seriously toa) counter the technology industry’s marketing, hype, and harm; and to b) safeguard higher education, critical thinking, expertise, academic freedom, and scientific integrity. We include pointers to relevant work to further inform our colleagues.

Figure 1. A cartoon set theoretic view on various terms (see Table 1) used when discussing the superset AI
(black outline, hatched background): LLMs are in orange; ANNs are in magenta; generative models are
in blue; and finally, chatbots are in green. Where these intersect, the colours reflect that, e.g. generative adversarial network (GAN) and Boltzmann machine (BM) models are in the purple subset because they are
both generative and ANNs. In the case of proprietary closed source models, e.g. OpenAI’s ChatGPT and
Apple’s Siri, we cannot verify their implementation and so academics can only make educated guesses (cf.
Dingemanse 2025). Undefined terms used above: BERT (Devlin et al. 2019); AlexNet (Krizhevsky et al.
2017); A.L.I.C.E. (Wallace 2009); ELIZA (Weizenbaum 1966); Jabberwacky (Twist 2003); linear discriminant analysis (LDA); quadratic discriminant analysis (QDA).

Figure 1. A cartoon set theoretic view on various terms (see Table 1) used when discussing the superset AI (black outline, hatched background): LLMs are in orange; ANNs are in magenta; generative models are in blue; and finally, chatbots are in green. Where these intersect, the colours reflect that, e.g. generative adversarial network (GAN) and Boltzmann machine (BM) models are in the purple subset because they are both generative and ANNs. In the case of proprietary closed source models, e.g. OpenAI’s ChatGPT and Apple’s Siri, we cannot verify their implementation and so academics can only make educated guesses (cf. Dingemanse 2025). Undefined terms used above: BERT (Devlin et al. 2019); AlexNet (Krizhevsky et al. 2017); A.L.I.C.E. (Wallace 2009); ELIZA (Weizenbaum 1966); Jabberwacky (Twist 2003); linear discriminant analysis (LDA); quadratic discriminant analysis (QDA).

Table 1. Below some of the typical terminological disarray is untangled. Importantly, none of these terms
are orthogonal nor do they exclusively pick out the types of products we may wish to critique or proscribe.

Table 1. Below some of the typical terminological disarray is untangled. Importantly, none of these terms are orthogonal nor do they exclusively pick out the types of products we may wish to critique or proscribe.

Protecting the Ecosystem of Human Knowledge: Five Principles

Protecting the Ecosystem of Human Knowledge: Five Principles

Finally! 🤩 Our position piece: Against the Uncritical Adoption of 'AI' Technologies in Academia:
doi.org/10.5281/zeno...

We unpick the tech industry’s marketing, hype, & harm; and we argue for safeguarding higher education, critical
thinking, expertise, academic freedom, & scientific integrity.
1/n

06.09.2025 08:13 — 👍 3462    🔁 1766    💬 104    📌 322
I Am An AI Hater I am an AI hater. This is considered rude, but I do not care, because I am a hater.

I considered writing a long carefully constructed argument laying out the harms and limitations of AI, but instead I wrote about being a hater. Only humans can be haters.

27.08.2025 17:04 — 👍 4020    🔁 1477    💬 141    📌 404
Microsoft Excel adds Copilot Al to help ...
theverge.com
The Verget-4.1-mini Al model | 5
successor to the LABS.GENERATIVEAI function Microsoft started experimenting
with in 2023.
Microsoft notes that you can combine its new Al function with other Excel functions, including IF, SWITCH, LAMBDA, or WRAPROWS. The company adds that information sent through Excel's COPILOT function is "never" used for AI training, as "the input remains confidential and is used solely to generate your requested output."
The COPILOT function comes with a couple of limitations, as it can't access information outside your spreadsheet, and you can only use it to calculate 100 functions every 10 minutes. Microsoft also warns against using the AI function for numerical calculations or in “high-stakes scenarios” with legal, regulatory, and compliance implications, as COPILOT "can
give incorrect responses."
Copy Share Select all Web search Dictionary
...

Microsoft Excel adds Copilot Al to help ... theverge.com The Verget-4.1-mini Al model | 5 successor to the LABS.GENERATIVEAI function Microsoft started experimenting with in 2023. Microsoft notes that you can combine its new Al function with other Excel functions, including IF, SWITCH, LAMBDA, or WRAPROWS. The company adds that information sent through Excel's COPILOT function is "never" used for AI training, as "the input remains confidential and is used solely to generate your requested output." The COPILOT function comes with a couple of limitations, as it can't access information outside your spreadsheet, and you can only use it to calculate 100 functions every 10 minutes. Microsoft also warns against using the AI function for numerical calculations or in “high-stakes scenarios” with legal, regulatory, and compliance implications, as COPILOT "can give incorrect responses." Copy Share Select all Web search Dictionary ...

Good thing no one uses Microsoft Excel for anything related to legal, regulatory or compliance business functions

www.theverge.com/news/761338/...

19.08.2025 17:50 — 👍 2739    🔁 872    💬 102    📌 330
Preview
VizDex A library of personal and independent blogs and newsletters dedicated to data visualization.

Introducing VizDex!

An ever-growing library of personal and independent blogs and newsletters dedicated to data visualization.

vizdexproject.com

20.06.2025 15:31 — 👍 115    🔁 32    💬 9    📌 6
Preview
AI Fatigue Is Wearing Me Down. The Hype Obscures What We Really Need to Know Commentary: It's not just you -- the AI onslaught is endless and exhausting.

The AI fatigue is real, from @katiecollins.bsky.social.

www.cnet.com/tech/service...

02.04.2025 18:22 — 👍 8    🔁 4    💬 0    📌 0
Preview
« Il ne faudrait pas découvrir la valeur de l’Etat de droit une fois perdu » : l’alerte de hauts magistrats français Dans des entretiens au « Monde », des hauts magistrats du Conseil d’Etat et de la Cour de cassation témoignent de leurs inquiétudes face aux attaques contre les principes juridiques mis en place en Eu...

« Comment fait-on comprendre au grand public que lorsque vous commencez à toucher les droits fondamentaux de certains, cela concerne, en réalité, les droits de toute la population ? »

07.03.2025 17:57 — 👍 807    🔁 353    💬 12    📌 14
Post image

L’Insee est désormais sur Bluesky ! Dorénavant, vous pouvez également retrouver nos publications en suivant notre compte @insee.fr. Abonnez-vous !

26.02.2025 11:28 — 👍 804    🔁 295    💬 17    📌 20
Post image

12 years. RIP my darling boy.

11.01.2025 17:05 — 👍 1581    🔁 315    💬 23    📌 18
ophirofox Une extension pour navigateur qui permet de lire les articles de presse en ligne sur le compte de bibliothèques ayant souscrit à europresse

Merci pour easyBNF, je ne connaissais pas. J'utilise l'extension de navigateur Ophirofox qui est vraiment bien faite également ophirofox.ophir.dev

02.01.2025 08:07 — 👍 19    🔁 4    💬 2    📌 1

A fascinating article whose conclusions could certainly be transposed to official statistics

25.12.2024 08:27 — 👍 5    🔁 0    💬 0    📌 0
Preview
Datalab - SSP Cloud Plateforme mutualisée de services de traitement des données statistiques et de datascience

Like this parquet file explorer? datalab.sspcloud.fr/data-explore...

17.12.2024 06:12 — 👍 0    🔁 0    💬 0    📌 0

Very interesting. I'm wondering why treating systematic errors, misreporting... are related to a specific analysis. Would it make sense to share these steps and results between different analysis?

25.11.2024 21:27 — 👍 0    🔁 0    💬 1    📌 0
Preview
a panda bear is rolling around in the grass in a zoo enclosure . Alt: a panda bear is rolling around in the grass in a zoo enclosure .

No one can explain stochastic gradient descent better than this panda.

24.11.2024 15:04 — 👍 216    🔁 32    💬 10    📌 6
Preview
Steady States of Data: Building a Foundation for Reproducible and Reusable Statistics Introduction In any National Statistical Office (NSO), the goal is simple but crucial: produce reliable, accurate statistics for the public good. At Statistics Norway (SSB), we’ve developed an approac...

Totally agree. What do you think of the concept of steady states of data?
www.linkedin.com/pulse/steady...

24.11.2024 00:54 — 👍 1    🔁 0    💬 1    📌 0
Preview
Dean Marchiori - 5 tips for dealing with IT Getting data science done with your IT team

As a data and analytics person embedded in the business, I’m often involved in robust discussions with IT over access to software & data. It can be frustrating but @deanmarchiori.bsky.social has some great tips on dealing with this conundrum

#dataBS #dataSky

www.deanmarchiori.com/posts/2024-1...

19.11.2024 10:11 — 👍 13    🔁 2    💬 1    📌 0
Onyxia Datalab Data science environment for k8s

Great post! Sad but true. However, when you succeed to convince executives that data science is a strategic priority and build a mature collaboration with IT, they install onyxia.sh

21.11.2024 04:52 — 👍 1    🔁 0    💬 0    📌 0

Has anyone built an integration of #llm in #quarto documents such as a function that reads all the previous generated text, some data structure of plot and some instructions to then outputs a text to be rendered? @quarto.org @posit.co @hadleywickham.bsky.social

17.11.2024 08:07 — 👍 3    🔁 1    💬 3    📌 0

Dear data folks, what differences do you see between data cleaning as a data science domain and data editing as an official statistics domain? The objectives are the same. Yet data editing has been an active area of research for decades. The two communities seem to ignore each other.

17.11.2024 07:15 — 👍 2    🔁 0    💬 1    📌 0

@rlesur is following 20 prominent accounts