Sune Lehmann's Avatar

Sune Lehmann

@sunelehmann.com.bsky.social

Your friendly neighborhood suneman

867 Followers  |  152 Following  |  17 Posts  |  Joined: 29.11.2023  |  1.6745

Latest posts by sunelehmann.com on Bluesky

So good!

23.07.2025 11:55 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
IceLab Camp Part of the Stress Response Modeling Research School, IceLab Camp is a four-day off-site PhD course that prepares its participants to create new inter- or multidisciplinary research by first teaching ...

Registration is now open for IceLab Camp โ€“ a four-day off-site PhD course designed to train participants in asking research questions, laying the foundation for new multidisciplinary collaborations. www.umu.se/en/icelab/ca...

17.04.2025 05:19 โ€” ๐Ÿ‘ 6    ๐Ÿ” 5    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 2
Image of an abstract with the following text

Abstract: A year ago, ChatGPT surprised the world with its extraordinary language generation capabilities. Chatbots have since become of the fastest adopted consumer product in history with investments in genAI forecasted to reach $12B this year. In this talk, I will first review the fast-evolvingliterature on the document-level membership inference task for LLMs: the methods proposed to detect--a posteriori--whether a specific piece of text was seen during training by an LLM and at least partially memorized, the distribution shift concerns, and some of the solutions proposed. I will then discuss the use of randomized controlled setups to causally study LLM memorization. In particular, Iwill discuss how randomized controlled setup have shed lights on the determinant of memorization and showed LLMs to have a mosaic memory. I will conclude the talks with same thoughts on the security and privacy challenges ahead when it comes to LLMs and the use of synthetically generated trap sequences for membership inference.

Image of an abstract with the following text Abstract: A year ago, ChatGPT surprised the world with its extraordinary language generation capabilities. Chatbots have since become of the fastest adopted consumer product in history with investments in genAI forecasted to reach $12B this year. In this talk, I will first review the fast-evolvingliterature on the document-level membership inference task for LLMs: the methods proposed to detect--a posteriori--whether a specific piece of text was seen during training by an LLM and at least partially memorized, the distribution shift concerns, and some of the solutions proposed. I will then discuss the use of randomized controlled setups to causally study LLM memorization. In particular, Iwill discuss how randomized controlled setup have shed lights on the determinant of memorization and showed LLMs to have a mosaic memory. I will conclude the talks with same thoughts on the security and privacy challenges ahead when it comes to LLMs and the use of synthetically generated trap sequences for membership inference.

01.04.2025 15:46 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
LLM's unintended memories Join us for a talk by Yves-Alexandre de Montjoye on the unintended memories of LLM's on April 8!

Speaker: Yves-Alexandre de Montjoye
Time: 8 Apr. 2025, 15:00-16:00
Place: SODAS Conference Room - 1.1.12, ร˜ster-Farimagsgade 5
Title: LLM's unintended memories

sodas.ku.dk/events/llms-...

01.04.2025 15:46 โ€” ๐Ÿ‘ 6    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

People of Copenhagen:

On Tuesday April 8th, we have awesome privacy researcher @yvesalexandre.bsky.social visiting the group. Yves is a bold and creative scientist, and also former advisor to Marianne Vestager.

Yves will give a talk at SODAS at 3pm that's open to the public (details below)

01.04.2025 15:46 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Today is the (hard) deadline for the postdoctoral call listed below! Join us #postdoc #scienceofscience #AI #networkscience

01.04.2025 11:45 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

Congratulations! Well deserved :)

01.04.2025 11:40 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Associate Professor or DTU Tenure Track Assistant Professor in AI - DTU Compute Are you passionate about Human Centered AI and can you help build the next generations of AI systems within the Danish Pioneer Centre for AI? DTU Compute has an attractive group leader opening as Tenu...

Job opening at DTU Compute & @aicentre.dk - Associate Professor or Tenure Track

27.03.2025 14:43 โ€” ๐Ÿ‘ 20    ๐Ÿ” 8    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Preview
Forskere: Vi stรฅr over for en teknologisk revolution, og Danmark kan vise vejen Med sine stรฆrke traditioner for fรฆllesskab og tillid kan Danmark vise, at ai bรฅde kan skabe vรฆrdi og gรธre samfundet bedre.

New Politiken op-ed (in ๐Ÿ‡ฉ๐Ÿ‡ฐ) by @adler-nissen.bsky.social, @sunelehmann.com, Morten Axel Pedersen and other colleagues. They talk about how AI can be used democratically, innovatively, justly and with the interests of Danish citizens in mind. Read on: politiken.dk/debat/kronik...

25.03.2025 08:36 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐Ÿ‡ฉ๐Ÿ‡ฐ ๐Ÿค– ๐ŸŒŽ In yesterday's @politiken.dk we shared an op-ed outlining our vision for CAISA - Denmark's National Center for Artificial Intelligence in Society, led by @adler-nissen.bsky.social and deputy director Professor Thomas Moeslund (1/10)

25.03.2025 08:03 โ€” ๐Ÿ‘ 46    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Ph.D. and postdoc positions in Data Science for epidemic preparedness at the NERDS research group The NERDS (NEtwoRks, Data, and Society) team at the IT University of Copenhagen welcomes applications from aspiring PhD students and postdocs in the areas of Da

I am hiring PhD students and Postdocs to join me in beautiful Copenhagen. Together, we will develop data science methods to improve epidemic preparedness.

Copenhagen is amazing, salary is good, we have plenty of funding, and a great community.

Read more..: candidate.hr-manager.net/ApplicationI...

24.03.2025 12:59 โ€” ๐Ÿ‘ 53    ๐Ÿ” 44    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 2

๐Ÿšจ Cool job alert in #DataScience

If you're looking for a PhD or postdoc position then take a look here ๐Ÿ‘‡

Jonas is a fantastic researcher, you will be embedded in a fantastic research group, and work on highly relevant and applicable research

24.03.2025 18:37 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image

๐ŸŒ Join us in Copenhagen - fantastic city for work-life balance and awesome science.

I am hiring PhD students & Postdocs in my group at SODAS Univ of Copenhagen to explore AI, network science & the science of science.

๐Ÿ“ Start: flex summer 2025
๐Ÿ“Œ Info&Apply: www.robertasinatra.com/2025/03/02/p...

04.03.2025 10:26 โ€” ๐Ÿ‘ 80    ๐Ÿ” 57    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 3
Executive Summary
A pro-Russia content aggregation network, Pravda, appears to be set up to flood large-language models with pro-Kremlin content, The American Sunlight Project has found. 

Over the past several months, ASP researchers have investigated 108 new domains and subdomains belonging to the Pravda network, a previously-established ecosystem of largely identical, automated web pages that previously targeted many countries in Europe as well as Africa and Asia with pro-Russia narratives about the war in Ukraine. ASPโ€™s research, in combination with that of other organizations, brings the total number of associated domains and subdomains to 182. The networkโ€™s older targets largely consisted of states belonging to or aligned with the West. 

Notably, this latest expansion includes many countries in Africa, the Asia-Pacific, the Middle East, and North America. It also includes entities other than countries as targets, specifically non-sovereign nations, international organizations, audiences for specific languages, and prominent heads of state. The top objective of the network appears to be duplicating as much pro-Russia content as widely as possible. With one click, a single article could be autotranslated and autoshared with dozens of other sites that appear to target hundreds of millions of people worldwide. 

ASP researchers also believe the network may have been custom-built to flood large language models (LLMs) with pro-Russia content. The network is unfriendly to human users; sites within the network boast no search function, poor formatting, and unreliable scrolling, among other usability issues. This final finding poses foundational implications for the intersection of disinformation and artificial intelligence (AI), which threaten to turbocharge highly automated, global information operations in the future.

Executive Summary A pro-Russia content aggregation network, Pravda, appears to be set up to flood large-language models with pro-Kremlin content, The American Sunlight Project has found. Over the past several months, ASP researchers have investigated 108 new domains and subdomains belonging to the Pravda network, a previously-established ecosystem of largely identical, automated web pages that previously targeted many countries in Europe as well as Africa and Asia with pro-Russia narratives about the war in Ukraine. ASPโ€™s research, in combination with that of other organizations, brings the total number of associated domains and subdomains to 182. The networkโ€™s older targets largely consisted of states belonging to or aligned with the West. Notably, this latest expansion includes many countries in Africa, the Asia-Pacific, the Middle East, and North America. It also includes entities other than countries as targets, specifically non-sovereign nations, international organizations, audiences for specific languages, and prominent heads of state. The top objective of the network appears to be duplicating as much pro-Russia content as widely as possible. With one click, a single article could be autotranslated and autoshared with dozens of other sites that appear to target hundreds of millions of people worldwide. ASP researchers also believe the network may have been custom-built to flood large language models (LLMs) with pro-Russia content. The network is unfriendly to human users; sites within the network boast no search function, poor formatting, and unreliable scrolling, among other usability issues. This final finding poses foundational implications for the intersection of disinformation and artificial intelligence (AI), which threaten to turbocharge highly automated, global information operations in the future.

A pro-Russia content aggregation network is churning out at least 3 MILLION pieces of propaganda per year, all on sites that are virtually unusable by humans.

So what's the goal? We explore the idea that it might be to flood LLMs with pro-Russia content:
static1.squarespace.com/static/6612c... 1/

27.02.2025 13:49 โ€” ๐Ÿ‘ 1140    ๐Ÿ” 568    ๐Ÿ’ฌ 23    ๐Ÿ“Œ 86
Post image

Visiting @mpidr.bsky.social this weekโ€”super excited to see whatโ€™s happening in Demographic Studies (donโ€™t miss my talk!).
Also, Iโ€™ll be in Berlin on Feb 2, Helsinki from Feb 3-6, and Copenhagen from Feb 10-12. Let me know if youโ€™re around and up for a coffee ๐Ÿงช๐Ÿ”ฌโ˜•๏ธ

27.01.2025 14:48 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

Nice!!

19.12.2024 05:28 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
How to make your own Die Hard Christmas tree ornament. Weโ€™ve all by now seen this excellent Die Hard Christmas tree ornament, but since it got spread around virally so late in the year no one had any time to sell them (ahem, Etsy). You can buy Diโ€ฆ

Many would argue that no xmas tree is complete without the classic John-McClane-crawling-through-an-airduct ornament (via kottke.org) cohenaaron.wordpress.com/2016/12/23/h...

18.12.2024 18:55 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
โ€œIโ€™m pretty sure that if you type something like โ€˜shoesโ€™ or โ€˜handbags,โ€™ youโ€™re going to get the Shopping UI with hundreds of ads screaming at you to buy [their product],โ€ he said. โ€œSo thereโ€™s no incentive to . . . give you an aggregated summary with exactly what people are saying about different shoes so you know exactly what you [should] buy.โ€

โ€œIโ€™m pretty sure that if you type something like โ€˜shoesโ€™ or โ€˜handbags,โ€™ youโ€™re going to get the Shopping UI with hundreds of ads screaming at you to buy [their product],โ€ he said. โ€œSo thereโ€™s no incentive to . . . give you an aggregated summary with exactly what people are saying about different shoes so you know exactly what you [should] buy.โ€

I disagree with perplexity.ai CEO Aravind Srinivas about the importance of sourcing in search results, but boy does he have Google's number here. Search for something they can't sell, you get an AI summary. Something they can, you don't. Try it yourself.

From: www.fastcompany.com/91125423/per...

18.12.2024 06:05 โ€” ๐Ÿ‘ 306    ๐Ÿ” 75    ๐Ÿ’ฌ 11    ๐Ÿ“Œ 1
Preview
Mass hysteria was the inevitable outcome of the UFO craze Dancing in Strasbourg, vol. 3675

Mass hysteria was the inevitable outcome of the UFO craze

17.12.2024 15:54 โ€” ๐Ÿ‘ 2    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Dear academics

We could dramatically reduce our administrative workload if we all just agreed not to ask for reference letters until we made our list of finalists.

This is massive collective action problem has already been solved byโ€ฆ

*checks notes*

โ€ฆevery other industry on earth.

14.12.2024 15:00 โ€” ๐Ÿ‘ 630    ๐Ÿ” 127    ๐Ÿ’ฌ 21    ๐Ÿ“Œ 21
Key quote from long article

Key quote from long article

One of the key reasons to use entropy in scientific work apparently already expressed by von Neumann. From www.quantamagazine.org/what-is-entr...

17.12.2024 12:52 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Tiago P. Peixoto - Untangling the hairball using statistical inference Inferential Network Science

New blog post:

โ€œUntangling the hairball using statistical inferenceโ€

skewed.de/tiago/posts/...

20.05.2024 12:34 โ€” ๐Ÿ‘ 27    ๐Ÿ” 10    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
GitHub - microsoft/markitdown: Python tool for converting files and office documents to Markdown. Python tool for converting files and office documents to Markdown. - microsoft/markitdown

Wild! Microsoft has a tool to convert office, etc files to markdown. I haven't tested much, but could be a nice alternative to Pandoc (when the desired output is markdown) github.com/microsoft/ma...

15.12.2024 11:31 โ€” ๐Ÿ‘ 10    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Not that I know of, but it's a cool metric. I guess google knows. Part of that info is present in maps via the estimated travel time, but one would also need to know/estimate the number of folks on the street over time.

14.12.2024 12:58 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

We are hiring a postdoc/researcher at my group (SUNLab) @nunetsi.bsky.social. Interested in working on multidisciplinary problems like segregation, health, or economic growth from the perspective of Social Urban Networks? Contact me if you are interested. Please share it!

15.11.2024 13:39 โ€” ๐Ÿ‘ 46    ๐Ÿ” 53    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 6
Post image

๐Ÿ“• The Atlas for the Aspiring Network Scientist by
Michele Coscia is amazing!
๐Ÿ‘‰Available as a free electronic PDF: networkatlas.eu

#Networkscience #Datavisualization #ArtificialIntelligence

05.12.2024 17:07 โ€” ๐Ÿ‘ 29    ๐Ÿ” 12    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Preview
INTELLECT-1 Release: The First Globally Trained 10B Parameter Model We're excited to release INTELLECT-1, the first 10B parameter language model collaboratively trained across the globe. This represents a 10ร— scale-up from our previous research and demonstrates that l...

This is interesting. People are starting to do distributed training runs of big models. Still problematic in terms of energy use and overall safety (!), but it seems to mean that all the LLM action doesn't need to be concentrated within a few big companies.

www.primeintellect.ai/blog/intelle...

04.12.2024 13:14 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
What do you love when you fall for AI? Inside the surprisingly meaningful, unexpectedly heartbreaking, and deeply confusing reality of AI relationships.

One of the best pieces we published this yearโ€”a deep (and surprisingly moving) story about AI chatbots and the people who love them.

03.12.2024 17:36 โ€” ๐Ÿ‘ 65    ๐Ÿ” 8    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 4
Preview
Foursquare Open Source Places: A new foundational dataset for the geospatial community Stay up to date with the latest from Foursquare! Learn more about Foursquare Open Source Places: A new foundational dataset for the geospatial community

Great move! Foursquare just open-sourced their 100M+ place point of interest dataset. location.foursquare.com/resources/bl...

20.11.2024 13:24 โ€” ๐Ÿ‘ 33    ๐Ÿ” 13    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0

If you're looking for a postdoc, I cannot recommend Esteban's group highly enough. Esteban is awesome, the work is cool and innovative, & being at the Network Science Institute puts you near the center of the Network Science Universe.

20.11.2024 13:05 โ€” ๐Ÿ‘ 7    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@sunelehmann.com is following 20 prominent accounts