Our Python snippet for measuring wAII for a job posting, which is open-sourced.
Our code is open-source. With just a few lines of Python, anyone can measure AI exposure for any job description using our repository. (5/5)
Paper: workshop-proceedings.icwsm.org/abstract.php...
Repository: github.com/EunCheolChoi...
26.06.2025 16:30 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
In the information industry sector, our wAII is โpositivelyโ associated with offered salary.
On the other hand, in the wholesale trade industry sector, wAII is โnegativelyโ associated with offered salary.
๐๐ก๐๐ญ ๐ฐ๐ ๐๐จ๐ฎ๐ง๐
1. Jobs in tech, manufacturing, and engineering are more exposed to AI; HR, public service, and legal sectors are less exposed
2. Exposure to AI shows distinct association patterns regarding offered salary depending on the industry sectors (4/5)
26.06.2025 16:30 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
A diagram explaining how we extracted โtasksโ from each job post. A task is a role or a skillset that is essential for a job position.
We query based on extracted tasks and retrieve the most similar AI-related patents in terms of semantic similarities.
Weighted AI Index is calculated as the sum of the product between the task weight and the similarity of the task and the retrieved patent.
Full LLM prompt for extracting tasks from a job post.
To track this, we developed the Weighted AI Index (wAII), a scalable method to measure how closely a jobโs tasks align with recent AI innovations.
We extract key job tasks from postings; compare them to AI-related patents; compute an โAI exposureโ score. (3/5)
26.06.2025 16:30 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
A bar chart comparing โweighted AI index (wAII)โ of different industry sectors. wAII is our proposed method of measuring how much a job position is exposed to AI technologies.
๐๐๐ฒ ๐ข๐๐๐
AI doesn't affect all jobs equally. Some industries and roles are far more exposed to disruption from technological innovation than others. (2/5)
26.06.2025 16:30 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Title page for our new paper, โMapping Labor Market Vulnerability in the Age of AIโ
๐พ๐๐๐ ๐จ๐ฐ ๐๐๐๐ ๐๐ ๐๐๐?
Itโs a question that keeps many of us up at nightโand for good reason.
Our new research maps labor market vulnerability in the age of AI with 100K job postings and 50K AI-related patents. (1/5)
26.06.2025 16:30 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
"Limited effectiveness of LLM-based data augmentation for COVID-19 misinformation stance detection" by @euncheolchoi.bsky.social @emilioferrara.bsky.social et al, presented by the awesome Chur at The Web Conference 2025
arxiv.org/abs/2503.02328
01.05.2025 05:06 โ ๐ 6 ๐ 2 ๐ฌ 0 ๐ 0
Hot takes:
- The benefits of easily accessible social media data usually outweigh the potential harms.
- Some uses of (public) social media data are unethical/should be illegal, and we should target that.
- We're better off having clear boundaries between public and private online spaces.
28.11.2024 18:38 โ ๐ 12 ๐ 2 ๐ฌ 0 ๐ 1
Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'
A Hugging Face employee made a huge dataset of Bluesky posts, and itโs already very popular.
An employee of Huggingface, a site of AI training datasets, made a dataset of a million Bluesky posts scraped simply because they could. Itโs currently trending: www.404media.co/someone-made...
27.11.2024 00:09 โ ๐ 1109 ๐ 476 ๐ฌ 60 ๐ 196
Bluesky's firehose is a treasure trove of public data for researchers and developers, and it's completely free. Check out our developer docs: docs.bsky.app
23.11.2024 05:54 โ ๐ 7934 ๐ 1532 ๐ฌ 321 ๐ 167
Book outline
Over the past decade, embeddings โ numerical representations of
machine learning features used as input to deep learning models โ have
become a foundational data structure in industrial machine learning
systems. TF-IDF, PCA, and one-hot encoding have always been key tools
in machine learning systems as ways to compress and make sense of
large amounts of textual data. However, traditional approaches were
limited in the amount of context they could reason about with increasing
amounts of data. As the volume, velocity, and variety of data captured
by modern applications has exploded, creating approaches specifically
tailored to scale has become increasingly important.
Googleโs Word2Vec paper made an important step in moving from
simple statistical representations to semantic meaning of words. The
subsequent rise of the Transformer architecture and transfer learning, as
well as the latest surge in generative methods has enabled the growth
of embeddings as a foundational machine learning data structure. This
survey paper aims to provide a deep dive into what embeddings are,
their history, and usage patterns in industry.
Cover image
Just realized BlueSky allows sharing valuable stuff cause it doesn't punish links. ๐คฉ
Let's start with "What are embeddings" by @vickiboykis.com
The book is a great summary of embeddings, from history to modern approaches.
The best part: it's free.
Link: vickiboykis.com/what_are_emb...
22.11.2024 11:13 โ ๐ 653 ๐ 101 ๐ฌ 22 ๐ 6
Opportunities and risks of LLMs in survey research
Recent advances in the development of large language models (LLMs) bring both disruptive opportunities and underlying risks to survey research. LLMs' capabiliti
New Paper on Opportunities and risks of LLMs in survey research papers.ssrn.com/sol3/papers.... " Backed by both practical examples & academic literature, we identify areas
for research and development, distinguishing between challenges related to survey methods &
the tools used to deploy surveys"
22.11.2024 15:07 โ ๐ 4 ๐ 2 ๐ฌ 1 ๐ 1
Interested in RLHF, DPO, LLM alignment?
I've just created this list featuring awesome people like @natolambert.bsky.social .
The list is the opposite of exhaustive; I've just joined some days ago ๐
go.bsky.app/MqRGAf2
21.11.2024 13:26 โ ๐ 83 ๐ 19 ๐ฌ 10 ๐ 1
If you're keen to learn content verification for fact-checking and open source investigations, this is a step-by-step guide on how to verify images and videos that I posted a while ago.
Once you familiarise yourself with reverse search, you'll become much better at spotting online misinformation.
14.11.2024 00:56 โ ๐ 416 ๐ 165 ๐ฌ 39 ๐ 8
Sharing my first Computational Social Science starter pack! Will grow with time, feel free to nominate and self nominate!
go.bsky.app/CYmRvcK
13.11.2024 02:05 โ ๐ 97 ๐ 41 ๐ฌ 61 ๐ 3
Ready for another Computational Social Science Starter Pack?
Here is number 2! More amazing folks to follow! Many students and the next gen represented!
go.bsky.app/GoEyD7d
14.11.2024 23:42 โ ๐ 77 ๐ 52 ๐ฌ 33 ๐ 43
The official account of the Journal of Health Communication- a leading journal covering the full breadth of health communication; seeking to advance a synergistic relationship between research and practical information.
Ph.D. Candidate in Data Science
(Graduating May 2025) | CSS |
Sapienza University, CENTAI (Italy)
Ph.D. in Artificial Intelligence for Society, University of Pisa and CNR
Previously: University of Trento
Links: https://linktr.ee/jordi.condom
MPhil Candidate at University of Adelaide
USC Information Sciences Institute, a unit of @viterbischool.usc.edu, is a world leader in research and development of advanced information processing, computer and communications technologies.
Co-founder of Reliant AI. Scientist. European federalist.
Deep dive on AI companies //
Goldman Sachs Alumni //
"Machine intelligence is the last invention that humanity will ever need to make.โ โ Nick Bostrom
Please consider subscribing to substack for support: https://tinyurl.com/AI-Monaco
Civic tech, social computing, online communities, comp social science
Previously: comp neuro, analytics (Bensmaia Lab @UChicago, Fulbright@LMU Munich, Bluebonnet Data, Amazon)
Boston -> Chicago -> Munich -> ...
โHumankind has not woven the web of life. We are but one thread within it. Whatever we do the web, we do to ourselves.โ
โ Chief Seattle, Squamish-Duamish
(1782 โ June 7, 1866)
https://x.com/DavidUllrich202 ๐ณ๏ธโ๐
Saber crear software de calidad te da libertad.
Escribo historias, consejos y experiencias.
https://xurxodev.com/libros
https://xurxodev.com/estudio-comunidad-xurxodev/
Atheist, Skeptic, Geek, Nerd, Human,๐ณ๏ธโ๐โฌ๏ธโ๏ธ
I post interesting things, check and follow if you like.
๐ซ Crypto ๐ซContent ๐ซ Begging money ๐ซ Sale ๐ซ Bot
Researching fear as a tool in online public discourse. Side hustle: Making sure that Language Models gets scared too (detecting it).
PhDing @gesis.org โข Computational Social Science โข NLProc
https://s-vigneshwaran.github.io
Researcher & faculty member @DPKM dedicated to the field of AI, with the focus on knowledge technologies (knowledge graphs, semweb, RAG) & their use in e-gov, skills matching, research ecosystem, digital humanities and education. Partner @km-a.bsky.social.
PhD CS @CSatUSC. (Mostly) doing @nlp_usc.
๐ MS CS (AI) @USC.
๐ BSc CE at SBU.
Where Law and Technology Converge. The Latest AI News Impacting The Legal Industry. Empowering Lawyers with AI โ Gain The Edge To Win #LegalTech #LegalAI
https://thelegalwire.ai
Co-leader OWASP Cornucopia. If you like what we do for open source, visit our code repository https://github.com/OWASP/cornucopia and give us a star โญ
๐ ยซDifference is of the essence of humanityยป ๐ฆ โ John Hume
#appsec #owasp #cornucopia #threatmodeling
(she/her) | #NLProc PhD student | NSF GRFP | LLM bias evalutation | community-engaged ethical AI | advocating for women and LGBTQ+ in STEM | okie living in LA