Hot take: I think we just demonstrated the first AI agent computer worm π€
When an agent sees a trigger image it's instructed to execute malicious code and then share the image on social media to trigger other users' agents
This is a chance to talk about agent security π
20.03.2025 14:28 β π 8 π 2 π¬ 0 π 0
Very interesting paper about unlearning for AI Safety, a subject that deserves more attention. β¬οΈ
11.01.2025 15:11 β π 49 π 6 π¬ 0 π 0
Thanks to my amazing collaborators:
@TingchenFu @AmyPrb @StephenLCasper
@AmartyaSanyal @Adel_Bibi @aidanogara_ @_robertkirk @ben_s_bucknall @fiiiiiist Luke Ong @philiptorr Kwok-Yan Lam @RobertTrager
@DavidSKrueger @sorenmind JosΓ© HernΓ‘ndez-Orallo @megamor2.bsky.social @yaringal.bsky.social
10.01.2025 16:58 β π 7 π 1 π¬ 0 π 0
Excited about our most recent work on the challenges we face when using unlearning method for safe and secure AI! Work done collaboratively by a great team & led by @fbarez.bsky.social
10.01.2025 17:06 β π 11 π 0 π¬ 0 π 0
Link?
21.12.2024 09:52 β π 3 π 0 π¬ 1 π 0
A photograph of a white robot wearing a hat holding up a white sheet of paper with the number eight on it. In the background, there is a faded beige building in front of a dark blue backdrop.
A dark blue background with white text reading β...coverage in eight of the top papers for Associate Professor Yarin Galβs research projects. Associate Professor Yarin Gal is in the papers a lot... for good reason! The AI and ML Associate Professorβs work on researching the potential βcollapseβ of machine learning models, hallucinating Large Language Models (LLMs), and the AI tool EVEscape that could help predict viral outbreaks have all been picked up in the press this year. Take a look: Nature (Model Collapse). BBC (EVEscape). Financial Times (Model Collapse). Forbes (Model Collapse). Time (LLM hallucination). Independent (LLM hallucination). Euronews (LLM hallucination). The Standard (LLM hallucination)β.
On the eighth day of Christmas, RobOx gave to us: coverage in eight of the top papers for Associate Professor Yarin Galβs research projects. @yaringal.bsky.social
#CompSciOxford #12DaysOfChristmas #Oxmas
08.12.2024 10:36 β π 3 π 1 π¬ 0 π 0
I look forward to co-directing the Canadian AI Safety Institute (CAISI) Research Program at CIFAR with @catherineregis.bsky.social
We will be designing the program in the coming months and will soon share ways to get involved with this new community.
Read more here: cifar.ca/cifarnews/20...
12.12.2024 19:36 β π 30 π 5 π¬ 4 π 0
I'm looking for PhD applicants who have expertise in Gaussian processes and/or Transformers for an exciting PhD project
If this sounds interesting, application deadline for funding is 3/12
Please share with people you think this might be relevant to!
oatml.cs.ox.ac.uk/apply.html
30.11.2024 14:42 β π 38 π 8 π¬ 1 π 0
Welcome to the Crazy Rich Bayesian Starter Pack, folk who are/were vaguely into Bayesian reasoning but - with a few exceptions - don't shun the non-Bayesian.
go.bsky.app/JYH5Z6M
25.11.2024 12:08 β π 76 π 13 π¬ 24 π 5
@girving.bsky.social probably has more suggestions. Maybe Scott?
25.11.2024 16:57 β π 0 π 0 π¬ 0 π 0
brew install mactop
github.com/context-labs...
23.11.2024 20:02 β π 51 π 6 π¬ 1 π 1
The International Society for Bayesian Analysis (ISBA) has joined Bluesky. You can follow the account at @isba-bayesian.bsky.social to stay updated on events, publications, and discussions within the #Bayesian community.
Please add the account to your starter packages.
23.11.2024 22:25 β π 39 π 17 π¬ 1 π 0
Now that @jeffclune.bsky.social and @joelbot3000.bsky.social are here, time for an Open-Endedness starter pack.
go.bsky.app/MdVxrtD
20.11.2024 07:08 β π 105 π 32 π¬ 16 π 5
On my way to Oxford to meet amazing people and give a talk on the opportunities of AI to accelerate progress in environmental modeling.
20.11.2024 08:35 β π 15 π 1 π¬ 2 π 0
Assistant Professor (Tenure Track) of Computer Science β Responsible Artificial Intelligence
π£ We have a tenure-track faculty opening in Responsible AI at @ethzurich.bsky.social :
ethz.ch/en/the-eth-z.... Deadline Nov 30 for full consideration. ETH Zurich is a vibrant environment for AI research with the ETH AI Center etc. Please help spread the word!
20.11.2024 08:31 β π 79 π 23 π¬ 2 π 0
Some machine learners were once children. Hereβs where you can find them:
go.bsky.app/F6mM37U
19.11.2024 23:31 β π 125 π 16 π¬ 18 π 3
I donβt need to go on social media to have my worldview challenged I am in theoretical physics I have a new existential crisis daily
18.11.2024 23:43 β π 26811 π 2034 π¬ 368 π 122
MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-based Programming Assistants
LLM-based programming assistants offer the promise of programming faster but with the risk of introducing more security vulnerabilities. Prior work has studied how LLMs could be maliciously fine-tuned...
Since this platform is finally attracting a critical mass of ML researchers, here's our recent work on prompt-based vulnerabilities of coding assistants:
arxiv.org/abs/2407.11072
TL;DR β An attacker can convince your favorite LLM to suggest vulnerable code with just a minor change to the prompt!
17.11.2024 23:40 β π 216 π 32 π¬ 4 π 4
Hey, this Friday I'm the Keynote speaker at the 20th AAAI Conference on AI and Interactive Digital Entertainment (AIIDE), the best conference on AI and Games sites.google.com/gcloud.utah....
I think I will talk about why the next big challenge in AI game playing should be Dungeons and Dragons π§π
19.11.2024 03:24 β π 84 π 7 π¬ 7 π 4
All the ACL chapters are here now: @aaclmeeting.bsky.social @emnlpmeeting.bsky.social @eaclmeeting.bsky.social @naaclmeeting.bsky.social #NLProc
19.11.2024 03:48 β π 107 π 37 π¬ 1 π 3
Hey! @friedler.net made a FAccT starter pack: bsky.app/starter-pack...
19.11.2024 03:52 β π 10 π 5 π¬ 0 π 0
Hope I'm the first to post this all time classic on this platform
19.11.2024 04:51 β π 2928 π 627 π¬ 39 π 28
Hey, @bsky.app @support.bsky.team, is there a way for you to shorten the displayed usernames when trailed by βbsky.socialβ? If someone has some other domain name, then fine, show that, but if we're using the default domain, can we get rid of these lengthy string of characters?
18.11.2024 20:29 β π 85 π 7 π¬ 6 π 1
I've created an initial Grumpy Machine Learners starter park. If you think you're grumpy and you "do machine learning", nominate yourself. If you're on the list, but don't think you are grumpy, then take a look in the mirror.
go.bsky.app/6ddpivr
18.11.2024 14:40 β π 415 π 55 π¬ 124 π 15
Google DeepMind is hiring Student Researchers in EMEA π
18.11.2024 12:27 β π 33 π 4 π¬ 1 π 0
https://ai.ethz.ch/education/phd-and-postdoc-programs.html
π£ Last call for the Ph.D. and Postdoc Fellowships at the ETH AI Center -- Deadline Nov 19 '24 t.co/aYI5tWXUWK @ethzurich.bsky.social
18.11.2024 10:52 β π 21 π 9 π¬ 0 π 0
Safety case template for frontier AI: A cyber inability argument
Frontier artificial intelligence (AI) systems pose increasing risks to society, making it essential for developers to provide assurances about their safety. One approach to offering such assurances is...
Iβm keen to dig more into safety cases, thereβs something βproving a negativeβ about them but equally itβs good to see a really concrete attempt to tether speculation. Hereβs a new piece from UK AISI @girving.bsky.social and gov AI attempting to provide a template
arxiv.org/abs/2411.08088
17.11.2024 14:18 β π 9 π 2 π¬ 1 π 0
Couldn't find a machine learning for health starter pack so I made one.Β
DM/Reply if you want to be added!
go.bsky.app/PJKJ8vK
17.11.2024 06:34 β π 109 π 29 π¬ 48 π 0
I created a starter pack of scientists in the European Laboratory for Learning and Intelligent Systems (ELLIS) πͺπΊ
Please ping me and Iβll add you.
go.bsky.app/Cihupkk
17.11.2024 16:02 β π 77 π 27 π¬ 46 π 1
Let's build AI's we can trust!
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Security and Privacy of Machine Learning at UofT, Vector Institute, and Google π¨π¦π«π·πͺπΊ Co-Director of Canadian AI Safety Institute (CAISI) Research Program at CIFAR. Opinions mine
AI Professor @UCIrvine | Formerly @blei_lab, @Princeton | #GenAI, #Compression, #AI4Science | General Chair @aistats_conf 2025 | AI Resident @ChanZuckerberg
VP, AI Models @IBMResearch, IBM Director, @MITIBMLab. Former prof and serial/parallel entrepreneur.
Probabilistic machine learning and its applications in AI, health, user interaction.
@ellisinstitute.fi, @ellis.eu, fcai.fi, @aifunmcr.bsky.social
Phd Student @universityofoxford.bsky.social working with Yarin Gal, Michael Bronstein and Haggai Maron
I work at Sakana AI ππ π‘ β @sakanaai.bsky.social
https://sakana.ai/careers
https://mega002.github.io
Professor at UW; Researcher at Meta. LMs, NLP, ML. PNW life.
Sr Manager / Lab Lead @ Spotify
andreasdamianou.com
Prof of machine learning at University of Helsinki. Interested in (differential) privacy and open source software.
Associate Professor at MIT EECS, LIDS.
Machine Learning Professor
https://cims.nyu.edu/~andrewgw
Machine learning prof at U Toronto. Working on evals and AGI governance.
The International Society for Bayesian Analysis (ISBA), est. 1992 to promote the development and application of Bayesian analysis, provides an international community for those interested in Bayesian analysis and its applications.
https://bayesian.org
PhD in AI (Edinburgh), ex-Google, ex-Alexa, serial failed entrepreneur, full time digital nomad for 5 years. Currently trying to on help researchers research faster to reverse aging.
Opinions are mine, but you're welcome to them
Machine Learning engineer @planet. Mentor with Frontier Development Lab. Previously at Dropbox & Google. Started coworking. Interests: Machine Learning, space, Earth Observation, VR.
http://codinginparadise.org
Twitter: @bradneuberg