Estimating Correctness Without Oracles in LLM-Based Code Generation
Generating code from natural language specifications is one of the most successful applications of Large Language Models (LLMs). Yet, they hallucinate: LLMs produce outputs that may be grammatically c...
Can we statistically estimate how likely an LLM-generated program is correct w/o knowing what is a correct program for that task?
Sounds impossible-but it's actually really simple. In fact, our measure of "correctness" called incoherence can be estimated (PAC guarantees).
arxiv.org/abs/2507.00057
02.07.2025 07:26 β π 11 π 3 π¬ 3 π 2
SQuaLity is a unified database system test suite. Currently, database test suites are not systematically re-used across systems. We explore the opportunities and challenges of doing so. Reusing test suites is difficult due to multiple factors, but also allows finding otherwise overlooked bugs.
19.06.2025 08:09 β π 1 π 0 π¬ 0 π 0
CODDTest is a new testing approach for finding logic bugs in relational database systems. We got inspiration from constant folding and propagation and compilers; by ensuring that the database state is constant, we can fold elements of a query to then check whether its result remains unchanged.
19.06.2025 08:09 β π 2 π 0 π¬ 1 π 0
Our second paper is on finding bugs in graph libraries and databases. The key insight behind "Graph-Cutting" is that graph algorithms are sensitive to graph structures. By dividing one graph into multiple subgraphs, we can infer checkable relationships between the results while stressing the system.
19.06.2025 08:09 β π 1 π 0 π¬ 1 π 0
We will present "Affine Equivalent Inputs", an approach to finding logic bugs in spatial databases like PostGIS. The key contribution of the paper is a new test oracle. If we apply an affine transformation like rotation to two geometries, topological relationships like intersection are preserved.
19.06.2025 08:09 β π 1 π 0 π¬ 1 π 0
Looking forward to attending my first database conference, SIGMOD 2025 (βͺ@sigmod2025.bsky.socialβ¬), in Berlin starting this weekend. We will present four works, all of which tackle the challenge of making data-centric systems reliable, and together have found more than 100 bugs in important systems!
19.06.2025 08:09 β π 7 π 0 π¬ 1 π 0
.βͺ@abhikrc.bsky.social is kicking off the Fuzzing and Software Security Summer School 2025! Looking forward to a week full of exciting talks and tutorials as well as interactions.
26.05.2025 01:05 β π 10 π 0 π¬ 0 π 0
π I'll be launching the Formal Methods Engineering Lab (manchester-fme.github.io) β and I am hiring!
If youβre interested, feel free to reach out.
22.05.2025 12:54 β π 11 π 6 π¬ 0 π 0
This is something we've been working on for a while, and now are thrilled to share.
Veil: the first foundational framework that combines SMT-based and interactive proofs about distributed protocols, seamlessly, in Lean!
Tool: github.com/verse-lab/veil
Paper: verse-lab.github.io/papers/veil-...
14.04.2025 08:52 β π 38 π 5 π¬ 1 π 0
FUZZING'25 Workshop @ ISSTA
The 4th International Fuzzing Workshop (FUZZING) 2025 welcomes all researchers,
scientists, engineers and practitioners to present their latest research findings,
empirical analyses, t...
#FUZZING'25 Deadline Extension
ββββββ
If you have not finished your #FUZZING paper yet, you are in luck! :) We decided to extend the deadline to March 26, 2025!
π fuzzingworkshop.github.io
//cc @rohan.padhye.org, LΓ‘szlΓ³ Szekeres,
@ruijiemeng.bsky.social, @mboehme.bsky.social
21.03.2025 12:26 β π 3 π 7 π¬ 0 π 0
Need a fuzzing harness? No time to write one? Tired of false-positives? Let OGHarn lead the way to bug discovery!π
I'm excited to share my paper(with @snagycs.bsky.social)"No Harness, No Problem: Oracle-guided Harnessing for Auto-generating C API Fuzzing Harnesses" at @icseconf.bsky.social 2025!
17.03.2025 18:52 β π 17 π 6 π¬ 1 π 0
SQLancer will be part of the Google Summer of Code (GSoC) program again this year! We are looking for motivated contributors as well as database development teams who would like to have their systems tested. summerofcode.withgoogle.com/programs/202...
02.03.2025 09:48 β π 11 π 4 π¬ 0 π 0
#FUZZING'25 CALL FOR PAPERS
ββββββ
β¨ New OC members:
* Ruijie Meng (@ruijiemeng.bsky.social; NUS)
* Rohan Padhye (@rohan.padhye.org; CMU).
β¨ New paper type: Fuzzing Nuggets (short papers).
π fuzzingworkshop.github.io
π
20.March (Submission)
π
17.April (Notification)
π
28.June (Workshop)
17.02.2025 18:40 β π 19 π 11 π¬ 1 π 0
ICSE'25: research paper
BINSEC: ICSE'25: research paper
How to detect backdoors efficiently?
βΆοΈ Backdoors were found in firmware & open-source code
βΆοΈ Detection requires much manual reverse-engineering
βΆοΈ Fuzzers cannot see backdoors
Our @icseconf.bsky.social preprint on finding backdoors with fuzzing is at binsec.github.io/nutshells/ic...
13.02.2025 12:29 β π 3 π 1 π¬ 0 π 1
The upcoming ICFP/SPLASHβ25 will feature 23 (!!) co-located workshops, in addition to the main conferences, doctoral symposium, Onward!, and SAS, and we canβt wait to see everyone in Singapore in October.
Hereβs a sneak peek of the hallway track location.
12.02.2025 16:41 β π 11 π 1 π¬ 0 π 0
Microsoft Forms
π’π’VLDB26 (@vldb.bsky.social) will have a ShadowPC! Apply at forms.office.com/e/XYihKj4UKq by March 1! Great opportunity for folk who are new to the data management community & want to gain paper reviewing experience. More info at application form. Chairs: @zistvan.bsky.social, Tianzheng Wang, & I.
10.02.2025 17:26 β π 11 π 6 π¬ 0 π 0
Siesta 2025 is coming: 5th Software Engineering Summer School, August 27-29, Lugano, Switzerland. Stay tuned! siesta.si.usi.ch/2025/
10.02.2025 07:43 β π 4 π 2 π¬ 0 π 0
Blog post titled "Concurrency bugs in Lucene: How to fix optimistic concurrency failures" By Benjamin Trent and Ao Li (February 7, 2025)
Text reads: "Thanks to Fray, a deterministic concurrency testing framework from CMUβs PASTA Lab, we tracked down a tricky Lucene bug and squashed it"
Back to basics: Concurrency testing in Java!
Our new tool *Fray* correctly solves a 25+ year old problem for real-world software. See this feature from Elastic Labs about Fray's contributions to Lucene.
π°: www.elastic.co/search-labs/...
π§: github.com/cmu-pasta/fray
π: arxiv.org/pdf/2501.12618
07.02.2025 21:57 β π 20 π 4 π¬ 2 π 0
Our paper on efficient automated exploit generation has been accepted to USENIX Security '25.
The gist: instead of generating individual attacks, we synthesise the whole *programming language* that expresses many exploits and guarantees their realisability.
Paper: ilyasergey.net/assets/pdf/p...
02.02.2025 06:35 β π 47 π 7 π¬ 2 π 0
Congratulations to the new #ACM Fellows of 2024, including Satish Chandra, Marsha Chechik, Derek Dreyer, Tim Menzies, Mira Mezini, Bashar Nuseibeh, Abhik Roychoudhury @abhikrc.bsky.social, and Ben Zorn. What a great year this is!
23.01.2025 07:31 β π 32 π 4 π¬ 0 π 1
There are no assignments, but we'll be having a hackathon. The speakers will present their topics of expertise and what they think is most exciting. Thus, there will also be no curriculum. We will upload talk abstracts closer to the start of the summer school.
23.01.2025 06:06 β π 0 π 0 π¬ 1 π 0
We will hold the Fuzzing and Software Security Summer School 2025 in Singapore from May 26 to 30. Please consider registering or sharing the info with anyone you think would be interested. Registration will be open until 16 Feb. fuzzing.comp.nus.edu.sg @abhikrc.bsky.social @umathur.bsky.social
21.01.2025 15:39 β π 14 π 9 π¬ 1 π 1
18.01.2025 15:47 β π 3 π 0 π¬ 1 π 0
Followed the call of an Oriental Pied Hornbill that lead me to a pair of Buffy Fish Owls. Only my second time to see wild owls in Singapore!
18.01.2025 15:47 β π 6 π 0 π¬ 1 π 0
An exciting talk by Armando Solar-Lezama on applying PL ideas for ML, with applications in robotics and computational biology. You canβt do cooler than that.
Also, a second SIGPLAN Milner Award recipient giving a talk at NUS in the last two weeks.
16.01.2025 06:27 β π 17 π 1 π¬ 0 π 0
One week today is the deadline for expressing your interest to attend Programming Language Implementation Summer School (PLISS) -- we have a great program, if I do say so myself! pliss.org/2025/
14.01.2025 11:03 β π 23 π 16 π¬ 2 π 1
At the forefront of research with world-wide top class researchers, to ensure that software is safe, reliable and efficient.
Professor in Software Engineering @ Universidad de Sevilla (Spain). Website: https://personales.us.es/sergiosegura/
The 40th IEEE/ACM International Conference on Automated Software Engineering β https://conf.researchr.org/home/ase-2025
CTO Office / Head of Infrastructure & Security Research at Bloomberg. Interested in software, systems, and network verification at world scale.
The CISPA Helmholtz Center for Information Security is a German national Big Science Institution within the Helmholtz Association. We research information security in all its facets.
Data visualization researcher. Associate Prof of CS at the University of Utah. Lab website: https://vdl.sci.utah.edu. Co-founder of
datavisyn: https://datavisyn.io
University of Genova, Italy
Associate Professor @ IMDEA Software Institute. Doing research on security at the hardware/software interface. Find me also at @mguarnieri@infosec.exchange
Researcher, professor of Computer Science at Vrije Universiteit Brussel, Belgium
PhD student at TU Wien.
mcschroeder.github.io
Senior Lecturer in Cyber Secutity at @UniMelb & ARC DECRA Fellow. Prev @MonashInfotech & @NUSComputing . (Fuzz) Testing enthusiast.
Ph.D. Student at the University of Utah | Computer Security
https://gabe-sherman.github.io/
We are a volunteer-led team within SIGPLAN that provides audio, livestreaming, recording, remote participation and archiving for SIGPLAN conferences.
https://sigplan.org/AV
Every event we cover is available on YouTube:
https://youtube.com/@acmsigplan
Programming languages; Postdoc at INRIA/ENS; Incoming Assistant Professor at Tufts CS
https://continuation.passing.style/
Assistant professor at TU Wien and associate faculty at the Complexity Science Hub. Previously: KTH, Brown, Uni Wien. Interested in social network analysis and graph algorithms. Connecting theory π€ practice.
More info: https://neumannstefan.com.
Principal Researcher at Consensys Diligence, previously at ETH ZΓΌrich, UT Austin, Microsoft Research, and Google
PhD student in the Programming Group at the University of St. Gallen
making Infrastructure as Code more reliable
https://david-spielmann.ch/
Applied probabilist. Probability, MCMC, optimization, information theory, TCS.
https://mchchoi.github.io/