Latest Changes (27/05/2025 - V6.16.2.0) - Forum announcements
βͺ
βͺ#EPPI-Reviewer new release:
Version 6.16.3.0 adds DeepSeek to the list of LLMs available.
Additionally, it includes significant improvements on how the Coding tools are shown/handled, and a new search type.
All details: eppi.ioe.ac.uk/cms/Default....
03.07.2025 13:39 β π 2 π 1 π¬ 0 π 0
Latest Changes (27/05/2025 - V6.16.2.0) - Forum announcements
βͺ#EPPI-Reviewer new release: Version 6.16.2.0 introduces a number different LLM models, which can be used for robot/auto coding tasks.
Before only one LLM/model was available.
This release also contains a bug fix and a security fix.
All details: eppi.ioe.ac.uk/cms/Default....
27.05.2025 13:30 β π 2 π 1 π¬ 0 π 0
Latest Changes (28/04/2025 - V6.16.1.0) - Forum announcements
#EPPI-Reviewer new release: Version 6.16.1.0 introduces the "Automation Logs", provides DIY access to LLM-Driven coding functions, supports "Priority Screening Simulations" and more.
All details: eppi.ioe.ac.uk/cms/Default....
28.04.2025 13:12 β π 1 π 1 π¬ 0 π 0
So, if you worry that LLMs (or so called AI) will inevitably deteriorate the quality of evidence synthesis work, and thus devalue it, perhaps we can help.
This tech can be used responsibly, with some effort. We at EPPI Reviewer/EPPI Centre are trying to do so, and trying to help others too.
π§΅ 18/18
13.02.2025 15:26 β π 1 π 0 π¬ 0 π 0
Meanwhile, staff at the @eppicentre.bsky.social, along with collaborators are busy designing, running, and reporting on "Studies within a review" (SWARS). This is done for 2 purposes:
1. Ensure our results can be trusted.
2. Produce published examples of responsible use & evaluation of LLMs.
π§΅ 17/n
13.02.2025 15:26 β π 1 π 0 π¬ 1 π 0
Wrapping up:
#EPPI-Reviewer now supports using an LLM (limited to GPT4o, for now) to run screening and data extraction tasks.
The LLM can be fed titles and abstracts or the full text.
All functionalities have been designed to *facilitate* (rather than obfuscate) their per-review evaluation.
π§΅ 16/n
13.02.2025 15:24 β π 1 π 0 π¬ 1 π 0
[One under-evaluated, or rather, borderline ignored issue is that of error propagation: the more steps are "automated", the more errors will happen at each different stage, and they are likely to compound non-linearly, undermining more and more the trustworthiness of results.]
π§΅ 15/n
13.02.2025 15:24 β π 1 π 0 π¬ 1 π 0
For example, the machine might make up the answer to the "how many participants?" question when supplied with an RCT protocol, rather than the actual study report.
Effect: every evidence synthesis effort, and every LLM-task therein NEEDS to be evaluated on its own and then in its full context
π§΅ 14/n
13.02.2025 15:23 β π 1 π 0 π¬ 1 π 0
LLMs are necessarily extremely sensitive to:
- Every specific prompt,
- The contents supplied,
- And even more, the interaction between the 2.
What we found early on is that most "hallucinations" happen when the question asked implies an assumption that isn't valid for a given βcontentβ.
π§΅ 13/n
13.02.2025 15:22 β π 0 π 0 π¬ 1 π 0
The guiding principles are conceptually simple, though.
Any new feature shipped out needs to be matched by effective, accessible to regular users, and not-too-onerous systems to evaluate in full how well it works.
This is paramount, because as a general rule, evaluations DO NOT generalise.
π§΅ 12/n
13.02.2025 15:21 β π 0 π 0 π¬ 1 π 0
Side effects are that we need to move fast, but move thoughtfully and effectively too.
Thus, from 2024 onwards, we developers have been in "marathon-length, sprint mode".
The main focus is to deliver facilities to leverage LLMs in ways that maximise the trustworthiness of results.
π§΅ 11/n
13.02.2025 15:20 β π 0 π 0 π¬ 1 π 0
We still challenge one another with such questions, but we are also all aware that we have fully committed to a general strategy.
1. LLMs will be used in Evidence Synthesis - no matter what our positions will be.
2. Thus, what we want to do is to help the field to use LLMs responsibly.
π§΅ 10/n
13.02.2025 15:19 β π 0 π 0 π¬ 1 π 0
For us working on #EPPI-Reviewer, this single clear "answer" opened up a multitude of new questions about what we *should* do, how to manage risks without curbing our effectiveness, and more importantly, on what kind of "effects" we wanted to have.
π§΅ 9/n
13.02.2025 15:18 β π 0 π 0 π¬ 1 π 0
The results are here: ceur-ws.org/Vol-3832/pap...
(With many thanks to Lena Schmidt, @kaitlynhair.bsky.social, Fiona Campbell, @clkapp.bsky.social, Alireza Khanteymoori, Dawn Craig and Mark Engelbert.)
Take home message was: yes, this kind of tech will be used in Evidence Synthesis - a lot.
π§΅ 8/n
13.02.2025 15:17 β π 0 π 0 π¬ 1 π 1
At the Hackathon, we were able to run a very small, but we think "carefully planned" study that we hoped could begin answering our initial question, and concurrently provide an example of how to run such studies responsibly.
π§΅ 7/n
13.02.2025 15:17 β π 0 π 0 π¬ 1 π 0
At the same time, it was very evident that lots of quick and dirty evaluations were being run and/or published, often flawed, not only because of their small sample size, but more worryingly, undermined by not-very-thoughtful methods and/or an excess of optimism.
π§΅ 6/n
13.02.2025 15:16 β π 1 π 0 π¬ 1 π 0
Developing collaborations and technology for evidence synthesis
An event series to develop open software for evidence synthesis
The only way to find out was to try. So we put together "proof of concept" functionalities, and put them to a preliminary test with the invaluable help of external collaborators, gathered together for the Evidence Synthesis Hackathon (2023 - www.eshackathon.org).
π§΅ 5/n
13.02.2025 15:15 β π 0 π 0 π¬ 1 π 0
This is important, because it can potentially mitigate or even remove the problem of (so called) hallucinations, and applied to many, if not most steps of evidence synthesis, broadly defined.
Thus, the 1st question was: can we really use LLMs and get reliable results?
π§΅ 4/n
13.02.2025 15:14 β π 0 π 0 π¬ 1 π 0
Back when GPT was "new", it was very clear to us that Large Language Models (LLMs) could be useful (and thus: disruptive) in the field of evidence synthesis, because in our use-case, one could ask to the machine questions, supplied along with the text that should be used to answer them.
π§΅ 3/n
13.02.2025 15:14 β π 0 π 0 π¬ 1 π 0
Latest Changes (12/02/2025 - V6.16.0.0) - Forum announcements
It took us 15 months or more to move from "proof of concept" to releasing a version of EPPI Reviewer that integrates comprehensive LLM-driven functionalities, available to all users (see: eppi.ioe.ac.uk/cms/Default....).
It's been quite a journey, so perhaps it's worth sharing some thoughts.
π§΅ 2/n
13.02.2025 15:13 β π 0 π 0 π¬ 1 π 0
Doing evidence synthesis and grappling with the ever-widening range of promises about using new Large Language Models to speed up the process?
Perhaps youβre worried about how such tools may be misused?
Yeah, we (the #EPPI-Reviewer team) feel the same.
A thread on our journey so far.
π§΅ 1/n
13.02.2025 15:11 β π 5 π 7 π¬ 1 π 3
Latest Changes (12/02/2025 - V6.16.0.0) - Forum announcements
#EPPI-Reviewer new release: Version 6.16.0.0 makes the LLM-driven auto-coding functionalities available (on demand) to all users. It also includes a new Machine Learning facility to retrospectively check Screening decisions.
More details are here: eppi.ioe.ac.uk/cms/Default....
12.02.2025 14:14 β π 4 π 3 π¬ 0 π 0
Latest Changes (06/12/2024 - V6.15.5.1) - Forum announcements
#EPPI-Reviewer new release: Version 6.15.5.1 is a "Single Feature" release, which extends the (on invitation, experimental) GPT-coding features, which can now operate against the full-text, instead of being limited to title and abstract.
All details are here: eppi.ioe.ac.uk/cms/Default....
06.12.2024 13:31 β π 6 π 3 π¬ 0 π 0
https://methods.cochrane.org/equity/
Ottawa, Canada
#InformationSpecialist with the Evidence Synthesis Team, NIHR PenARC, University of Exeter. Likes #SystematicReviews, books, wildlife, politics (occasionally). Villager.
Researcher, systematic reviewer and information scientist in information retrieval, automation, and research use at UCL SRI and EPPI Centre. Social science and health.
Retired (recovering) implementation scientist. Ottawa, Canada.
Interdisciplinary scientist specialising in modelling the complexities of conservation & sustainability, as well as evidence synthesis; with a sideline in pheasants. Researcher at NINA(Norwegian Institute for Nature Research), Trondheim.
Research Fellow, Adj. Associate Professor, Editor | Review of Education
#EvidenceSynthesis #AIEd #EdTech #StudentEngagement
Former (molecular) neurobiologist, now software developer (@eppicentre.bsky.social). Science junkie, evidence seeker, naΓ―ve philosophaster, music lover.
Getting more and more radical as I grow old.
Also at: @GraziosiSergio@social.esmarconf
Evidence synthesis + behavioural science at UCL β’ population health β’ interventions β’ environments β’ unpopular music β’ architecture (& morality) β’ "fun-sponge boffin" (Daily Star)
π https://profiles.ucl.ac.uk/92345-gareth-hollands
π London, UK
#UCL #EPPI-Centre researcher
https://profiles.ucl.ac.uk/33741
Researching evidence informed policy-making; evidence synthesis; social exclusion; public health; social gerontology; LGBTQ health.
Cymro, E17 taffia, Tiger keeper, He/Him
Associate Professor, EPPI Centre, Social Research Institute, UCL
Interests: social research, evidence synthesis, research methods, qualitative evidence synthesis, mixed-methods synthesis, intervention component analysis, qualitative comparative analysis
Associate Professor, Associate Director of EPPI-Centre, UCL, Co-Founder Southeast Asia Evidence Policy and Partnership -SEAEPP, Science to Impact for better health and well-being
Views are mine
Professor of Social Science at University College London | Head of UCL Social Research Institute | Chair of Release | Fellow of the Academy of Social Sciences | Interested in: drug policy, social science, critical theory, history | www.tobyseddon.com
#UCL and #EPPICentre researcher
https://profiles.ucl.ac.uk/48814-alison-o%27maraeves
I've been an evidence synthesist for more than two decades (although that includes previous incarnations as a 'meta-analyst' and more recently 'systematic reviewer')
Combining different ways of knowing β practical, emotional, cultural, scientific β for better academic knowledge and a better world https://www.ucl-ioe-press.com/research-for-all
Professor of Social Research & Policy
Co-Director of EPPI Centre, UCL
Evidence synthesis, computer science