Sam Abbott's Avatar

Sam Abbott

@seabbs.bsky.social

Real-time infectious disease modelling. Developing methods for outbreak response, surveillance, and pandemic preparedness. samabbott.co.uk Come join me on the epinowcast forum: https://community.epinowcast.org/latest

1,431 Followers  |  702 Following  |  668 Posts  |  Joined: 18.12.2023  |  2.1406

Latest posts by seabbs.bsky.social on Bluesky

I imagine a very similar idea can be used for wastewater surviellance models

05.12.2025 11:59 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Oo this is fun!

05.12.2025 11:59 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Unfortunately due to speaker illness this seminar has had to be cancelled.

03.12.2025 12:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

In March 2024 I was reading R docs and learned 🧡

1. doc for NaN and is.nan() says: "β€˜NaN’ means β€˜Not a Number’" and "NA, β€˜_Not Available_’ which is not a number as well"

-> so neither is "a number".

03.12.2025 09:19 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 3    πŸ“Œ 0

Yes agree and/or the right people have both the tools and the data they need.

03.12.2025 10:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
David Hodgson – Epinowcast Epinowcast community site

#epinowcast seminar tomorrow at 3pm UK time/10am US East time: David Hodgson talking about Serological data and the seroanalytics suite of tools

www.epinowcast.org/seminars/202...

02.12.2025 17:46 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This is despite a lot of work and focus in this area over the last decade which I think flags that the incentives and approaches we are using as levers are flawed.

02.12.2025 15:15 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I still find it profoundly shocking we don't have the tools and methods we need to estimate many of these delays in real time at the kind of granularity needed in order to inform most measures robustly.

02.12.2025 15:15 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This is a nice piece and makes some good points. Something it mentions is that work is being done to make the tools we need but ...

02.12.2025 15:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Evaluation of the epidemiological outlook of the influenza A/H3N2 clade K in England during the 2025-26 season Key findings England is currently experiencing a high growth rate of infections caused by the influenza A/H3N2 K clade. Antigenic change from the previously dominant clade, a rapid selective sweep evi...

H3N2 preprint: there are concerns of a severe incoming influenza season due to the drifted H3N2 K clade. We at @psioxford.bsky.social analysed epi data and ran scenario models to see what we could discern about K clade transmission dynamics: zenodo.org/records/1770....

(1/18)

25.11.2025 13:51 β€” πŸ‘ 26    πŸ” 15    πŸ’¬ 1    πŸ“Œ 6
Model is:

b3 <- scasm(
  y ~ s(x0, bs = "bs", k= k) + s(x1, bs = "sc", xt = "m+", k = k) +
         s(x2, bs = "bs", k = k) + s(x3, bs = "bs", k = k),
  family=poisson, bs=200
)

The second smooth `s(x1) is a shape constrained smooth with a positive monotonicity constraint (xt = "m+").

The `bs = 200` arguments uses 200 boostrap samples, which generates bootstrap distributions for each coefficient in the model. These bootstrap samples respect the shape constraints, while the usual +/- 2 SE credible intervals may not.

The uncertainty in the partial effects is shown by two credible interval bands; a dark blue central band is a 68% Bayesian credible interval, while the lighter blue outer interval is a 95% Bayesian credible interval.

The background of each panel is light grey with white grid lines, in a similar style to ggplot2's default theme.

Model is: b3 <- scasm( y ~ s(x0, bs = "bs", k= k) + s(x1, bs = "sc", xt = "m+", k = k) + s(x2, bs = "bs", k = k) + s(x3, bs = "bs", k = k), family=poisson, bs=200 ) The second smooth `s(x1) is a shape constrained smooth with a positive monotonicity constraint (xt = "m+"). The `bs = 200` arguments uses 200 boostrap samples, which generates bootstrap distributions for each coefficient in the model. These bootstrap samples respect the shape constraints, while the usual +/- 2 SE credible intervals may not. The uncertainty in the partial effects is shown by two credible interval bands; a dark blue central band is a 68% Bayesian credible interval, while the lighter blue outer interval is a 95% Bayesian credible interval. The background of each panel is light grey with white grid lines, in a similar style to ggplot2's default theme.

A new release of the mgcv #RStats πŸ“¦ is out on CRAN and Simon Wood (U Edinburgh) has added some significant new features despite the small bump in version number:

🌟 scasm() for estimating GAMs with shape constrained smooths. Can be used with any family & smoothness selection is via the EFS method

12.11.2025 11:28 β€” πŸ‘ 95    πŸ” 24    πŸ’¬ 3    πŸ“Œ 5

For some reason it is much more of a pleasure when it is some software or small idea than a whole paper. Maybe because we have less formal ways to refer to those?

12.11.2025 14:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
GitHub - epinowcast/hashprng: Hash-based Matching Pseudo-Random Number Generation Hash-based Matching Pseudo-Random Number Generation - epinowcast/hashprng

which evaluates and builds on some of the work in hashpring
(which really was totally down to Carl than me).

github.com/epinowcast/h...

12.11.2025 14:07 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Methods for Reproducible Comparison of Strategies in Stochastic Modelling Stochastic simulations are often used to model real-world phenomena such as infectious disease dynamics. In this modelling, differing strategies are often compared to one another by comparing the mode...

Always a pleasure when you see a package or some scratch work being used as part of a longer form piece of work.

Nice paper here by Rob Sunnucks et al. www.medrxiv.org/content/10.1...

12.11.2025 14:07 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Ten Simple Rules for AI-Assisted Coding in Science While AI coding tools have demonstrated potential to accelerate software development, their use in scientific computing raises critical questions about code quality and scientific validity. In this pa...

Ten Simple Rules for AI-Assisted Coding in Science arxiv.org/abs/2510.22254 - our latest, led by @ericwbridgeford.bsky.social

28.10.2025 13:12 β€” πŸ‘ 75    πŸ” 32    πŸ’¬ 0    πŸ“Œ 2

Really nice to see this being used - a real blast from the past. If anyone is keen to help get this back on CRAN or do other work on it that would be amazing!

11.11.2025 10:34 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0
Logo for the #TidyTuesday Project. The words TidyTuesday, A weekly data project from the Data Science Learning Community (dslc.io) overlaying a black paint splash.

Logo for the #TidyTuesday Project. The words TidyTuesday, A weekly data project from the Data Science Learning Community (dslc.io) overlaying a black paint splash.

TidyTuesday is a weekly social data project. All are welcome to participate! Please remember to share the code used to generate your results!
TidyTuesday is organized by the Data Science Learning Community. Join our Slack for free online help with R and other data-related topics, or to participate in a data-related book club!

 How to Participate
Data is posted to social media every Monday morning. Follow the instructions in the new post for how to download the data.
Explore the data, watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about causation in the data.
Create a visualization, a model, a shiny app, or some other piece of data-science-related output, using R or another programming language.
Share your output and the code used to generate it on social media with the #TidyTuesday hashtag.

TidyTuesday is a weekly social data project. All are welcome to participate! Please remember to share the code used to generate your results! TidyTuesday is organized by the Data Science Learning Community. Join our Slack for free online help with R and other data-related topics, or to participate in a data-related book club! How to Participate Data is posted to social media every Monday morning. Follow the instructions in the new post for how to download the data. Explore the data, watching out for interesting relationships. We would like to emphasize that you should not draw conclusions about causation in the data. Create a visualization, a model, a shiny app, or some other piece of data-science-related output, using R or another programming language. Share your output and the code used to generate it on social media with the #TidyTuesday hashtag.

World map showing estimated TB incidence rates per 100,000 population. Countries range from light green (low incidence) to dark purple (high incidence), with highest rates in Sub-Saharan Africa and South/Southeast Asia. Map sourced from WHO and rendered using getTBinR::map_tb_burden() in R.

World map showing estimated TB incidence rates per 100,000 population. Countries range from light green (low incidence) to dark purple (high incidence), with highest rates in Sub-Saharan Africa and South/Southeast Asia. Map sourced from WHO and rendered using getTBinR::map_tb_burden() in R.

@dslc.io welcomes you to week 45 of #TidyTuesday! We're exploring WHO TB Burden Data: Incidence, Mortality, and Population!

πŸ“ https://tidytues.day/2025/2025-11-11
πŸ“° https://samabbott.co.uk/getTBinR/index.html

#RStats #PyData #JuliaLang #DataViz #tidyverse #r4ds

10.11.2025 13:48 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 2

Liked this: "In conclusion, thinking about modelling as experimentation does not cage creativity; it structures it, giving us better tools to ask sharper questions, present clearer answers and build cumulative science."

10.11.2025 16:23 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

I really enjoyed this - luckily found it just before having a call with John about workflows for IDM vs just after.

10.11.2025 16:22 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

You're absolutely right! 😜

06.11.2025 16:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

(I mean this all in a context of talking about code not day to day!)

06.11.2025 14:19 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

For example, I really like communicating briefly and do so with LLMs as it works for them. However, with people its important to add qualifies etc.

Similarly, in a review the LLM can be told to change something small on bulk without a worry but for a person this might take a really long time.

06.11.2025 14:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I spend a lot of my day interacting with LLMs and reviewing LLM generated content (mostly all via Claude Code). I'd really like it to communicate in a way that is more clearly not human as I find sometimes that when I talk to actual people I might not have switched communication style.

06.11.2025 14:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

5 minutes!

05.11.2025 14:55 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This is today at 3pm UK time

05.11.2025 12:11 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Lina Cristancho Fajardo - Accounting for heterogeneity in mosquito exposure is necessary to forecast chikungunya outbreaks in Reunion Island – Epinowcast Epinowcast community site

For this week’s #epinowcast seminar (as ever the first Wednesday of the month at 3pm UK time) we have Lina Cristancho Fajardo talking about:

Why heterogeneity in mosquito exposure is necessary to forecast chikungunya outbreaks in Reunion Island.

www.epinowcast.org/seminars/202...

04.11.2025 10:51 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1

Now I'm also looking for a research software engineer to implement a pile of research results to R packages loo, posterior, bayesplot, projpred, priorsense, brms or/and Python packages ArviZ, Bambi and Kulprit. Apply by email with no specific deadline (see contact info at users.aalto.fi/~ave/)

03.11.2025 11:13 β€” πŸ‘ 54    πŸ” 51    πŸ’¬ 2    πŸ“Œ 2
Preview
Untargeted longitudinal ultra deep metagenomic sequencing of wastewater provides a comprehensive readout of expected and unexpected viral pathogens Wastewater surveillance has become a powerful tool to monitor circulating viruses at a community level. Currently, most wastewater surveillance efforts use target-based approaches such as quantitative...

Can you take a quarter cup of composite sewage, simply ask β€˜what’s in there?’, and find out all of the pathogens circulating in that community?

That is the question we asked in our latest pre-print.

Turns out you can.
1/
www.medrxiv.org/content/10.1...

31.10.2025 12:19 β€” πŸ‘ 118    πŸ” 36    πŸ’¬ 6    πŸ“Œ 5

I just had a look for tools etc to do some data analysis on this for an individual or research more generally but in the quick skim I had time for didn't find anything.

Does anyone know of good resources?

31.10.2025 10:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Had another similar conversation about software papers being a bit pointless and just let people cite the software.

31.10.2025 10:19 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@seabbs is following 20 prominent accounts