F. Javier Rubio

@fjrubio.bsky.social

Lecturer at the Department of Statistical Science of UCL. All opinions my own. 🇲🇽🇬🇧 https://sites.google.com/site/fjavierrubio67/ #rstats #JuliaLang #Bayesian #Statistics #Biostatistics

871 Followers  |  365 Following  |  47 Posts  |  Joined: 08.01.2024  |  2.0967

Latest posts by fjrubio.bsky.social on Bluesky

Approaches for modelling survival time in groups with very low risk I'm currently working on a study with a group of hematologists. Patients with PV (Polycythemia vera) have very low risk of future thromboembolism after diagnosis due to disease management. Patients

I've got 1 event in one group in which we know the risk of the outcome is very low.

This is creating enormous confidence intervals. Can I use Firth penalization? Should I get more data? All ideas welcome.

stats.stackexchange.com/questions/66...

07.08.2025 02:06 | 👍 12    🔁 5    💬 8    📌 0
4-panel comic. (1) [Person 1 with ponytail flanked by person with short hair and another person speaking into microphone at podium] PERSON 1: In the early 2010s, researchers found that many major scientific results couldn’t be reproduced. (2) PERSON 1: Over a decade into the replication crisis, we wanted to see if today’s studies have become more robust. (3) PERSON 1: Unfortunately, our replication analysis has found exactly the same problems that those 2010s researchers did. (4) [newspaper with image of speakers from previous panels] Headline: Replication Crisis Solved

Replication Crisis

xkcd.com/3117/

21.07.2025 23:54 | 👍 4862    🔁 651    💬 29    📌 28
Conformalized Regression for Continuous Bounded Outcomes Regression problems with bounded continuous outcomes frequently arise in real-world statistical and machine learning applications, such as the analysis of rates and proportions. A central challenge in...

New preprint with Fabrizio Leisen and our PhD student Zhanli Wu:

"Conformalized Regression for Bounded Outcomes"

arxiv.org/abs/2507.14023

#rstats #conformal #prediction

R code and data are also available at:

github.com/ZWU-001/CPBo...

21.07.2025 08:19 | 👍 8    🔁 1    💬 0    📌 0
Research Studentships

🚨 3 Departmental PhD Studentships have now become available. The deadline for applications is 31 July. Available for overseas and home students.

www.ucl.ac.uk/statistics/p...

11.07.2025 09:13 | 👍 1    🔁 3    💬 0    📌 0

The latest issue of the ISBA bulletin, containing a call for contributed discussions for two papers:

06.07.2025 12:20 | 👍 2    🔁 1    💬 0    📌 0
5-panel comic. (1) [teacher with long hair next to whiteboard] TEACHER: I’m supposed to give you the tools to do good science. (2) [teacher addressing students] But what *are* those tools? Methodology is hard and there are so many ways to get incorrect results. What is the magic ingredient that makes for good science? (3) TEACHER: To figure it out, I ran a regression with all the factors people say are important: [embedded list in sub-panel, cut off at end] Outcome variable: correct scientific results. Predictors: collaboration; skepticism of others’ claims; questioning your own beliefs; trying to falsify hypotheses; checking citations; statistical rigor; blinded analysis; financial disclosure; open data (4) TEACHER: The regression says two ingredients are the most crucial: 1) genuine curiosity about the answer to a question, and 2) ammonium hydroxide. (5) STUDENT: Wait, why did *ammonia* score so high? How did it even get on the list? LONG HAIR: ...And now you’re doing good science!

Good Science

xkcd.com/3101/

12.06.2025 20:28 | 👍 3522    🔁 629    💬 24    📌 34
Call for discussion papers 2025: Innovative usages of natural experiments and causal inference in statistics and data science 👇

rss.org.uk/news-publica...

03.06.2025 11:19 | 👍 2    🔁 2    💬 0    📌 1
a novel discrepancy measure My friend EJ Wagenmakers, along with his colleague Raoul Grasman, have proposed a novel measure of discrepancy between two distributions that they formulate through a basic Bayesian lens as an exp…

A novel discrepancy measure:

xianblog.wordpress.com/2025/06/01/a...

02.06.2025 14:54 | 👍 4    🔁 0    💬 0    📌 0
GitHub - FJRubio67/HazReg: Parametric hazard-based regression models (R package)

Not a proper ML guy, but I have used this trick a few times:

1. In the log-likelihood of relative survival models.

github.com/FJRubio67/Ha...

2. Some “intractable likelihood” models:

rpubs.com/FJRubio/AMLE...

24.05.2025 17:07 | 👍 2    🔁 0    💬 0    📌 0
Generalized Additive Models | Annual Reviews Generalized additive models are generalized linear models in which the linear predictor includes a sum of smooth functions of covariates, where the shape of the functions is to be estimated. They have...

A nice overview of GAMs:

"Generalized Additive Models" by Simon N. Wood

doi.org/10.1146/annu...

15.05.2025 08:31 | 👍 3    🔁 0    💬 0    📌 0

Interesting to see cancer epidemiology papers in the top 10:

15.04.2025 11:53 | 👍 2    🔁 0    💬 0    📌 0

3 year PDRA position in the OCEAN project (https://oceanerc.com/) to work at Warwick with Gareth Roberts, Adam Johansen and other OCEAN Researchers on a range of topics around scalable distributed computation. Deadline 10th April 2025 at 11.55pm UK time. Details at
warwick-careers.tal....

25.03.2025 12:00 | 👍 3    🔁 1    💬 0    📌 0

Or, to clarify notation:

12.03.2025 13:44 | 👍 2    🔁 0    💬 1    📌 0

Archive Request xkcd.com/3052

17.02.2025 23:52 | 👍 15268    🔁 1584    💬 122    📌 82

🎉 Happy 199th birthday, UCL! 🎉

We’re just one year away from celebrating 200 years of innovation, discovery, and impact.

UCL has always been a place of firsts. Now, we look ahead to an exciting bicentenary year!

Learn more: www.ucl.ac.uk/news/2025/fe...

#UCL200 #UCLFoundationDay #UCLBirthday

11.02.2025 11:57 | 👍 59    🔁 27    💬 0    📌 1

Hi #stats and #datascience community, we are compiling a list of resources for teaching multivariable thinking. Please feel free to share any resources that you might have developed and/or use in your own teaching. I'd appreciate it if you can help me spread the word.

10.02.2025 18:05 | 👍 18    🔁 9    💬 2    📌 2
CRiSM Event 2025

I'm really looking forward to the CRiSM 2.0 Conference warwick.ac.uk/fac/sci/stat... from 21st-23rd May. We've been lucky enough to get a really nice list of speakers; registrations open now if anyone is in the market for some interesting talks in May.

06.02.2025 20:52 | 👍 7    🔁 6    💬 0    📌 0
Principal Investigator positions at ELLIS Institute Finland | ELLIS Institute Finland Now recruiting new PIs in artificial intelligence and machine learning

I have big news: @ellis.eu has launched its 2nd major research center, @ellisfinland.bsky.social! I have agreed to start as founding director & the first call for PI positions is open. This is a major opportunity for outstanding researchers, join us! ellisinstitute.fi/PI-recruit

04.02.2025 14:40 | 👍 58    🔁 26    💬 3    📌 3
Problems caused by grade inflation | Statistical Modeling, Causal Inference, and Social Science

Problems caused by grade inflation
statmodeling.stat.columbia.edu/2025/01/19/p...

19.01.2025 15:46 | 👍 14    🔁 6    💬 1    📌 0

Interesting short paper about the debates on Bayesian inference in the 1950s

Bayesian issues in the 1950s: an episode involving Karl Popper and Jimmie Savage

- Stephen M Stigler

doi.org/10.1093/jrss...

17.01.2025 09:22 | 👍 4    🔁 1    💬 0    📌 0

Looks to be a really nice read; clear that the man enjoys writing!

arxiv.org/abs/2501.00925
'Fisher Information in Kinetic Theory'
- Cédric Villani

03.01.2025 09:55 | 👍 54    🔁 9    💬 2    📌 2
Diagram with large number: 2.7.123
First “2” is commented: Proud version. Bump when you are proud of the release
Second “7” is commented: Default version. Just normal/okay releases
Third “123” is commented: Shame version. Bump when fixing things too embarrassing to admit

I propose we replace semantic versioning with pride versioning

21.12.2024 19:07 | 👍 2534    🔁 737    💬 34    📌 52
Winter 2025 - Researcher positions in AI and machine learning - FCAI

Postdoc and doctoral student positions in developing Bayesian methods! The positions are funded by Finnish Center for Artificial Intelligence FCAI and there are many other topics, too, but if you specify me as the preferred supervisor then it's going to be Bayesian. fcai.fi/winter-2025-...

19.12.2024 15:34 | 👍 27    🔁 13    💬 1    📌 3

One #postdoc position is still available at the National University of Singapore (NUS) to work on sampling, high-dimensional data-assimilation, and diffusion/flow models. Applications are open until the end of January. Details:

alexxthiery.github.io/jobs/2024_di...

15.12.2024 14:46 | 👍 41    🔁 18    💬 0    📌 0

One of the risks in testing before submitting is that you may end up training ChatGPT. So, by the time the assessment is released, the AI may have already learned those questions. This is not certain, though, as we do not know how those AIs are trained.

11.12.2024 12:44 | 👍 1    🔁 0    💬 1    📌 0

That is quite good. I know several colleagues who have tested their assessments in ChatGPT and it gets very close to full marks.

11.12.2024 11:03 | 👍 1    🔁 0    💬 1    📌 0

Also as Last Author, so it also solves the problem in medical and epi journals!

11.12.2024 08:45 | 👍 1    🔁 0    💬 1    📌 0

rspatialdata: a collection of data sources and tutorials on downloading and visualizing spatial data with #rstats 💻🗺️📊

👉 rspatialdata.github.io

08.12.2024 18:08 | 👍 100    🔁 37    💬 4    📌 2
Ig Nobel Face-to-Face (video part 2 of 3) The Ig Nobel Prizes honor things so surprising that they make people LAUGH, then THINK. At the 2024 Ig Nobel Prize ceremony, ten new prizes were awarded. Two days later, most of…

Video of the Ig Nobel Face-to-Face meeting, which allowed time for discussion. I'm talking about our 350,757 coin flips. Also features the UvA work on drunken worms.
improbable.com/2024/12/03/i...

06.12.2024 08:01 | 👍 23    🔁 5    💬 1    📌 1

I can't see why not. Priors and posteriors quantify uncertainty; that does not mean the true generating model is changing every time.

08.12.2024 13:05 | 👍 5    🔁 0    💬 5    📌 0

@fjrubio is following 20 prominent accounts