Matthew Yates's Avatar

Matthew Yates

@allomedna.bsky.social

Molecular Ecologist and Conservation Geneticist. I work a lot with fish, eDNA, and eRNA - with a soft spot for salmonids (very original, I know...)

164 Followers  |  163 Following  |  39 Posts  |  Joined: 11.12.2024
Posts Following

Posts by Matthew Yates (@allomedna.bsky.social)

Ludicrous. Absolutely ludicrous, especially coming from such a respected institution.

27.04.2025 06:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I’m honestly flabbergasted that this even occurred. The idea of an LLM acting as a SA survivor or a trained crisis councillor and then giving real people advice without them knowing the origin of that advice? Worse, being actively deceived about the origin of that advice?

27.04.2025 06:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

BUT EVEN IF that weren’t true, this is a gross violation of informed consent, which is a component of research ethics that is *extremely important*.

27.04.2025 06:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Before anyone tries to justify this on utilitarian grounds, the research is of dubious quality because Reddit is so riddled with bots that a substantial amount of their LLM interactions could have been bots talking to bots. So the data are fundamentally flawed from the get-go.

27.04.2025 06:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

How the hell the University ethics review process let this go is beyond me, but apparently the university is defending it?

27.04.2025 06:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Nobody in the online community had any idea this was going on. The community also has mechanisms to score whether users were β€˜convinced’ by the arguments of a poster (in this case AI). The LLM had more than 100 such comments that received this score. These are (theoretically, at least) real people.

27.04.2025 06:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
META: Unauthorized Experiment on CMV Involving AI-generated Comments

See:

www.reddit.com/r/changemyvi...

The researchers had the LLMs pose as users with expertise or experience in various subject areas (including SA) who then provided advice or arguments to users posting in the subreddit.

27.04.2025 06:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

What in the absolute f*ck

Apparently researchers at the University of Zurich conducted an experiment in the /r/changemyview subreddit where they assessed the ability of large language models to change people’s views.

This was allegedly done without any informed consent whatsoever.

27.04.2025 06:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
eDNAjoint: An R package for interpreting paired or semi‐paired environmental DNA and traditional survey data in a Bayesian framework Environmental DNA (eDNA) sampling is increasingly used in surveys of species distribution as a potentially sensitive and efficient monitoring method. Yet access to modelling tools designed specifi...

Just stumbled across this, another solid paper from the Kelly lab! Great to see work like this on making eDNA interpretation more accessible to end-users

besjournals.onlinelibrary.wiley.com/doi/10.1111/...

26.03.2025 18:45 β€” πŸ‘ 2    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0
Preview
Environmental DNA Haplotyping Reveals Dispersal Patterns of Invasive Bluegill Sunfish, Lepomis macrochirus, in Japan We developed an environmental DNA (eDNA)-based haplotyping approach to reveal the haplotype distributions of invasive bluegill sunfish in Japan. Our analysis detected a negative relationship between ...

Well this is just a super cool paper, using eDNA to track dispersal dynamics of invasive fish species based on haplotype number:

doi.org/10.1002/edn3...

13.03.2025 17:04 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Edit: Post 6 should say 'note that you take the mean AFTER scaling with 'b'

12.03.2025 15:14 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

And there you have it! Pretty happy with how this turned out, was a long collaboration with lots of different folks - huge thanks to Dr. Taylor Wilcox, Dr. Shannon Kay, Dr. Pedro Peres-Neto (@comecology.bsky.social), and Dr. Daniel Heath, who all were instrumental in developing this paper!

12.03.2025 15:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

20) But slope values for eDNA/biomass relationships will be higher for protists compared to bacteria due to their larger cell sizes.

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

19) Amongst single-cellular organisms, β€˜b’ will equal 0 since we expect eDNA to scale with numerical abundance (each cell contains ~approximately~ the same number of gene copies).

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

18) Compared to macro-organisms, eDNA/biomass slopes will be very low since capturing organisms whole produces lots of eDNA.

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

17) Amongst multicellular microorganisms (e.g., plankton, microinvertebrates) β€˜b’ will equal 1, as we expect eDNA to scale with biomass due to most eDNA being derived from capturing organisms whole-body.

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

16) Plants and turtles? Hard external surfaces and low metabolism -> high eDNA/biomass slopes compared to metabolically active groups like fish.

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

15) Amongst macro-organisms β€˜b’ will likely be close to ~0.75, and eDNA/biomass regression slopes are likely to be predominantly determined by metabolics and surface area characteristics.

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image Post image 12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

14) More broadly, the framework also lays out a foundation for modelling these relationships across taxonomic groups; we can thus make predictions about what the relative values of the beta coefficients in the relationship and the β€˜b’ scaling parameters will look like across taxonomic groups.

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

13) If only one of those relationships produces a good regression, then that means you probably need to directly try to estimate the value of β€˜b’ in your system!

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

12) If you correlate eDNA with biomass, you are assuming a β€˜b’ of 1 -> that inherently means that eDNA divided by mean population mass should correlate with organism abundance. If it doesn’t, you have a problem!

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

11) If you correlate eDNA with N, you are assuming a β€˜b’ of 0 -> that inherently means that eDNA multiplied by mean population mass should correlate well with biomass. If it doesn’t, you have a problem!

12.03.2025 15:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

10) This is what most studies have done – previous researchers typically presented the β€˜best-fit’ relationship for either eDNA/biomass or eDNA/abundance. We show that either assumption REQUIRES that adjusted eDNA should reflect the other variable – you should always present BOTH relationships

12.03.2025 15:07 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

9) Key take-home: Assuming eDNA correlates with numerical abundance assumes an allometric scaling coefficient of 0, and assuming eDNA correlates with biomass assumes an allometric scaling coefficient of 1.

12.03.2025 15:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

8) This can be achieved by jointly estimating both the allometric scaling coefficient (β€˜b’) AND jointly estimating the beta coefficient of the regression, while driving the regression through the origin.

12.03.2025 15:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

7) What’s REALLY cool is that the algebra shows that the two regressions (eDNA/N + eDNA/biomass) should share the same value of the beta coefficient (slope). Furthermore, predicted N and biomass must also satisfy the relationship that N multiplied by the mean mass of a population = biomass

12.03.2025 15:07 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

6) We algebraically demonstrate that eDNA data should scale with numerical abundance by dividing quantitative eDNA data by the mean of the mass values in a population raised to the power of an allometric scaling coefficient, β€˜b’ (not that you take the mean AFTER scaling to β€˜b’).

12.03.2025 15:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

5) We provide a series of equations you can β€˜adjust’ eDNA data with to simultaneously reflect both numerical abundance and biomass. This is based on theory from metabolic ecology, which has developed allometric scaling frameworks to model metabolic relationships, which we extend to eDNA.

12.03.2025 15:07 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

4) To predict numerical abundance and biomass from eDNA data, you have to have eDNA modelled on the x-axis in both relationships. This provides a bit of a conundrum – you’re trying to predict two variables from one!

12.03.2025 15:07 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0