Iain Mathieson's Avatar

Iain Mathieson

@mathiesoniain.bsky.social

437 Followers  |  241 Following  |  48 Posts  |  Joined: 21.12.2024  |  2.275

Latest posts by mathiesoniain.bsky.social on Bluesky

Converse is people using the term "experiment" to describe some kind of data analysis.

23.10.2025 12:47 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

And Mary-Claire

18.10.2025 21:33 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Well I suppose specifically for a developmental trait like this I might think that the environmental assumptions of the twin study design would be violated - i.e. twins share more of the relevant (in utero) environment than sibs, leading to an overestimate of h2.

18.10.2025 13:57 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I don't think this really makes sense though; it's like how anthropologists sometimes talk about things like "potential height" which is the height you would reach if you had perfect nutrition. But it's not a phenotype.

17.10.2025 22:38 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Indeed, rates of right-handedness were increased by [an environment] in which people were forced to write right-handed.

17.10.2025 13:27 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I mean if someone is forced to the point where they use their right hand all the time, are they not right-handed?

17.10.2025 13:25 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Yeah, those are always inflated

17.10.2025 13:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Also I don't believe the heritability is 25%. UKB SNP heritability is less than 2%.

17.10.2025 01:22 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

It's still environment, even if it's not practically measurable! Also, handedness can modified [by changing the environment].

17.10.2025 01:19 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0

Not sure the individual variation is the issue; it looks like they see it in most samples but I wonder whether there could be an off-target effect in the PCR step. Would want to check whether the putative mutant reads are enriched for mismatches.

12.09.2025 14:52 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

The only bit of the ARG you own is your terminal branch.

02.08.2025 03:00 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

We'll have to fix that!

31.07.2025 12:59 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I don't know if it's better. They can certainly phase better, and this suggests they have access to UKB data: www.ukbiobank.ac.uk/projects/hig.... Plus they have in-house non-EUR panels (e.g. www.nature.com/articles/s42...)

30.07.2025 02:36 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Yes seems so, at least a couple of months ago I was able to download imputed data.

29.07.2025 21:53 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

True, I mean I guess that [people imputing their own genomes] is exactly what they don't want people doing. Also 23andMe gives you imputed data, possibly better than TopMed imputation anyway?

29.07.2025 12:56 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Maybe because they don't want people trying to split up their samples over many small jobs. If you just have one, I guess just add in 19 samples from 1000 Genomes?

29.07.2025 01:21 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

He chose.... poorly

24.07.2025 15:41 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I find Figure 7 more informative about the route than Figure 1

24.07.2025 14:10 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
We are a computational lab focusing on developing machine learning algorithms for biological data. Currently we are working on inference tasks related to natural selection and demographic inference, as well as generative models for genomic data from humans, mosquitos, and other species.

The Mathieson Lab at the University of Pennsylvania is hiring a computational postdoc in machine learning and evolutionary biology. Apply with a CV and references: smathi@sas.upenn.edu. More info: https://saramathieson.github.io/lab #postdoc

24.07.2025 12:23 โ€” ๐Ÿ‘ 19    ๐Ÿ” 15    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I think it makes very little difference - the high LD regions are small (and don't obviously have different patterns of cross-individual differences), so won't have much effect on the genome-wide differences.

23.07.2025 16:33 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

In practice the main use of LD pruning seems to be removing high ("long-range") LD regions that otherwise show up as individual PCs (e.g. see Fig1 etc here: academic.oup.com/bioinformati...). In some sense those are still "real" structure in the ARG, but it's kind of undesirable for some analyses.

22.07.2025 17:50 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I think you could probably bias it one way or another depending on how you do the LD pruning (LD in which population?). I think of it as like you're changing the relative weighting of different parts of the ARG.

22.07.2025 17:50 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

So |EUR-YRI| can be < |YRI-X| for some other African population X, due to structure within Africa. |EUR-YRI| can be < |YRI-YRI| for some PCs but if you sum the distances across enough PCs, weighting them appropriately, it will always be larger.

21.07.2025 03:37 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I see, but to the original point that's not how PCA works. I don't think |EUR-YRI| is ever going to be less than |YRI-YRI| in PC space โ€“ย again YRI-YRI can coalesce before the split. Of course if you only look at the first 2 (or first N) PCs, it can be, but that just depends on sampling.

21.07.2025 03:37 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

They are not measuring pairwise differences. If I understand their figure 4A, if one individual is 0/0 and another is 1/0 then they count that as S=1. But If the second individual is 1/1 then they also count that as S=1 (not S=2). So they undercount differences in individuals with low heterozygosity

21.07.2025 01:59 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Another way to think about it: the variants that fix in A during the bottleneck are equally likely to fix in the matching or non-matching state as a randomly chosen individual from B, so the expected number of differences does not change.

20.07.2025 20:37 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

On the other hand, Fst does behave the way you describe, because Fst between A and B depends on E[T_aa] โ€“ย which does depend on what happens on the A lineage โ€“ย as well as E[T_ab]. So Fst between A and B is > Fst between B and C.

20.07.2025 20:33 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

I don't think that's right. The number of pairwise differences is proportional to the TMRCA. a and b cannot coalesce until before the split (at which time all populations are symmetric) so E[T_ab]=E[T_bc] (=T+2N_anc) and does not depend on what happens on the A lineage.

20.07.2025 20:33 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

The bottleneck doesn't change the expected coalescence time between AFR and non-AFR lineages (equivalent to the genetic distance). It does change the expected coalescence time among non-AFR lineages.

20.07.2025 13:10 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

I think it's more to do with the structure within Africa; if you think about the tree, it's clear that e.g. Non-AFR vs East AFR should have fewer differences than East AFR vs West AFR etc. I don't think the OOA bottleneck changes the expected number of non-AFR vs AFR differences.

20.07.2025 13:06 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@mathiesoniain is following 20 prominent accounts