Sebastian Paez

@jspaezp.bsky.social

Data scientist @ Talus Bio. Kind of into data, proteomics, open source and biology.

122 Followers  |  159 Following  |  25 Posts  |  Joined: 15.10.2023

Posts by Sebastian Paez (@jspaezp.bsky.social)

I feel like that question is a trap ...

26.11.2025 19:38 - 👍 3    🔁 0    💬 1    📌 0
ALT: a man in a green shirt says the horror while sitting on a couch
26.11.2025 19:05 - 👍 1    🔁 0    💬 1    📌 0

🚨 Has PRIDE helped your research?

Take 15 mins to tell funders why open data matters!
πŸ“Š Fill out the EMBL-EBI 2025 survey πŸ‘‰
www.surveymonkey.com/r/QGFMBH8?ch...

Your feedback helps keep PRIDE open, FAIR & impactful.
πŸ™ Please share!
#FAIR #OpenData #Proteomics #MassSpectrometry #PRIDE

18.06.2025 06:02 - 👍 10    🔁 12    💬 0    📌 0

How do you read from the wiff directly? (I am not super familiar with the state of SDKs for wiff/wiff2, thanks in advance)

13.05.2025 23:24 - 👍 1    🔁 0    💬 1    📌 0

I think @ypriverol.bsky.social might have some hard data on this one

06.05.2025 18:42 - 👍 1    🔁 0    💬 0    📌 0

In the past I have written scripts to delete anything more than X years old that is not a raw file, and that already seems to give a pretty good compromise. pdresults, pep.xmls, MaxQuant .peaks and that kind of stuff are massive files. (If I recall my past life correctly, they were taking up ~75% of the space)

05.03.2025 23:39 - 👍 1    🔁 0    💬 0    📌 0
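
The cleanup idea in the post above (delete anything older than X years that is not a raw file) could be sketched roughly as follows. This is a minimal sketch, not the author's script: the function names, the `RAW_SUFFIXES` set, and the use of file modification time as the "age" are all assumptions made for illustration, and the list of raw extensions would need adjusting per lab.

```python
import time
from pathlib import Path

# Assumption: suffixes treated as raw instrument data (never deleted).
# Vendor directories such as Bruker "sample.d" are matched by directory name.
RAW_SUFFIXES = {".raw", ".d", ".wiff", ".wiff2"}

SECONDS_PER_YEAR = 365.25 * 24 * 3600


def is_raw(path, root, raw_suffixes=RAW_SUFFIXES):
    """True if the file, or any directory above it (below root), has a raw suffix."""
    parts = path.relative_to(root).parts
    return any(Path(part).suffix.lower() in raw_suffixes for part in parts)


def find_stale_files(root, max_age_years, raw_suffixes=RAW_SUFFIXES, now=None):
    """Return sorted non-raw files under root last modified more than max_age_years ago."""
    root = Path(root)
    now = time.time() if now is None else now
    cutoff = now - max_age_years * SECONDS_PER_YEAR
    stale = []
    for path in root.rglob("*"):
        if not path.is_file() or is_raw(path, root, raw_suffixes):
            continue
        if path.stat().st_mtime < cutoff:
            stale.append(path)
    return sorted(stale)


def prune_old_results(root, max_age_years, dry_run=True):
    """List stale non-raw files; delete them only when dry_run is False."""
    stale = find_stale_files(root, max_age_years)
    if not dry_run:
        for path in stale:
            path.unlink()
    return stale
```

Running `prune_old_results("/path/to/archive", max_age_years=3)` with the default `dry_run=True` only lists candidates, which makes it easy to review what would go before deleting anything.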

Well that is false ... I calculated it for 120 TB, not 380 ... so it would actually be ~ 475 USD/month

05.03.2025 06:34 - 👍 1    🔁 0    💬 2    📌 0
Google Cloud Pricing Calculator Create your own Custom Price Quote for the products offered through Google Cloud based on number, usage, and power of servers

Couldn't help crunching the numbers on how much this would cost in the cloud ... turns out it's ~155 bucks/month cloud.google.com/products/cal...

05.03.2025 06:33 - 👍 1    🔁 0    💬 1    📌 0

On our way to #USHUPO25! Poor @jspaezp.bsky.social spending a cross-country flight sitting between not-on-bluesky-Andrea and me 😂

21.02.2025 17:15 - 👍 26    🔁 3    💬 1    📌 0

It also depends on the tool/acquisition method. Some can be understood as missing at random and some cannot (or at least different ratios of the two)

30.01.2025 10:01 - 👍 1    🔁 0    💬 0    📌 0

We’re building a new static type checker for Python, from scratch, in Rust.

From a technical perspective, it’s probably our most ambitious project yet. We’re about 800 PRs deep!

29.01.2025 17:18 - 👍 725    🔁 104    💬 35    📌 34
A screenshot of Atuin Desktop

we've been working hard on addressing early beta feedback for Atuin Desktop

in the past week

- dark mode (y'all kept asking)
- improved collaboration UX
- custom terminal fonts
- fixed several sync & performance issues

27.01.2025 14:17 - 👍 41    🔁 7    💬 6    📌 1

bsky is descending into degeneracy D:
bsky.app/profile/ucdp...

22.01.2025 22:42 - 👍 1    🔁 0    💬 1    📌 0

Anyone recommend a cheapish reader for scanning 2d barcodes? Best if it could scan an entire plate of Matrix tubes. Also looking at used/refurbished options specific for these or similar plates, if there's anything that is highly recommended.

18.01.2025 15:27 - 👍 1    🔁 2    💬 1    📌 0

@michaellazear.bsky.social & @jspaezp.bsky.social I'm running Sage on a dual Epyc 128 thread box w/ 2TB memory (I think). Searching 19 ddaPASEF files directly with built-in timsRUST!! Dang this is screaming fast!!

22.01.2025 02:43 - 👍 8    🔁 4    💬 3    📌 2

@swillems.bsky.social do you have any insights on this one? From the data I've seen, the pro data is a bit larger than the ultra series ... Maybe you used compression on the pro and not the ultra?

20.01.2025 16:30 - 👍 1    🔁 0    💬 1    📌 0

Just FYI ... LFQ is not supported in the releases yet, but we are thinking about how to have a good implementation for it. (We have an experimental implementation ... DM me if you want to try it out)

16.01.2025 16:16 - 👍 1    🔁 0    💬 0    📌 0

Why did the project name have a space ? What kind of savagery is that? :P

31.12.2024 17:15 - 👍 1    🔁 0    💬 1    📌 0
# Remove bsky non-mutual follows

from fastcore.utils import *
from fastcore.xtras import *
from atproto import Client
from fastprogress.fastprogress import master_bar, progress_bar

# Log in; the returned profile view carries the account DID and the counts
cli = Client()
pv = cli.login('youruser', 'yourpass')
did = pv.did

pv.posts_count,pv.followers_count,pv.follows_count

def at_paged(did, meth):
    "Return all pages of results from some method"
    resp = None
    while True:
        # First call passes cursor=None; later calls pass the previous page's cursor
        resp = meth(did, cursor=resp.cursor if resp else None)
        yield resp
        if not resp.cursor: return

posts = L(at_paged(did, cli.get_author_feed)).attrgot('feed').concat()

posts[-2].post.record

# Fetch every page of follows and followers
fws = L(at_paged(did, cli.get_follows))

frs = L(at_paged(did, cli.get_followers))

# Flatten the pages into single lists of profiles
following = fws.attrgot('follows').concat()
followers = frs.attrgot('followers').concat()

len(following),len(followers)

# Accounts that are followed but do not follow back
to_unfollow = set(following.attrgot('did')) - set(followers.attrgot('did'))
len(to_unfollow)

# Map each followed DID to the URI of the follow record, which unfollow() takes
followd = {o.did:o.viewer.following for o in following}

for fdid in progress_bar(to_unfollow):
    try: cli.unfollow(followd[fdid])
    except Exception as e: print(f"Could not unfollow {fdid}: {e}")

I used a few starter packs to help connect with my communities, but after a couple of weeks I noticed nearly all the posts I'm interested in are from folks that follow me back.

So I created an nb to unfollow non-mutual follows. Code in alt text, or here:
colab.research.google.com/drive/1V7QjZ...

02.12.2024 21:24 - 👍 75    🔁 6    💬 5    📌 1
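
The cursor-following loop in the script above can be tried offline before pointing it at a real account. The sketch below reuses the same `at_paged` logic but swaps in a stub: `FakeClient`, `Page`, and `non_mutual` are hypothetical names invented here for illustration and are not part of atproto; the stub only imitates the "return a page plus an optional cursor" shape.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Page:
    """One page of results plus the cursor for the next page (None on the last page)."""
    items: list
    cursor: Optional[str] = None


class FakeClient:
    """Hypothetical stand-in for a paged API client; not part of atproto."""

    def __init__(self, items, page_size=2):
        self.items = items
        self.page_size = page_size

    def get_follows(self, did, cursor=None):
        start = int(cursor) if cursor else 0
        end = start + self.page_size
        next_cursor = str(end) if end < len(self.items) else None
        return Page(self.items[start:end], next_cursor)


def at_paged(did, meth):
    "Yield every page returned by meth, following cursors until none is left"
    resp = None
    while True:
        resp = meth(did, cursor=resp.cursor if resp else None)
        yield resp
        if not resp.cursor:
            return


def non_mutual(following, followers):
    "DIDs that are followed but do not follow back"
    return set(following) - set(followers)


cli = FakeClient(["did:a", "did:b", "did:c", "did:d", "did:e"])
pages = list(at_paged("did:me", cli.get_follows))
following = [d for page in pages for d in page.items]
to_unfollow = non_mutual(following, ["did:b", "did:e", "did:z"])
```

Keeping the set difference in its own function makes it easy to inspect `to_unfollow` before any unfollow call is made.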
Advent of Code 2024

Gentle reminder that 2024 advent of code is up (yes I am a nerd and you can be one as well <3)!
Daily programming puzzles at Advent of Code #AdventOfCode adventofcode.com

02.12.2024 03:02 - 👍 4    🔁 0    💬 0    📌 0

This is a movie of actin filaments in a cultured cell. I like watching it and I hope you do too.

18.11.2024 05:50 - 👍 249    🔁 33    💬 8    📌 4

Congrats to Professor Graham Cooks who just surpassed 100,000 citations and hit the 150 h-index! More photos and story: www.chem.purdue.edu/media/news/2...

25.11.2024 19:49 - 👍 21    🔁 7    💬 1    📌 2

I think it is very interesting but I was wondering why your approach does not deal explicitly with missing values (more accurately, missing values are excluded from the CV calculation). Is the assumption that all missing values are missing at random here?

25.11.2024 19:10 - 👍 2    🔁 0    💬 1    📌 0

The Llama 3.2 1B and 3B models are my favorite LLMs -- small but very capable.
If you want to understand what the architectures look like under the hood, I implemented them from scratch (one of the best ways to learn): github.com/rasbt/LLMs-f...

20.11.2024 08:33 - 👍 141    🔁 16    💬 7    📌 1

Some of us like writing the software a lot more than any of those :P

15.11.2024 22:14 - 👍 2    🔁 0    💬 0    📌 0

@pride-ebi.bsky.social is now here. Now you can delete your X account if you are in proteomics.

13.11.2024 14:10 - 👍 8    🔁 1    💬 0    📌 0

#teamMassSpec here is a starting pack in case you just moved over, or in case you've been absent for a while. This is #proteomics or #massspec related. The pack is not comprehensive but it's a start.

go.bsky.app/HH7kqEh

10.11.2024 15:46 - 👍 89    🔁 51    💬 19    📌 1

That is what the personal GCP/AWS account is for ... For the massive proteomics raw data fun :)

13.02.2024 22:37 - 👍 1    🔁 0    💬 0    📌 0

You are right! I had forgotten about that, I have only used lisp to set something up in emacs five or six operating systems ago :P

15.12.2023 01:42 - 👍 0    🔁 0    💬 0    📌 0

It is definitely odd in its design but I think it has a lot of really interesting elements. I don't know any other language with ASTs as first-class citizens. It is ugly but I would not call it terrible.

15.12.2023 01:11 - 👍 0    🔁 0    💬 1    📌 0