sebastian

@noone.social.bsky.social

Data engineer. Interested in AI x Privacy.

85 Followers  |  376 Following  |  28 Posts  |  Joined: 09.01.2024  |  2.2045

Latest posts by noone.social on Bluesky

The so-called Department of Government Efficiency: We saved $1M per year by converting 14,000 magnetic tapes (70 year old technology for information storage) to permanent modern digital records

YOU DID WHAT?

07.04.2025 03:05 - 👍 5196    🔁 1017    💬 311    📌 629
The 2025 AI Engineering Reading List We picked 50 paper/models/blogs across 10 fields in AI Eng: LLMs, Benchmarks, Prompting, RAG, Agents, CodeGen, Vision, Voice, Diffusion, Finetuning. If you're starting from scratch, start here.

For papers have a look at this list: www.latent.space/p/2025-papers

05.02.2025 06:10 - 👍 0    🔁 0    💬 0    📌 0

Just remembered 'Pwned or Bot' by @troyhunt.com: using breach history as identity validation. Past pwns = digital footprints! Clever way to spot real humans in our bot-filled internet.

29.01.2025 15:49 - 👍 3    🔁 0    💬 0    📌 0
Developer Creates Infinite Maze That Traps AI Training Bots "Nepenthes generates random links that always point back to itself - the crawler downloads those new links. Nepenthes happily just returns more and more lists of links pointing back to itself."

"Let's say you've got horsepower and bandwidth to burn, and just want to see these AI models burn. ... It's also sort of an art work, just me unleashing shear unadulterated rage at how things are going."

love to see it

www.404media.co/developer-cr...

23.01.2025 16:13 - 👍 196    🔁 71    💬 1    📌 10
Dan Fixes Coin-Ops (@ifixcoinops@retro.social) Watching a mutual ask for printer recs and receive a chorus of tired tech folk going "Just get a Brother, they're fine" and man MAN Like this is actually kinda fascinating honestly, Brother is now t...

As others have said, Brother is a great choice.

retro.social/@ifixcoinops...
This explains why: "Brother's have remained consistently Fine I Guess, which now makes them the best printer manufacturer simply by virtue of them opting out of the Who Can Get Crappiest Fastest race"

21.01.2025 13:56 - 👍 2    🔁 0    💬 0    📌 0

Oh wow. That's super cool.

18.01.2025 08:41 - 👍 1    🔁 0    💬 0    📌 0
Five things privacy experts know about AI - Ted is writing things … and that AI salespeople don't want you to know!

This is an excellent primer on some of the privacy dangers posed by large scale AI, from a cybersecurity perspective. Written in clear language, it's the most accessible rundown I've seen yet on these topics!

desfontain.es/blog/privacy...

14.01.2025 11:31 - 👍 423    🔁 184    💬 31    📌 22

who is this for? that's what I can't wrap my head around - who wants to follow someone who's not real, and is posting about their regular day to day life except none of it is really happening? who is this *for*?

03.01.2025 11:01 - 👍 595    🔁 118    💬 38    📌 6

Volkswagen left an unprotected database with up to two years of sensitive personal data on 800k networked VW, Seat, Audi and Skoda cars accessible online, including names, user IDs, sensor and geolocation data.

CCC talk by Flüpke and @michaelkreil.bsky.social:
streaming.media.ccc.de/38c3/relive/...

28.12.2024 22:42 - 👍 94    🔁 52    💬 9    📌 9

💀

27.12.2024 18:29 - 👍 366    🔁 77    💬 43    📌 74
GitHub - OpenMined/30DaysOfFLCode: Official Repo for the 30DaysOfFLCode Challenge Initiative

@openmined.bsky.social's #30DaysOfFLCode challenge just wrapped up, and they've curated this fantastic resource list for learning about federated learning. Definitely worth bookmarking if you plan to explore FL in the future: github.com/OpenMined/30...

#FederatedLearning #PrivacyPreservingAI

22.12.2024 16:39 - 👍 1    🔁 0    💬 0    📌 0
it depends

In case you don't have time to read the EDPB opinion on AI training, here's a summary of pretty much every paragraph.

18.12.2024 15:35 - 👍 13    🔁 2    💬 0    📌 1

Used to hate finding typos after hitting send. Now I'm just like "Well, at least they won't think I'm a LLM."

14.12.2024 16:30 - 👍 1    🔁 0    💬 0    📌 0
Quantum Country A free introduction to quantum computing and quantum mechanics

quantum.country by @andymatuschak.org.

I haven't read it myself (yet) but the way this 'mnemonic book' is laid out looks awesome!

10.12.2024 11:02 - 👍 3    🔁 0    💬 0    📌 0

From a legal perspective this isn't really covered AFAIK (or there doesn't seem to be an issue with it). Even if it's your identical twin brother that uses services like these.

08.12.2024 19:56 - 👍 1    🔁 0    💬 0    📌 0
Remember That DNA You Gave 23andMe? The company is in trouble, and anyone who has spit into one of its test tubes should be concerned.

I think @carissaveliz.bsky.social touched on this topic in her book "Privacy Is Power". Highly recommended btw.

There is also this article by The Atlantic, but it is unfortunately paywalled.
www.theatlantic.com/health/archi...

08.12.2024 19:48 - 👍 3    🔁 0    💬 1    📌 0
Learning the ropes: why Germany is building risk into its playgrounds Lofty climbing towers are part of trend away from total safety and towards teaching children to navigate difficult situations

German playground designers embrace risk as learning tool "even if the consequence is the odd broken bone".

www.theguardian.com/world/2021/o...

07.12.2024 16:03 - 👍 0    🔁 0    💬 0    📌 0

Thanks for the article.

About encrypted DNS: I'm curious about choosing servers. While encrypted DNS solves the ISP plaintext issue, what makes certain servers more trustworthy beyond different jurisdictions? (personally, I'd avoid Google's DNS given their track record with user privacy)

07.12.2024 08:01 - 👍 0    🔁 0    💬 1    📌 0

Is the image just black for everyone else? Wondering if there was an error uploading or if it's just my client.

05.12.2024 21:03 - 👍 0    🔁 0    💬 0    📌 0
"How it works" from proofofhumanity.id

I wonder if something like this could work. But it might be too strong a barrier for new users who just want to try Bluesky out without having verified users able to vouch for them...

proofofhumanity.id
(ignoring the crypto aspect, just referring to the core idea itself)

05.12.2024 20:59 - 👍 0    🔁 0    💬 1    📌 0
Job ad:
Palantir Technologies
Privacy and Civil Liberties Software Engineer
New York, NY
Engineering /
Full-time /
Hybrid

::chefs kiss::

05.12.2024 01:44 - 👍 72    🔁 13    💬 6    📌 2

In the UK's case, GDPR was enacted before Brexit and is thus valid UK law.

04.12.2024 07:21 - 👍 1    🔁 0    💬 0    📌 0

Thanks to GDPR they need to offer this option to EU citizens and to non-EU citizens living in the EU. It also applies to some non-EU countries like Norway. I think Switzerland and the UK are special cases but seem to be covered here.

So yes, everyone in the EU & UK should be able to do this.

04.12.2024 07:20 - 👍 1    🔁 0    💬 1    📌 0
Screen shot of a dogshit New York Times article claiming that the next big real estate trend was purchasing property in the Metaverse

Do you guys remember this fucking bullshit lmao

01.12.2024 11:44 - 👍 4684    🔁 571    💬 104    📌 69

However, that doesn't really address the password sharing. But that might just be the "move fast and break things" of access controls.

01.12.2024 11:39 - 👍 1    🔁 0    💬 1    📌 0

Hanlon's razor: "Never attribute to malice that which is adequately explained by stupidity."

01.12.2024 11:35 - 👍 1    🔁 0    💬 1    📌 0

That being said, I nonetheless think the best approach is publishing IDs that point to the entries, not complete datasets. We cannot expect people wanting to delete their content to contact everyone who made these datasets public. The burden should be on those using the data, not the data subjects.

28.11.2024 19:42 - 👍 3    🔁 0    💬 0    📌 0

Other obvious downsides, beyond being time-intensive: it requires knowledge of the API or software (not a given for e.g. social scientists), and the data degrades as tweets get deleted over time, making comparisons across studies difficult.

28.11.2024 19:34 - 👍 1    🔁 0    💬 1    📌 0

Depending on the dataset, it is more than just a mild convenience. An extreme example is the 2016 US election dataset, which had 280 million tweets. With Twitter's API limitations it would take 32 days to retrieve the full dataset (if all tweets were still available).

28.11.2024 19:31 - 👍 1    🔁 0    💬 2    📌 0
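
The 32-day figure in the post above can be reproduced with a rough back-of-the-envelope sketch. The rate limits used here are assumptions and are not stated in the post: 100 tweet IDs per lookup request and 900 requests per 15-minute window. The function name `hydration_days` is hypothetical and only serves the illustration.

```python
import math


def hydration_days(total_tweets, ids_per_request=100,
                   requests_per_window=900, window_minutes=15):
    """Estimate how many days it takes to re-fetch tweets by ID.

    The default limits are assumptions for illustration, not values
    taken from the post.
    """
    # Number of lookup requests needed to cover every tweet ID.
    requests = math.ceil(total_tweets / ids_per_request)
    # Number of rate-limit windows needed to send those requests.
    windows = math.ceil(requests / requests_per_window)
    # Convert the total waiting time from minutes to days.
    return windows * window_minutes / (60 * 24)


print(f"{hydration_days(280_000_000):.1f} days")
```

Under these assumed limits the estimate comes out at a little over 32 days, which matches the number quoted in the post. Different limits, or deleted tweets, would change the result.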

A counter-argument is that keeping the control of data with the platform concentrates the power with the company behind it, as seen with X. Open data access enables e.g. third-party tools and federation.

So it seems that while we're losing control over data, we keep some control over the platform.

28.11.2024 19:25 - 👍 4    🔁 0    💬 0    📌 0