Kenan Tang

@kenantang.bsky.social

CS PhD student at UCSB
AI for healthcare / LLM writing assistants / Image Editing
kenantang.github.io

388 Followers  |  1,207 Following  |  26 Posts  |  Joined: 20.11.2024

Posts by Kenan Tang (@kenantang.bsky.social)

Constructed using Nano Banana Pro, this dataset contains 28,000 2K-resolution images tracking the gradual destruction of image content across 100 consecutive edits.

Witness the collapse firsthand.

🔗 https://huggingface.co/datasets/kenantang/Banana100

#AI #ModelCollapse #NanoBananaPro #Google

🧵 2/2

06.02.2026 18:14 | 👍 1  🔁 0  💬 0  📌 0

AI agents can lead to an irreversible de-evolution of human knowledge 📉

As shown in the video, agentic models drive a cycle of decay: when they edit images iteratively, they introduce invisible noise that accumulates until quality collapse.

To quantify this decay, we built Banana100.

🧵 1/2

06.02.2026 18:12 | 👍 2  🔁 0  💬 2  📌 0
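
The accumulation described in the post above can be illustrated with a toy simulation. This is a minimal sketch, not the Banana100 pipeline: `simulate_edit` is a hypothetical stand-in that adds small Gaussian noise in place of a real editing model, and PSNR is an assumed quality measure chosen for illustration, not necessarily the one used for the dataset.

```python
import math
import random


def psnr(reference, candidate, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)
    return float("inf") if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)


def simulate_edit(pixels, rng, sigma=1.0):
    """Hypothetical stand-in for one editing round: adds small Gaussian noise."""
    return [min(255.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in pixels]


def decay_curve(pixels, rounds=100, sigma=1.0, seed=0):
    """PSNR against the original after each of `rounds` consecutive edits."""
    rng = random.Random(seed)
    current = list(pixels)
    curve = []
    for _ in range(rounds):
        # Each round edits the previous output, never the original.
        current = simulate_edit(current, rng, sigma)
        curve.append(psnr(pixels, current))
    return curve


# Synthetic "image": 4096 random pixel intensities (an assumption, not real data).
rng = random.Random(42)
original = [rng.uniform(0.0, 255.0) for _ in range(4096)]
curve = decay_curve(original)
print(f"PSNR after 1 edit: {curve[0]:.1f} dB")
print(f"PSNR after 100 edits: {curve[-1]:.1f} dB")
```

A single round changes each pixel by about one intensity level, which is hard to see, yet the error compounds because every round starts from the previous output.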

We are recruiting postdocs at @ai-ucsb.bsky.social!
With @haewonjeong.bsky.social and Yao Qin

You want to lead the future of AI4Science?

Apply to UCSB Real AI For Science Initiative 🌟
Deadline: Sept 15, 2025.

This is the view you'll have from... your desk!
By @adelemyers.bsky.social

22.08.2025 17:37 | 👍 7  🔁 4  💬 1  📌 2

The @cvprconference.bsky.social AI Art Online Gallery 2025 is now live 🥳

Featuring 100+ artworks across aesthetics, environment and identity.

Check it out 👇

thecvf-art.com

#CVPR2025 #CVPRAIart #creativeAI

11.06.2025 18:19 | 👍 6  🔁 2  💬 0  📌 0

🎨 Excited to share that my work is featured in the CVPR AI Art Gallery 2025!
Come and see how AI image generation can be controlled with surgical precision.

Link: thecvf-art.com/project/comp...

Thanks to @elluba.bsky.social and @cvprconference.bsky.social for hosting the event!
#CVPR2025

11.06.2025 16:38 | 👍 6  🔁 2  💬 0  📌 0

Flux.1 Kontext [pro] failing on an image editing task. The task is to add a backpack onto this bench. 100% failure rate. None of the prompt-based models, including GPT-4o and Gemini 2.5 Pro, have succeeded on this task.

30.05.2025 02:05 | 👍 2  🔁 0  💬 0  📌 0
NutriBench: the first dataset for evaluating LLMs at carbohydrate estimation
NutriBench is the first publicly available natural language meal description based nutrition benchmark.

📍 Join us at ICLR tomorrow (April 24) at 10 am, Hall 3 + Hall 2B #19, for our poster on NutriBench: the first publicly available natural language meal description benchmark for nutrition estimation!
Here's our webpage: mehak126.github.io/nutribench.h...
@dongx1997.bsky.social

#ICLR #AI4health

23.04.2025 15:43 | 👍 1  🔁 1  💬 0  📌 0

SPICE enables complex edits like gesture adjustment, action modification, and object addition with occlusion. SPICE is compatible with major diffusion model UIs (Automatic1111/ComfyUI) and supports popular models like Flux Dev, SDXL, SD1.5, and their variants.

17.04.2025 16:13 | 👍 2  🔁 0  💬 0  📌 0

Our team has officially open-sourced the SPICE image editing workflow.

Paper: arxiv.org/abs/2504.09697
Code: github.com/kenantang/sp...

17.04.2025 16:12 | 👍 4  🔁 0  💬 1  📌 0

📢 Open Source & Collaboration:

- Full GitHub release this April.

- Early testing available now.

Perfect for smart creative platforms and developers building advanced image processing tools. DM for early test access! 🚀

#AI #ImageEditing #OpenSource #SPICE

01.04.2025 16:07 | 👍 1  🔁 0  💬 0  📌 0

Example scenario: Adding a backpack onto a bench. Traditional methods suffer from spatial errors and distortion. SPICE uses a two-stage denoising method to ensure exact object placement every time.

01.04.2025 16:07 | 👍 0  🔁 0  💬 1  📌 0

4/ Resolution limits? Not here. SPICE natively handles any resolution (4K, vertical screens, ultra-wide) without cropping or compression. Total creative freedom.

01.04.2025 16:07 | 👍 0  🔁 0  💬 1  📌 0

3/ SPICE excels at spatial reasoning. With minimal user prompts, it accurately constructs complex 3D spatial relationships. Precise editing, simplified.

01.04.2025 16:06 | 👍 0  🔁 0  💬 1  📌 0

2/ Ever struggled with multi-step image distortion? SPICE enables ultra-long editing: 100+ iterations without degradation. Say goodbye to cumulative distortion issues!

01.04.2025 16:06 | 👍 0  🔁 0  💬 1  📌 0

1/ SPICE supports diverse artistic styles seamlessly: photorealistic, cartoon, or any LoRA-compatible art style. True cross-style adaptability, no compromises.

01.04.2025 16:06 | 👍 0  🔁 0  💬 1  📌 0

🧵 Our team just introduced SPICE, a novel image editing framework that significantly outperforms GPT-4o & Gemini 2.0 in single-step editing tests.

Here's why SPICE matters in AI-driven creative workflows: 👇

01.04.2025 16:05 | 👍 2  🔁 0  💬 1  📌 0

Could you further specify your question? The model needs the user to tell it what to edit in the image.

04.02.2025 19:58 | 👍 1  🔁 0  💬 1  📌 0

The image editing workflow we propose succeeds in editing a challenging image from Emu Edit. The prompt is "open the refrigerator door in the image".

15.01.2025 22:57 | 👍 1  🔁 0  💬 1  📌 0

PD and Abomination

#AiArt #DarkestDungeon

03.01.2025 13:57 | 👍 1  🔁 0  💬 0  📌 0

The results in the 4 images are generated with Flux Dev. Since the pipeline is training-free, any model can be used (SDXL, etc.). Also, there is no need for a specific inpainting checkpoint, which is not currently available for a wide range of base models.

01.01.2025 22:15 | 👍 1  🔁 0  💬 0  📌 0

Localized and precise image editing results from a pipeline we developed. Users only need to provide crude sketches and masks. No hyperparameter tuning or prompt engineering is needed. All results are from the first attempt.

Baseline: arxiv.org/abs/2402.17525

01.01.2025 22:09 | 👍 3  🔁 0  💬 1  📌 0

We have finished developing an open-source tool that does exactly this. Please refer to my previous posts for examples. The tool will be finalized and released soon.

01.01.2025 21:53 | 👍 0  🔁 0  💬 0  📌 0
Getting 50% (SoTA) on ARC-AGI with GPT-4o
You can just draw more samples

Six months ago someone put a for-loop around GPT-4o and got 50% on the ARC-AGI test set and 72% on a held-out training set redwoodresearch.substack.com/p/getting-50... Just sample 8000 times with beam search.

o3 is probably a more principled search technique...

21.12.2024 18:16 | 👍 137  🔁 23  💬 4  📌 6
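
The "draw more samples" idea in the post above can be sketched as sample-then-filter: draw many candidate programs, keep those consistent with the known examples, and vote on the answer. This is a toy illustration under stated assumptions, not the linked write-up's pipeline: `CANDIDATES` is a hypothetical four-function hypothesis space, and `sample_candidate` stands in for a model call.

```python
import random
from collections import Counter

# Hypothetical hypothesis space standing in for model-generated programs.
CANDIDATES = {
    "double": lambda x: 2 * x,
    "square": lambda x: x * x,
    "add_two": lambda x: x + 2,
    "negate": lambda x: -x,
}


def sample_candidate(rng):
    """Stand-in for one model call: returns a random (name, program) pair."""
    name = rng.choice(sorted(CANDIDATES))
    return name, CANDIDATES[name]


def solve_by_sampling(train_pairs, test_input, n_samples=200, seed=0):
    """Sample programs, keep those matching every training pair, then vote."""
    rng = random.Random(seed)
    votes = Counter()
    for _ in range(n_samples):
        _, program = sample_candidate(rng)
        if all(program(x) == y for x, y in train_pairs):
            votes[program(test_input)] += 1
    # None signals that no sampled program fit the training examples.
    return votes.most_common(1)[0][0] if votes else None


answer = solve_by_sampling([(1, 1), (3, 9)], test_input=5)
print(answer)
```

Only the squaring program fits both training pairs here, so every surviving vote agrees; with a real model the sample budget matters because most draws are discarded by the filter.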

Starter pack for #artists in #STEM.
#Artists creating #scientific #visuals (#illustration, #installation, etc.), and
#scientists creating artworks (not necessarily #sciart) are welcome.
Please spread the word and let me know if you would like to be added.
#science #art

go.bsky.app/NzXHtrF

23.11.2024 16:35 | 👍 26  🔁 9  💬 10  📌 0
We Looked at 78 Election Deepfakes. Political Misinformation Is Not an AI Problem.

In fact, @sayash.bsky.social and I have just published an essay with them, where we play our usual role of looking at the evidence and tamping down AI hype and fears instead of playing them up.
knightcolumbia.org/blog/we-look...

(Cross-posted to AI Snake Oil aisnakeoil.com/p/we-looked-...)

15.12.2024 14:23 | 👍 17  🔁 6  💬 2  📌 0
AIM-FM Workshop @ NeurIPS'24

🚨 Only 1 day to go! 🚨

Join us at AIM-FM: Advancements In Medical Foundation Models workshop at NeurIPS 2024!

📅 When: December 14th, 2024, 8:20 a.m. PST
📍 Where: East Ballroom A, B
🔗 Details: aim-fm-24.github.io/NeurIPS/

#AIM-FM #NeurIPS2024 #MedicalAI #FoundationModels

13.12.2024 08:14 | 👍 4  🔁 4  💬 2  📌 0
Who and What comprise AI Skepticism?
An attempt to do justice to a diverse community

buildcognitiveresonance.substack.com/p/who-and-wh...

11.12.2024 02:59 | 👍 0  🔁 0  💬 0  📌 0

"Sora is a data-driven physics engine."
x.com/chrisoffner3...

10.12.2024 12:42 | 👍 137  🔁 16  💬 12  📌 10

I just updated the translation span annotations from our EMNLP Findings paper. Llama-3.3-70B-Instruct is a free and powerful alternative to gpt-4-0125-preview on this task.

Paper: arxiv.org/abs/2410.00988

Demo: kenantang.github.io/cjk-idioms-gpt/

#AI #LLM #NLP #Translation

09.12.2024 22:47 | 👍 1  🔁 0  💬 0  📌 0
Don't Ride This Bike! Generative AI's persistent trouble with compositionality and parts
When the text-to-image AI generation system DALL-E2 was released in April 2022, the two of us, together with Scott Aaronson, ran some informal experiments to probe its abilities.

open.substack.com/pub/garymarc...

09.12.2024 05:44 | 👍 0  🔁 0  💬 0  📌 0