GPU‑Thieving Intern Wins NeurIPS Best Paper

Key Points

  • An intern at ByteDance (TikTok’s parent) sabotaged colleagues’ AI training pipelines to free up a large number of GPUs for his own use; he was fired in August 2024 and later sued for roughly $1.1 million.
  • The intern, Keyu Tian, used the stolen compute time to develop the paper “Visual Autoregressive Modeling: Scalable Image Generation via Next‑Scale Prediction,” which moves image generation beyond next‑token or next‑pixel prediction toward predicting larger units of an image at once.
  • Despite the theft, Tian submitted the paper to NeurIPS (the premier AI conference) and, after a blind review that judged the work solely on merit, the conference awarded it Best Paper in December 2024.
  • The award sparked controversy because the conference organizers knew who the author was when they recognized work done with stolen resources; ByteDance remains outraged, and its lawsuit against Tian in Beijing is still pending.

Full Transcript

**Source:** [https://www.youtube.com/watch?v=6A3NOedlPWI](https://www.youtube.com/watch?v=6A3NOedlPWI)

**Duration:** 00:04:54
0:00 This is the story of the craziest internship I have ever heard of. It happened in AI, and it's still unfolding. This person defrauded the company. They stole GPUs, which is the most precious resource in AI. They've been sued for a million dollars, and they're not done yet: they just won best paper at the most prestigious AI conference on the planet.

0:24 Their name is Keyu Tian, and he started at ByteDance, which is the parent company of TikTok, back in the middle of 2024, so like June-ish. Immediately things went wrong. His colleagues' models would fail. Their training runs would crash. Naturally, during large training runs, there would be small, innocuous file edits that would pass, and somehow the pipelines would be sabotaged, and no one figured out what was going on. But the net of it was he was able to change model weights, he was able to hack machines, and he caused enough of the AI training and research pipeline at ByteDance to fail that he freed up a significant number of GPUs, which he used for his own academic paper work. That's what he wanted. His whole goal was to get access to GPUs.

1:21 Well, when ByteDance figures this out in August, they terminate him. Bye-bye. Fired, fired for malicious interference. ByteDance then reports his behavior to his university and begins investigating the extent of the damages he's caused. They are very upset about this. But Tian isn't done writing. He keeps writing, and in October of 2024 he submits his research paper, "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction," to NeurIPS, which is the most prestigious AI conference on the planet.

1:55 Talk about, like, wow, right? The willingness to basically say, "Yeah, I stole the GPUs, but look at what I did. It's so incredible you have to look at this." That was what happened.

2:05 And if you're wondering what scalable image generation via next-scale prediction is: he's moving past just next-token prediction or next-pixel prediction and actually looking, in images, at how you can have a larger concept to translate scale more effectively. One of the things that is at the cutting edge of AI in late 2024 is how do you reason against larger chunks than just a token. We saw it very recently with DeepSeek V3 doing double-token prediction. We've seen it with a paper from Meta that's looking at reasoning across concepts. This is very much in that vein.

2:42 But apparently it was such a good paper that in December, very recently, the judges at NeurIPS blind-awarded the best paper at NeurIPS to Keyu Tian, the intern who stole the GPUs. And I say blindly because they measured the paper quality without looking at names. They didn't know this was who it was. Now, obviously the conference organizers knew who it was when they awarded it, and they still chose to award it, and there's a lot of controversy about awarding best paper to someone who stole compute.

3:15 And ByteDance is certainly mad about it, because when they saw that the paper was submitted using their stolen GPU time, they sued him in Beijing, demanding a public apology and demanding $1.1 million in damages, roughly. This was back in November. That court case is still pending. So this guy now has a court case for a million bucks against him, a best paper award at NeurIPS, and a massive controversy around what he did.

3:45 Where I come down on this, at the end of the day, is: you have someone who is brilliant enough that they can figure out how to hack the AI modeling pipeline of a major model builder and AI researcher, and they can do that for their benefit, and they can get a groundbreaking innovation out of it. You want to employ them. You just want them to have a very good manager with tight constraints. If you don't employ them, it will be worse, because they will figure out a way to contribute to this field. It is evident that they will not be stopped from contributing to the AI field. It's about whether you employ them or not.

4:21 So I would expect that someone in the model-maker space is going to decide to bite the bullet, cover the liability for the damages sued for or settle out of court, and get this guy employed, as long as they have very, very tight constraints, because they want the innovation in the house. They just don't want the liability that comes from him being a loose cannon. So we will see what happens. It's still unfolding, but the story of Keyu Tian is already the wildest internship story I have ever heard. You tell me if you've heard something wilder, but this is just nuts.
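
A rough illustration of the "next-scale" idea the speaker describes around 2:05, as opposed to predicting one token or pixel at a time. This is only a step-counting sketch, not the paper's model: the function names and the `SCALES` schedule are hypothetical, chosen for illustration. It assumes that a next-token generator emits one token of the image grid per step, while a next-scale generator emits an entire coarse-to-fine token grid per step.

```python
from typing import List


def next_token_steps(side: int) -> int:
    """Raster-scan generation: one sequential step per token in a side x side grid."""
    return side * side


def next_scale_steps(scales: List[int]) -> int:
    """Coarse-to-fine generation: one sequential step per scale.

    All tokens of a given scale are assumed to be predicted together in that step.
    """
    return len(scales)


def tokens_per_scale(scales: List[int]) -> List[int]:
    """Number of tokens emitted at each scale (an s x s grid per scale)."""
    return [s * s for s in scales]


# Hypothetical coarse-to-fine schedule ending at a 16x16 token grid.
SCALES = [1, 2, 4, 8, 16]

if __name__ == "__main__":
    print("next-token steps:", next_token_steps(SCALES[-1]))  # 256
    print("next-scale steps:", next_scale_steps(SCALES))      # 5
    print("tokens per scale:", tokens_per_scale(SCALES))      # [1, 4, 16, 64, 256]
```

Under these assumptions the same 16x16 grid takes 256 sequential steps token by token but only 5 steps scale by scale, which is one way to read the speaker's point about working with "larger chunks than just a token."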