Sanmi Koyejo Profile
Sanmi Koyejo

@sanmikoyejo

1,627
Followers
85
Following
1
Media
183
Statuses

I lead @stai_research at Stanford.

Stanford, CA
Joined September 2014
@sanmikoyejo
Sanmi Koyejo
5 months
"Are Emergent Abilities of Large Language Models a Mirage?" is a NeurIPS outstanding paper!🙌🏿 Congrats especially to the students @RylanSchaeffer @BrandoHablando & other awardees. If you want to learn more, check out the oral & poster 👇🏿this afternoon (Dec 14) 1/2
@NeurIPSConf
NeurIPS Conference
5 months
**Test of Time**
Distributed Representations of Words and Phrases and their Compositionality
**Outstanding Main Track Papers**
Privacy Auditing with One (1) Training Run
Are Emergent Abilities of Large Language Models a Mirage?
1
9
95
10
63
331
@sanmikoyejo
Sanmi Koyejo
4 years
My first tweet! I'm excited to share my recent interview on Metric Elicitation and Robust Distributed Learning with @samcharrington for the @twimlai podcast. Check it out! via @twimlai
5
15
77
@sanmikoyejo
Sanmi Koyejo
4 years
#NeurIPS2020 will be holding a symposium on the COVID-19 response in the @NeurIPSConf community. We ask that you do not submit workshop/symposium proposals that are entirely on the same topic. We are happy to consider workshops with additional and/or complementary themes.
1
11
51
@sanmikoyejo
Sanmi Koyejo
1 year
(Re-) examining some of the emergence claims in large language models. Turns out the metrics matter! Work with @RylanSchaeffer and @BrandoHablando
@RylanSchaeffer
Rylan Schaeffer
1 year
We had meant to keep this under wraps for a few weeks, but it seems that the cat is out of the bag. Excited to announce our newest preprint!! **Are Emergent Abilities of Large Language Models a Mirage?** Joint w/ @sanmikoyejo & @BrandoHablando 1/12
11
48
264
1
2
28
@sanmikoyejo
Sanmi Koyejo
5 months
Location & time for our paper: "Are Emergent Abilities of Large Language Models a Mirage?" #NeurIPS2023 Presentation: 3:20pm CST, Hall C2 (level 1, gate 9, south of food court) Poster: #1108, 5pm CST, Great Hall & Hall B1+B2 (level 1) Paper link: 2/2
0
4
20
@sanmikoyejo
Sanmi Koyejo
1 year
Are you interested in human or algorithmic challenges when learning from human feedback? Check out the @StanfordHAI Postdoc with @msbernst and me starting Fall 2023. Information here:
@msbernst
Michael Bernstein
1 year
Postdoc position: How should people and communities articulate how AIs should navigate difficult tradeoffs? Prof. @sanmikoyejo and I have a jointly mentored postdoctoral scholar position open at @Stanford CS starting in the fall. Information here:
0
7
25
0
7
18
@sanmikoyejo
Sanmi Koyejo
4 years
@NeurIPSConf #NeurIPS2020 workshop proposal deadline has been extended by one week. The new deadline is 3 July 2020. We will update the other due dates as soon as we complete the planning of the virtual workshops.
0
11
13
@sanmikoyejo
Sanmi Koyejo
6 months
It was great to host you. Thanks for the awesome lecture and engagement with students!
@natolambert
Nathan Lambert
6 months
I gave an RLHF lecture at Stanford today, here are the slides. The newer figures from other talks I've given: * visuals on history of RLHF / related fields * figures on advanced RL methods (CAI / DPO / rejection sampling)
6
79
562
0
1
12
@sanmikoyejo
Sanmi Koyejo
9 months
Welcome!!!
@debcaldarola
Debora Caldarola
9 months
Thrilled to share I'll be spending the next few months at @Stanford as a visiting researcher at @sanmikoyejo's lab 🎉 Grateful to @sanmikoyejo, @marcuswallacej and @bcaputo_iit for this opportunity 🙏
3
0
17
0
1
11
@sanmikoyejo
Sanmi Koyejo
2 months
Are you at #wsdm and interested in Trustworthy Large Language Models? Come check out my tutorial with @uiuc_aisecure in Room 22B, starting at 8:30 AM.
0
2
9
@sanmikoyejo
Sanmi Koyejo
1 year
@russpoldrack @tallinzen @glupyan @RylanSchaeffer Some have argued that some improvements in model capabilities are unpredictable (along with a semi-precise definition of emergence). We argue that many claimed emergent capabilities are predictable, either using better statistics or alternative metrics. See thread for more.
@RylanSchaeffer
Rylan Schaeffer
1 year
We had meant to keep this under wraps for a few weeks, but it seems that the cat is out of the bag. Excited to announce our newest preprint!! **Are Emergent Abilities of Large Language Models a Mirage?** Joint w/ @sanmikoyejo & @BrandoHablando 1/12
11
48
264
1
0
7
@sanmikoyejo
Sanmi Koyejo
1 year
A friendly introduction to double descent, focusing on building intuition with linear models (see thread and links).
@RylanSchaeffer
Rylan Schaeffer
1 year
@SAIA_Alignment @AnthropicAI @daniela_witten Joint work with @sanmikoyejo @KhonaMikail @KaterynaPistun1 @FieteGroup Jason, Zach & Akhilan Comments, questions & feedback are welcome! Paper: Code: 8/8
1
0
5
0
4
6
@sanmikoyejo
Sanmi Koyejo
1 year
New work on improving aggregation for federated domain adaptation with @Ybo_Z and @enyij2!
@Ybo_Z
Yibo Jacky Zhang
1 year
FedAvg / fine-tuning will fail in federated domain adaptation when the domain shift is large. To address this, we propose FedGP, an effective aggregation rule, and a theoretical framework showing why it works. Exciting work with @enyij2 and @sanmikoyejo.
1
1
6
0
3
6
@sanmikoyejo
Sanmi Koyejo
4 years
@autreche @NeurIPSConf From your title, the workshop proposal sounds like it's broader than COVID-19 only and should be fine. Feel free to contact us directly if you need more details. We will be happy to answer.
0
0
1
@sanmikoyejo
Sanmi Koyejo
6 months
Generative AI adoption is growing fast, but computational resources are not keeping up. Can adaptive pricing help, and how does one implement auctions for Generative AI? See some of our early work on this (led by Zachary Robertson).
@stai_research
Stanford Trustworthy AI Research (STAIR) Lab
6 months
🚀 Thrilled to share some work out of our lab researching how to better price AI content using auction design theory! We consider both consumer and data worker payment in this work. Paper: . #OpenAI #AI #Stanford Thread 🧵
1
5
8
0
1
1