I've hardly kept this quiet, but very excited/other emotions to say we're building
@finsterai
! Finance meets GenAI meets other things. Lots to do, lots of fun, we’re hiring! Stay tuned. As they say, show don’t tell etc. Danke!
@CharlotteCGill
I mean this is actually useful history, and that's funding over 5 years. That's the equivalent of paying 1.5 junior analysts at a bank to do actual historical research of a 500-year period and build a wealth of history of our country. Not everything you dislike is woke.
@CharlotteCGill
The “anti-woke” argument is that we should preserve our culture and heritage, not attack institutions, and that history is something we should be proud of. How do you do that if you don't study your history, culture and heritage?
@CharlotteCGill
The free market won't fund this. So it has to be funding bodies. This is all basic social and economic policy that, regardless of whether you lean left or right, is obvious. But in the name of culture war, we will dismantle academic institutions just to “own the libs”.
DL Twitter is really something these days, because you just see people constantly fighting about whether scaling is enough while the world has far bigger problems; as a cherry on top, neither of the people does any real research anymore and last wrote code in 2012
Every few months I remember my idea for a “machine learning at the pub” podcast where as the evening progresses the ML takes get spicier. It’d be called WhiskAI and my first guest would be
@egrefen
😂
Hola — if you're based in LON/NYC (or well, anywhere) — I'm hiring founding engineers/AI folks for a ~new thing~. The key bits are: we're moving quickly, I'm super excited by it, minimal BS, zero techbro-ness and well, bloody good fun. And shit banter. DM me! Grazie
Really excited about our latest work showing that large Transformer-XLs can be used in RL agents. We show SoTA performance on DMLab with gated transformers and a few small changes. Led by Emilio as an internship project!
@DeepMindAI
Finally, Transformers working for RL! Two simple modifications, moving layer-norm and adding gating, create GTrXL: an incredibly stable and effective architecture for integrating experience through time in RL.
Great work from Emilio interning at
@DeepMindAI
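For the curious, the two tweaks above can be sketched very roughly in numpy (a toy sketch, not the paper's code; the function names, shapes and the `b_g` bias initialisation are illustrative): layer-norm moves to the sublayer input, and the residual connection is replaced by a GRU-style gate that starts close to the identity.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Normalise over the feature dimension.
    mu = x.mean(-1, keepdims=True)
    var = x.var(-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_gate(x, y, W, U, b_g=2.0):
    """GRU-style gate g(x, y) replacing the residual x + y.

    W and U are dicts of (d, d) matrices for the reset (r), update (z)
    and candidate (h) transforms; b_g biases the update gate so the
    block starts near the identity (a stability trick in the paper).
    """
    r = sigmoid(x @ W["r"] + y @ U["r"])
    z = sigmoid(x @ W["z"] + y @ U["z"] - b_g)
    h = np.tanh(y @ W["h"] + (r * x) @ U["h"])
    return (1 - z) * x + z * h

def gated_block(x, sublayer, W, U):
    # "Move layer-norm": normalise the sublayer *input* only, then
    # gate the sublayer output against the un-normalised stream.
    y = np.maximum(sublayer(layer_norm(x)), 0.0)
    return gru_gate(x, y, W, U)

d = 8
rng = np.random.default_rng(0)
W = {k: rng.normal(0, 0.1, (d, d)) for k in "rzh"}
U = {k: rng.normal(0, 0.1, (d, d)) for k in "rzh"}
x = rng.normal(size=(4, d))
out = gated_block(x, lambda h: h @ rng.normal(0, 0.1, (d, d)), W, U)
print(out.shape)  # (4, 8)
```

With a large `b_g` the gate passes `x` through almost unchanged, which is why the gated block trains stably from initialisation.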
A beautiful parting gift to receive from friends at
@GoogleDeepMind
! I'm super touched! Combining my go-to passions of cricket and AI😂
not enough Virat Kohli puns on here but huge thank you to everyone! It’s been an honour!
A quick update from me! After 7 wonderful years of meeting great folks, my terrible banter, generally mucking about and some research to boot, today was my last day
@GoogleDeepMind
. Super grateful for the time of my life, but time for a wee break for me. I'll go back to tweeting
Really happy and a lovely early-Christmas present that our paper "Multiplicative Interactions and Where to Find Them" was accepted to
#ICLR2020
. We analyse such interactions, unify different forms (eg hypernets, gating), and encourage you to use more of them! 1/2
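As a rough illustration of the unification (a toy numpy sketch; the shapes and names are mine, not the paper's): a general multiplicative interaction contains a hypernetwork as a special case, because the context vector z effectively generates the weights applied to x.

```python
import numpy as np

rng = np.random.default_rng(1)
dx, dz, dy = 5, 3, 4

# General multiplicative interaction: f(x, z) = z^T W x + U z + V x + b,
# with W a third-order (dz, dx, dy) tensor.
W = rng.normal(0, 0.1, (dz, dx, dy))
U = rng.normal(0, 0.1, (dz, dy))
V = rng.normal(0, 0.1, (dx, dy))
b = np.zeros(dy)

def mi(x, z):
    return np.einsum("z,zxy,x->y", z, W, x) + z @ U + x @ V + b

# Hypernetwork view of the same computation: z generates the
# weight matrix and bias that are then applied to x.
def mi_as_hypernet(x, z):
    Wz = np.einsum("z,zxy->xy", z, W) + V   # context-generated weights
    bz = z @ U + b                          # context-generated bias
    return x @ Wz + bz

x, z = rng.normal(size=dx), rng.normal(size=dz)
assert np.allclose(mi(x, z), mi_as_hypernet(x, z))
```

Gating is the diagonal special case of the same form, where z multiplies x element-wise instead of through a full tensor.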
The only reason we have this level of AI right now is cause people worked on it even when it wasn't cool and funding was dry.
Work on whatever makes you happy, whatever gives purpose, whatever you can't not work on, whatever pays the bills, whatever you want.
Happy to share that our paper "Top-KAST: Top-K Always Sparse Training", proposing a new sparse training method, was accepted to NeurIPS 2020!
This was work done at
@DeepMind
with
@rpascanu
@sindero
Jack Rae and
@erich_elsen
!
1/3
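The core idea, as I'd summarise it in a minimal numpy sketch (illustrative only, not the paper's implementation; densities and shapes are made up): the forward pass uses only the top-K weights by magnitude, while gradients flow to a slightly larger set so currently-pruned weights can re-enter.

```python
import numpy as np

def topk_mask(w, density):
    """Keep only the largest-magnitude `density` fraction of weights."""
    k = max(1, int(density * w.size))
    thresh = np.sort(np.abs(w).ravel())[-k]
    return (np.abs(w) >= thresh).astype(w.dtype)

rng = np.random.default_rng(0)
w = rng.normal(size=(16, 16))

fwd_mask = topk_mask(w, density=0.1)   # sparse forward set (top-K)
bwd_mask = topk_mask(w, density=0.2)   # larger backward set: weights
                                       # outside the forward set still
                                       # receive gradient and can re-enter

x = rng.normal(size=16)
y = x @ (w * fwd_mask)                 # forward pass is always sparse
```

Because the backward density is larger, the backward set is always a superset of the forward set, and the forward pass never touches a dense weight matrix.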
Happy to share this blogpost about our work on compressive transformers (led by Jack Rae)! We show that you can extend the effective memory of Transformers(-XL) with a simple compression scheme and show gains for LM and RL! (accepted to ICLR 2020)
Memory is a crucial feature of intelligence. Our new blog post overviews the use of memory in deep learning, and how modelling language may be an ideal task for developing better memory architectures:
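Very roughly, the compression step maps blocks of the oldest memories into fewer compressed slots instead of discarding them; here's a toy numpy sketch using mean pooling (the paper explores several compression functions, e.g. convolutions and pooling; names and sizes here are illustrative):

```python
import numpy as np

def compress(old_mems, rate=3):
    """Mean-pool blocks of `rate` old memory slots into one compressed slot."""
    n = (len(old_mems) // rate) * rate        # drop any ragged tail
    blocks = old_mems[:n].reshape(-1, rate, old_mems.shape[-1])
    return blocks.mean(axis=1)

rng = np.random.default_rng(0)
d = 8
memory = rng.normal(size=(12, d))      # oldest activations about to be evicted
compressed = compress(memory, rate=3)  # 12 slots -> 4 compressed slots
print(compressed.shape)  # (4, 8)
```

With compression rate c, a memory of n slots shrinks to n/c compressed slots, so the effective attention span grows by roughly a factor of c for the oldest context.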
I have a free
@iclr_conf
registration as a reviewer. I’d like to effectively pass this on by paying for someone’s registration who otherwise may not be able to attend. Please DM if interested or feel free to retweet :)
Our new paper out on Arxiv: "Distilling Policy Distillation"! We investigate policy distillation for RL: how to use it, what form to use and show some nice theoretical results on the way. Accepted as an Oral at AISTATS :)
@wojczarnecki
@maxjaderberg
@rpascanu
@sindero
@DeepMindAI
Unrelated to the board decision, but I love this “started in apartment” meme because it lets you cosplay as “coming from nothing” even when that's not the case. (He was CTO of Stripe before OpenAI)
Congratulations
@demishassabis
what an honour! Demis saw AI coming way back when, and is also personally why I got into AI. Met Demis at a
@QueensCam
CompSci dinner in 2014 and was floored at the Atari demo; hounded DM to let me intern, dropped out of Uni and stayed on. Very grateful
New paper with
@janexwang
and others
@DeepMindAI
on Meta-Learning in the presence of recurring tasks! Accepted to
#icml2018
:)
"Been There, Done That: Meta-Learning with Episodic Recall"
1) CONGRATULATIONS to everyone at DeepMind; this is AMAZING!!! ❤️
2) Let this be a lesson to every techbro who blows with the wind and tweeted "Google is dead" for the last 6 months. Ignore the noise.
3) Stop confusing innovators dilemma with "Google can't ship". They're the
The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable & general AI model. Built to be natively multimodal, it can understand many types of info. Efficient & flexible, it comes in 3 sizes, each best-in-class & optimized for different uses
@aidangomezzz
Imo the real worry is if the loss function suddenly spikes UP. Maybe the model is using morse to speak to us? Maybe it’s getting worse on purpose??? Makes you think. Very worrying. About to cancel some runs
On the
@NipsConference
poll: it's hard to get to the point where we're an inclusive community if we're going to follow the results of a male-dominated sample voting on an issue that female/non-binary respondents (and a good chunk of male ones too) found offensive.
#ProtestNIPS
This is how all startup projections sound. “If we even get half of Google's users, and also we make them pay for email, then we will be at $25B revenue”. Thanks mate
Ran some numbers on this. Gmail apparently has 1.8 billion users. If Elon launches XMail as part of the Basic $3/mo subscription and gets just half of Gmail users to switch over that’s $2.7 BILLION A MONTH in new revenues for X. Wow!
@MohapatraHemant
@perplexity_ai
Imo irrelevant that they're 2 years old. They're well funded, have some of the most well-known advisors and investors, and are worth close to what the NYT is market-cap-wise. Can't hold tech startups to different standards just because things move fast imo; regardless of one's views on this
It's been well...quite a year since my Danish Christmas last year. Solo virtual Christmas in London wasn't quite the same but I tried to Scandi it up -- here's wishing you all a very merry Christmas and I hope 2021 is all you want it to be!
@sriramk
@k0ol1
Think that's a bit uncharitable Sriram -- there's loads of people doing a lot on A. Both are possible and there's a lot of people working on it. We're the ones who left, we hardly get to critique people who bring up A; it's not really whataboutism to point out structural issues
Dishoom's net contribution to the development of AI cannot be overstated. It was our personal pilgrimage site at DM and we've kept it up
@finsterai
for good measure. I think the chilli cheese toast brings out creativity. It's all the ChaatBots.
@freyaindiaa
It's not missing though, is it -- most mental health advice will tell you about volunteering and doing things for others, and a lot of therapy is trying to get you to be self-aware about why you did something and why you felt something, and looking at other people's perspectives.
Virtually nobody is pricing in what's coming in AI.
I wrote an essay series on the AGI strategic picture: from the trendlines in deep learning and counting the OOMs, to the international situation and The Project.
SITUATIONAL AWARENESS: The Decade Ahead
If you want to hear more about our NeurIPS submission, we'll be at poster session 7 tomorrow (a very early 5am GMT for me but it times well with the Australian cricket summer)!
I wish this was a subtweet to someone in particular but sadly it’s not: scrolling through some AI startup team pages and it’s like people forgot you can hire women. Just dudes in a room with a careers page talking about how open and inclusive the culture is. You can basically
One of the coolest results (benchmark-wise) I've seen recently is Transformer-XL (30-ish ppl down to 18 on Wiki103!) this ICLR. BigGAN's been talked about loads, but despite the surprisingly average reviews Transformer-XL got -- yet another win for attention-based models! Really cool
Excited to be in New Orleans for
#ICLR2019
! I'll be at the
@DeepMindAI
stand tomorrow to chat about research in general and the research engineering roles!
Not doing the whole PR thing as yet, but wanted to give a shout out to amazing investors/angels who backed
@finsterai
and me, a first-time/solo founder. We're rolling out to our early customers and continuing to build/hire. 1/2
My yearly Christmas thread. For all those in quarantine, take care and hold on! Have personally been lucky to have my first Christmas back in India for 2 years. My Mum has also happened to take this very impromptu but Christmas-card looking photo of me. Take care you all!
It's been well...quite a year since my Danish Christmas last year. Solo virtual Christmas in London wasn't quite the same but I tried to Scandi it up -- here's wishing you all a very merry Christmas and I hope 2021 is all you want it to be!
You can make up anything you want on Twitter. Spent 7 years at DeepMind, I can assure you that tuning models is very hard and despite twitter imagination, there is no “wokeness” hyper-parameter. Not everything is a culture war. We have better things to do.
Looking at my social media, the normies have now found the racist Gemini images.
Their assumption is this must be a mistake by Google. But it's not. This was deliberate.
Like any religious or ideological cult, Google believes its mission is to convert the unbelievers.
Awesome to see this public! I didn't contribute at all to this while at DeepMind, but def slowed the team down by requesting total garbage prompts on the early versions, which they were nice enough to try. My music taste generally flits between jazz and Bollywood so you know.
Thrilled to share
#Lyria
, the world's most sophisticated AI music generation system. From just a text prompt Lyria produces compelling music & vocals. Also: building new Music AI tools for artists to amplify creativity in partnership w/YT & music industry
Given that every tweet on my timeline is now just:
“This thing has happened, here’s why you should care.
A 🧵 thread!”
I would like to state, for the record: this tweet is not a thread and you don't need to care. Merci.
Woke up to see this in
@sytaylor
’s newsletter re
@finsterai
! Thanks Simon!
Still early doors for us, but am super excited about what Finster's Agents can do for finance/capital markets. Let people focus on the important bits & let AI do goal-directed research/analysis/comps etc
And thank you especially to those involved in this! Twitter seems especially bad at letting me tag anyone today but shout out to Jackie, Dushyant,
@egrefen
, Dan,
@tfgg2
, Matt, Yotam, Josh,
@pfau
@janexwang
and Raz!
Just took my first day off since founding
@finsterai
. Attempting to leave the laptop behind. On the beach. Day 1. Walking back to the room. Two women talking about tech-enabled services and disrupting insurance. GG WP.
I tweet random AI things all day for 3 likes, and then the thing people actually read is when I ratio today's Twitter main character (the goal of Twitter is to never be the main character of the day etc). Many ways etc
Two different people including a tech reporter mentioned today to me that they enjoy my social media presence. Unclear if this is very good or very bad. Will continue posting till it becomes clear. Onwards
Happy to share our latest ICML paper -- work with Wojtek,
@maxjaderberg
,
@sindero
,
@yeewhye
,
@lqh20
and others at
@DeepMindAI
! We train large action spaces and models by introducing simple curricula for RL agents -- mixing of policies + distillation from simpler policies.
Our
@ICMLconf
paper introduces Mix&Match - a general-purpose framework for training complex RL agents. M&M creates curricula over the agent, using solutions found by simpler ones to train harder-to-train ones
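The two ingredients, mixing of policies and distillation from a simpler agent, can be sketched in toy form (an illustrative numpy sketch, not the paper's actual objective or annealing scheme; the distributions below are made up):

```python
import numpy as np

def mixed_policy(p_simple, p_complex, alpha):
    # Control policy is a mixture; alpha is annealed 0 -> 1 over
    # training, handing control from the simple to the complex agent.
    return (1 - alpha) * p_simple + alpha * p_complex

def distill_loss(p_simple, p_complex, eps=1e-8):
    # KL(simple || complex): pulls the complex agent toward behaviour
    # the simpler agent has already discovered.
    return np.sum(p_simple * (np.log(p_simple + eps) - np.log(p_complex + eps)))

p_s = np.array([0.7, 0.2, 0.1])   # simple agent, easy-to-train policy
p_c = np.array([0.4, 0.3, 0.3])   # complex agent over the same actions
pi = mixed_policy(p_s, p_c, alpha=0.25)
print(pi)                         # still a valid distribution
print(distill_loss(p_s, p_c))
```

Because the mixture of two valid distributions is itself a valid distribution, the agent can act with the mixed policy at every stage of the curriculum.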
NEW WORKSHOP: Sparsity in Neural Networks: Advancing Understanding and Practice (July 8-9, 2021). This workshop will bring together members of the many communities working on neural network sparsity to share their perspectives and the latest cutting-edge research (Deadline: 6/15)
Real innovation happens slowly, and then all at once. Easy to move very fast after decades of slow, meticulous work. We stand on the shoulders of giants etc.
Quick self-indulgent tweet: Given we've done p little branding/pr (so far), it's always nice to see
@finsterai
turn up on a list this early :)
Tis customers, product, revenue in that order of prioritisation so far.
stay tuned for lots of updates in the summer hopefully!
AI agents are one of the hottest areas of development & investment in AI
Here's a deep dive into the ecosystem from
@Prosus_Ventures
:
1. AI agents + AgentOps landscape
We really should teach more people stats and Bayes. My man here forgot the p(dem|city) term. This is like saying most failures of Ford cars are due to Ford and not Hyundai. Like yes.
5 years later, NeurIPS rolls off the tongue just fine and I'd totally forgotten that there even was a name change. See, it's not hard to change things sometimes. Even in academia. Is good.
I also started
@finsterai
in my apartment. And was unemployed working on it for 4 whole months! **spent 7 years at DeepMind before, had savings, generally privileged, went to Cambridge. SO MUCH HUSTLE
Thank you to
@Siftedeu
for featuring one smart and one dumb quote from me lol
Good read from Tim Smith here
Am here so I can go to Lord's more btw, but genetically sworn to Liverpool. (Also congratulations to Arsenal and Rory; good week for the UK)
@MohapatraHemant
Great question! An interesting variation is to say "fiction" *and* released in the 21st century, to exclude James Joyce or any non-fiction technical-adjacent books like GEB haha
Merry Christmas to everyone from Bombay! Escaped rainy London to celebrate the holidays, attend my closest friends' weddings, and also spend time with my new best mate. Stay safe, and may 2023 bring you lots of happiness, even larger LMs and most of all, more Twitter drama xx
My yearly Christmas thread. For all those in quarantine, take care and hold on! Have personally been lucky to have my first Christmas back in India for 2 years. My Mum has also happened to take this very impromptu but Christmas-card looking photo of me. Take care you all!
Need a word for that sweet spot that AI products need to be at in-between copilots and backseat drivers.
Like your friend who gets control of the tunes on a road trip and only plays bangers
Of course things are getting better but to sum up the state of the theory in Deep Learning: there’s a famous
@karpathy
tweet about 3e-4 being the best learning rate, and while 100% irony, it was not entirely terrible advice as a first choice😂
Apparently there are people out there that think deep learning is too theory obsessed
I can't even rub two propositions together to create a third
Guess I'll try and become a more prominent researcher, for their sake
I think a lot of AI Hot Takes can be better understood if you take into account that 1) Something about Twitter makes everyone go MAD
2) People who take the time to post their Hot Takes live in the biggest bubble/echo chamber of all time
3) Careful nuanced debate is boring
Can't +1 this point enough. Just as most criticism of deep learning is instead criticism of "pure DL will not get us to AGI" -- but I don't know a single person doing DL who thinks that in the first place.
@ylecun
@hardmaru
Bit of a pedantic point, but I think we should instead criticize model-free RL rather than "pure RL" (not sure what this is). We're still doing RL-based learning with an internal model.
Underrated benefit of being a CEO is I can use
@finsterai
's account to like my own posts; providing instant dopamine when I get a notification on my phone 4 minutes later and forget that I liked my own post.
My two favourite papers I've written are Multiplicative Interactions and Where to Find Them and Top-KAST (I also like their titles fwiw); one was inspired by and built on HyperNetworks, and the other by evolutionary approaches to sparse networks. Basically I love
@hardmaru
's work
Introducing Evolutionary Model Merge: A new approach bringing us closer to automating foundation model development. We use evolution to find great ways of combining open-source models, building new powerful foundation models with user-specified abilities!
Gemini 1.5 Pro - A highly capable multimodal model with a 10M token context length
Today we are releasing the first demonstrations of the capabilities of the Gemini 1.5 series, with the Gemini 1.5 Pro model. One of the key differentiators of this model is its incredibly long
Glad we have managed to replicate the awkwardness of real poster sessions virtually 😂
#NeurIPS2020
Particular fan of the realism of the virtual drive by
I know this is old news, but I continue to be baffled by the fan following of Twitter “thought leaders” whose basic claim to credibility is that they're smart and made a lot of money in Tech, and this somehow allows them to opine on complex social policy
It’s interesting that the community spent a long time adding memory/planning etc to Deep RL agents, only for it to look like end-to-end always won, and now we have a new generation of people adding memory, planning etc to LLM-based agents… jury still out?
Can't quite believe what I've just seen! My first time away from home for the winter, but it's been seriously enriched by having witnessed one of the greatest test series. Tearing up at this unbelievable Indian comeback. What an inspiration - cap is very much doffed!
#GabbaTest
We've truly come a long way from "don't worry about AI safety, people will be careful" to "we must fight the woke AI"...
Living in a very dumb timeline
A huge plus that no one told me about leaving full-time employment and doing your own thing is that you can work at 7am or 1am, but you can also, crucially, take a nap at 1pm, and like, that's amazing. Also you can do all this while dog-sitting. The lil one also naps.
Respectfully disagree - very much not a red flag. The line between ML Eng and Research is blurry, and in theory you don't need any degree for either (though helpful). AI's had a gatekeeping problem for ages, and also a reverse gatekeeping problem (experience bad, indie hacker good)
Today, we're announcing Claude 3, our next generation of AI models.
The three state-of-the-art models—Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku—set new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision.