I wrote a blog post on going from not knowing anything about deep learning last year to training state of the art OSS models - . Hope it helps you. tldr; read the deep learning book, implemented papers + taught, built open source tools Tweet added by Vik Paruchuri @VikParuchuri

Vik Paruchuri

2 months

I wrote a blog post on going from not knowing anything about deep learning last year to training state of the art OSS models - . Hope it helps you. tldr; read the deep learning book, implemented papers + taught, built open source tools

How I got into deep learning

I ran an education company, Dataquest, for 8 years. Last year, I got the itch to start building again. Deep learning was always interesting to me, but I knew very little about it. I set out to fix...

www.vikas.sh

27

153

1K

Kirk Kaiser

@burningion

2 months

@VikParuchuri FYI the post is throwing 500 errors now

1

0

Vik Paruchuri

@VikParuchuri

2 months

@burningion Thanks! I'm not seeing this - do you still see it?

1

0

Tony

@CopetimusPrime

2 months

@VikParuchuri would be interesting to read a more meta version of this post too. there's nothing stopping anyone else from doing what you did, why are you the only one who did it?

1

0

2

Vik Paruchuri

@VikParuchuri

2 months

@CopetimusPrime This is an interesting question. I would also guess, at the minimum, hundreds of people have taken a similar path (software engineer without deep learning experience, learn, get a research job). I guess the interesting part is the middle, though

0

2

Parviz

@pdeyhim

2 months

@VikParuchuri Hello from OAK. Great post. Did you find any helpful discord channels to ask basic questions as you were getting started?

1

0

1

Vik Paruchuri

@VikParuchuri

2 months

@pdeyhim I didn't join discord until later, but the huggingface discord is pretty good for beginners

0

5

Andrea

@__AndreaW__

2 months

@VikParuchuri @VikParuchuri the post is amazing and inspirational. Can you share a bit about the effort (hrs/week?) you put in to follow such a cool journey? For the rest of us, I'd probably 3X it, as a rule of thumb

1

0

2

Vik Paruchuri

@VikParuchuri

2 months

@__AndreaW__ I have two kids (3 and 1 now), so I couldn't really work more than 40 hours a week

0

3

ضحي الشكيلي

@_chapter10_

2 months

@VikParuchuri What is your recommendation for newbie’s. PyTorch or Tensorflow?

1

0

1

Vik Paruchuri

@VikParuchuri

2 months

@_chapter10_ Definitely pytorch - pretty much all research is done in it these days. Even google research doesn't use tensorflow

0

2

Vhiz

@ThereBeLyte

2 months

@VikParuchuri @VikParuchuri - Great post! Two questions: 1. While doing LLM development, do you use any abstractions like DSPy, etc? Would you recommend any? 2. Hoping to read the section on "cleaning text data" in .

GitHub - VikParuchuri/zero_to_gpt: Go from no deep learning knowledge to implementing GPT.

Go from no deep learning knowledge to implementing GPT. - VikParuchuri/zero_to_gpt

github.com

1

0

3

Vik Paruchuri

@VikParuchuri

2 months

@ThereBeLyte I haven't used DSPy, but wouldn't recommend any abstractions until you understand how things work under the hood. Prompt generators are really hard to debug.

1

0

3

[email protected]

@swyx

2 months

@VikParuchuri huge huge congrats! is zero to gpt dormant? i'd be interested in the later lessons

1

0

5

Vik Paruchuri

@VikParuchuri

2 months

@swyx Thanks! I'm still planning to work on it, but other projects keep getting in the way

0

2

Anurag Bhagsain

@abhagsain

2 months

@VikParuchuri how much math is needed? 🫣

1

0

1

Vik Paruchuri

@VikParuchuri

2 months

@abhagsain A decent amount, but mostly stats and linear algebra, with some calculus - it was easier to learn than I thought it would be (fear of math blocked me from studying deep learning for a long time)

2

0

10

Sean Bergman

@sbergman

2 months

@VikParuchuri Inspirational post. Thanks for sharing and congrats on the new role! Your post really makes me think I should follow my passion, take a one year sabbatical to learn all I can, and contribute to open source projects I find interesting and valuable.

1

0

3

Vik Paruchuri

@VikParuchuri

2 months

@sbergman Highly recommend it - I've always learned the most in those periods

0

4