@VikParuchuri
Vik Paruchuri
2 months
I wrote a blog post on going from not knowing anything about deep learning last year to training state of the art OSS models - . Hope it helps you. tl;dr: read the deep learning book, implemented papers + taught, built open source tools
27
153
1K

Replies

@burningion
Kirk Kaiser
2 months
@VikParuchuri FYI the post is throwing 500 errors now
1
0
0
@VikParuchuri
Vik Paruchuri
2 months
@burningion Thanks! I'm not seeing this - do you still see it?
1
0
0
@CopetimusPrime
Tony
2 months
@VikParuchuri would be interesting to read a more meta version of this post too. there's nothing stopping anyone else from doing what you did, why are you the only one who did it?
1
0
2
@VikParuchuri
Vik Paruchuri
2 months
@CopetimusPrime This is an interesting question. I would also guess, at the minimum, hundreds of people have taken a similar path (software engineer without deep learning experience, learn, get a research job). I guess the interesting part is the middle, though
0
0
2
@pdeyhim
Parviz
2 months
@VikParuchuri Hello from OAK. Great post. Did you find any helpful discord channels to ask basic questions as you were getting started?
1
0
1
@VikParuchuri
Vik Paruchuri
2 months
@pdeyhim I didn't join discord until later, but the huggingface discord is pretty good for beginners
0
0
5
@__AndreaW__
Andrea
2 months
@VikParuchuri the post is amazing and inspirational. Can you share a bit about the effort (hrs/week?) you put in to follow such a cool journey? For the rest of us, I'd probably 3X it, as a rule of thumb
1
0
2
@VikParuchuri
Vik Paruchuri
2 months
@__AndreaW__ I have two kids (3 and 1 now), so I couldn't really work more than 40 hours a week
0
0
3
@_chapter10_
ضحي الشكيلي
2 months
@VikParuchuri What is your recommendation for newbies: PyTorch or TensorFlow?
1
0
1
@VikParuchuri
Vik Paruchuri
2 months
@_chapter10_ Definitely pytorch - pretty much all research is done in it these days. Even google research doesn't use tensorflow
0
0
2
@ThereBeLyte
Vhiz
2 months
@VikParuchuri Great post! Two questions: 1. While doing LLM development, do you use any abstractions like DSPy, etc? Would you recommend any? 2. Hoping to read the section on "cleaning text data" in .
1
0
3
@VikParuchuri
Vik Paruchuri
2 months
@ThereBeLyte I haven't used DSPy, but wouldn't recommend any abstractions until you understand how things work under the hood. Prompt generators are really hard to debug.
1
0
3
@VikParuchuri huge huge congrats! is zero to gpt dormant? i'd be interested in the later lessons
1
0
5
@VikParuchuri
Vik Paruchuri
2 months
@swyx Thanks! I'm still planning to work on it, but other projects keep getting in the way
0
0
2
@abhagsain
Anurag Bhagsain
2 months
@VikParuchuri how much math is needed? 🫣
1
0
1
@VikParuchuri
Vik Paruchuri
2 months
@abhagsain A decent amount, but mostly stats and linear algebra, with some calculus - it was easier to learn than I thought it would be (fear of math blocked me from studying deep learning for a long time)
2
0
10
@sbergman
Sean Bergman
2 months
@VikParuchuri Inspirational post. Thanks for sharing and congrats on the new role! Your post really makes me think I should follow my passion, take a one year sabbatical to learn all I can, and contribute to open source projects I find interesting and valuable.
1
0
3
@VikParuchuri
Vik Paruchuri
2 months
@sbergman Highly recommend it - I've always learned the most in those periods
0
0
4
@ralphbrooks
Ralph Brooks AI Artisan
2 months
@VikParuchuri Extremely inspiring to read about your journey into deep learning.
0
0
1
@iliasmiraoui
Ilias Miraoui
2 months
@VikParuchuri Love this
0
0
1