Nataniel Ruiz Profile Banner
Nataniel Ruiz Profile
Nataniel Ruiz

@natanielruizg

5,563
Followers
1,546
Following
257
Media
6,378
Statuses

Research Scientist @Google | author of DreamBooth | Personalization of Generative Models (HyperDreamBooth, StyleDrop, RealFill, ZipLora, Platypus, DreamBooth3D)

Boston, MA
Joined January 2011
Don't wanna be here? Send us removal request.
@natanielruizg
Nataniel Ruiz
2 years
Today, along with my collaborators at @GoogleAI , we announce DreamBooth! It allows a user to generate a subject of choice (pet, object, etc.) in myriad contexts and with text-guided semantic variations! The options are endless. (Thread 👇) webpage: 1/N
46
422
2K
@natanielruizg
Nataniel Ruiz
10 months
Today, with collaborators at @Google , we're excited to announce 🥳🥳HyperDreamBooth🥳 🥳! It's like DreamBooth, but smaller, faster and better. 25x faster. Think of 30 minutes vs. 14 hours for 100 models. And works on a single image! (Thread 👇) webpage:
Tweet media one
42
304
2K
@natanielruizg
Nataniel Ruiz
6 months
With collaborators @Google we're announcing 💫 ZipLora 💫! Merging LoRAs has been a big thing in the community, but tuning can be an onerous process. ZipLora allows us to easily combine any subject LoRA with any style LoRA! Easy to reimplement 🥳 link:
Tweet media one
33
237
1K
@natanielruizg
Nataniel Ruiz
11 months
Today, along with collaborators at @GoogleAI , we’re excited to announce StyleDrop! It allows a user to generate new images that follow a specific style of their choice given only a single style reference image 🤯 (Thread 👇) webpage:
Tweet media one
49
253
1K
@natanielruizg
Nataniel Ruiz
7 months
Today, with collaborators at Google, we're announcing 🤩RealFill🤩! A generative AI approach to fill missing regions of an image with the content that should have been there. The best way to turn almost perfect pictures into invaluable memories! page:
27
145
668
@natanielruizg
Nataniel Ruiz
2 years
We made the first comic fully generated by an AI using DreamBooth finetuning of Imagen at @GoogleAI . We use only 4 input images of our cute little character Anselmo (drawn by my friend @IsaArduz ). (thread) 1/N #DreamBooth #AIArt #Imagen #stablediffusion #midjourney #AIArtwork
Tweet media one
16
78
430
@natanielruizg
Nataniel Ruiz
1 month
AI generated writing *feels* AI-generated at a visceral level, and even if you ask an LLM to make the writing feel or read less AI-generated it horrifically fails and makes it feel even more AI-generated. Any tricks that can help? Any prompts to share?
232
8
370
@natanielruizg
Nataniel Ruiz
1 year
Passed my dissertation defense and obtained my PhD 🥳 a super fun journey, would definitely do it again if I could. Video will be uploaded soon.
Tweet media one
39
12
350
@natanielruizg
Nataniel Ruiz
9 months
We are 🔥super excited🔥 to release the Platypus family of finetuned LLMs 🥳🥳. Platypus achieves the top score in the Hugging Face Open LLM Leaderboard 🏆! The main focus of our work is to achieve cheap, fast and powerful refinement of base LLMs. page:
Tweet media one
14
85
308
@natanielruizg
Nataniel Ruiz
6 years
@cpicciolini @SamHarrisOrg Can you address his explanation, namely that you attributed specific positions to those two people which they do not actually hold?
5
2
253
@natanielruizg
Nataniel Ruiz
2 years
@bradesposito Interestingly she describes exactly what is wrong with streaming now. Many times I have to see an album cover to remember that it’s great and I should listen to it again!
3
7
262
@natanielruizg
Nataniel Ruiz
1 year
Super happy to announce that I will be joining @Google as a Research Scientist and will be starting tomorrow! Extremely excited by this new step and very grateful for everyone that made this possible. 🥳🥳🥳
31
4
258
@natanielruizg
Nataniel Ruiz
1 year
Super happy to announce that DreamBooth has been selected as an award candidate at CVPR 2023 (0.51% award rate). 🥳🥳🥳 link:
Tweet media one
10
30
242
@natanielruizg
Nataniel Ruiz
1 year
🥳 DreamBooth has been accepted to CVPR 2023. And with this comes a *big update* to the paper including the largest evaluation dataset for subject driven generation and an evaluation protocol! Find it in the project webpage: (a thread) #Dreambooth 1/N
Tweet media one
3
26
189
@natanielruizg
Nataniel Ruiz
7 months
Our team is looking for student researchers doing a PhD starting in January either full-time or part-time (prefer full-time). If you want to work on new exciting applications and methods like I did with DreamBooth, then please reach out. DMs open.
Tweet media one
6
32
169
@natanielruizg
Nataniel Ruiz
6 years
Trump and his special assistant for infrastructure policy presenting the deepest neural network.
Tweet media one
4
32
159
@natanielruizg
Nataniel Ruiz
6 months
Ok. We will release a very cool project soon. Easy to reimplement 🥳
13
5
159
@natanielruizg
Nataniel Ruiz
10 months
Excited 🥳🥳🥳to release my first senior author work, done while still a student at BU, with a start studded lineup of collaborators and an incredible student first-author @ArielNLee 🌻🙌- it's all about differences between Vision Transformers & CNNs 👇
Tweet media one
4
31
133
@natanielruizg
Nataniel Ruiz
2 years
What we've been waiting for. I'm extremely excited for this paper.
@_akhaliq
AK
2 years
On Distillation of Guided Diffusion Models abs: On ImageNet 64x64 and CIFAR-10, approach is able to generate images visually comparable to that of the original model using as few as 4 sampling steps
Tweet media one
5
129
630
3
15
122
@natanielruizg
Nataniel Ruiz
1 year
Today, at NeurIPS, we announce counterfactual simulation testing, a new framework for comparing vastly different network architectures using counterfactuals. We use it to compare the robustness of modern ConvNets and Transformers. (Thread 👇) webpage:
Tweet media one
4
18
116
@natanielruizg
Nataniel Ruiz
2 years
Our method has some surprising capabilities inherited from large diffusion models. For example it can generate novel art renditions of a subject! Here are some renditions of a specific dog in the style of famous painters. 4/N
Tweet media one
3
10
113
@natanielruizg
Nataniel Ruiz
2 years
@MIT_CSAIL GPT-3 says: "A neural network is a computer system that is modeled after the brain." It didn't understand the instruction.
5
1
104
@natanielruizg
Nataniel Ruiz
2 years
Public service announcement: Don’t use “sks” as a token for dreambooth. SKS is a type of rifle.
@originalmaderix
maderix
2 years
The runwayml version of #stablediffusion 1.5 model with Dreambooth tends to produce lot of guns by default lol 🤖
Tweet media one
6
3
52
9
9
106
@natanielruizg
Nataniel Ruiz
6 months
I think this has to be some sort of speed record 🏅
@mk1stats
mkshing
6 months
So, I quickly implemented the ZipLoRA by 🤗🧨 (Some people have already noticed though) code: I hope it helps somehow and feel free to drop your comments and feedback~ Big thanks to the authors for their awesome work 🙌
Tweet media one
5
60
323
3
9
102
@natanielruizg
Nataniel Ruiz
7 months
First RealFill open-source re-implementation finished 🔥🔥 link:
@thuanz123
Thuan Hoang Nguyen
7 months
@natanielruizg I think my job with RealFill is done @natanielruizg
2
2
27
0
16
102
@natanielruizg
Nataniel Ruiz
11 months
DreamBooth won an Honorable Mention Award at #CVPR23 (6 out of more than 8000 submissions, 0.08% rate) 🥳🥳🥳
@CVPR
#CVPR2024
11 months
Tweet media one
Tweet media two
1
2
22
4
4
100
@natanielruizg
Nataniel Ruiz
4 years
@marwilliamson Marianne, Venezuela tanked its own economy with 15 years of irresponsible economic policy.
25
3
86
@natanielruizg
Nataniel Ruiz
3 months
the more I look at the videos, the more the motions feel like a video game (the walking here), but the appereance of only some videos looks like video game footage. Maybe this model is trained on a lot of game footage? models are good at learning to change style simulated->real
@OpenAI
OpenAI
3 months
Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. Prompt: “Beautiful, snowy…
10K
33K
141K
19
6
96
@natanielruizg
Nataniel Ruiz
11 months
@PatentlyApple There’s almost a 100% chance they knew this and named it Vision Pro anyway. They have a plan
1
0
87
@natanielruizg
Nataniel Ruiz
2 months
Some really cool work by our soon-to-be-intern @JialuLi96 link:
Tweet media one
0
12
85
@natanielruizg
Nataniel Ruiz
2 years
Text-to-image diffusion models are extremely powerful and allow for flexible generation of images with complex user captions. One limitation is that controlling the subject’s appearance and identity using text is very hard. 2/N
Tweet media one
3
5
80
@natanielruizg
Nataniel Ruiz
2 years
We can even do realistic viewpoint changes for some subjects which have a strong class prior! Here are some examples of different viewpoints for a cat. Notice the detailed fur patterns in the forehead are conserved. 🤯 7/N
Tweet media one
1
7
78
@natanielruizg
Nataniel Ruiz
2 years
By finetuning a model (Imagen here) with few images of a subject (~3-5), a user can generate variations of the subject. E.g. by controlling the environment and context of the subject. Ever wanted to have a high-quality picture of your dog in Paris (no travel required)? 3/N
Tweet media one
1
4
77
@natanielruizg
Nataniel Ruiz
2 years
And another! 🚀 @GoogleAI
@_akhaliq
AK
2 years
UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single Image abs:
Tweet media one
7
148
714
1
8
75
@natanielruizg
Nataniel Ruiz
2 years
We can also change semantic attributes of a subject. Re-coloring, chimeras, material changes, etc. 5/N
Tweet media one
3
5
72
@natanielruizg
Nataniel Ruiz
9 months
Tweet media one
1
0
69
@natanielruizg
Nataniel Ruiz
10 months
My first paper as senior author (done while I was still a PhD student at BU!). So proud of Ariel and grateful for all coauthors 🙏🌸. I feel blessed. Thread coming out tomorrow 🔥
@_akhaliq
AK
10 months
Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing paper page: Vision transformers (ViTs) have significantly changed the computer vision landscape and have periodically exhibited superior performance in vision tasks compared to convolutional…
Tweet media one
3
19
101
0
7
72
@natanielruizg
Nataniel Ruiz
4 years
@TectonixGEO @vdbDennis @xmodesocial I mean how anonymized is it really if you can track a phone location? You can easily figure out where people live, and identifying the person is one-step away (maybe even a Google search away)
3
0
63
@natanielruizg
Nataniel Ruiz
2 years
What about accessorization? Given a few images of your pet you could accessorize them with extreme flexibility. Imagination is the limit! 6/N
Tweet media one
3
9
70
@natanielruizg
Nataniel Ruiz
7 months
Cool work which proposes a very similar "lower-rank" LoRA like Lightweight DreamBooth that we proposed in our HyperDreamBooth work () but for LLMs. 10x reduction in size, just like in our case!
@_akhaliq
AK
7 months
VeRA: Vector-based Random Matrix Adaptation paper page: Low-rank adapation (LoRA) is a popular method that reduces the number of trainable parameters when finetuning large language models, but still faces acute storage challenges when scaling to even…
Tweet media one
6
62
270
0
17
68
@natanielruizg
Nataniel Ruiz
6 months
Another record - someone *cough* @mk1stats *cough* made a @Gradio demo in no time 🤍
@_akhaliq
AK
6 months
ZipLoRA-pytorch with @Gradio demo by @mk1stats local demo: Methods for finetuning generative models for concept-driven personalization generally achieve strong results for subject-driven or style-driven generation. Recently, low-rank adaptations (LoRA)…
Tweet media one
2
85
371
0
10
69
@natanielruizg
Nataniel Ruiz
2 years
Finally, our method can generate new images of a subject with different expressions/emotions. Note that the original images of the subject dog here did not exhibit any of these expressions. 8/N
Tweet media one
1
4
65
@natanielruizg
Nataniel Ruiz
10 months
Very impressive work, a must read.
@_akhaliq
AK
10 months
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models paper page: Text-to-image (T2I) personalization allows users to guide the creative image generation process by combining their own visual concepts in natural language…
Tweet media one
1
68
258
1
7
66
@natanielruizg
Nataniel Ruiz
5 months
This is seriously crazy. No-finetuning styledrop-like generation
@kusichan
Andrey Voynov
5 months
Happy to announce StyleAligned – our new work from @GoogleAI : Style Aligned Image Generation via Shared Attention 📜 arXiv: 👀 project page: (with quiz game!) 💻 code: Details below! ⬇️
Tweet media one
19
114
516
1
3
67
@natanielruizg
Nataniel Ruiz
9 months
🚀 Presenting our latest SOTA LLM: OpenOrca-Platypus2-13B 🚀. Kudos to @ArielNLee and @ColeJHunter and the great people of @alignment_lab for topping the Hugging Face leaderboard in the 13B parameter category! Excited by this collaboration. link:
Tweet media one
2
17
65
@natanielruizg
Nataniel Ruiz
1 year
I’m defending my PhD thesis tomorrow 🎉 at 3pm EST. It’s called: Simulating to Learn. Such a fun journey. Will post the video afterwards. If you want the zoom link send me a dm.
12
2
64
@natanielruizg
Nataniel Ruiz
2 years
@JxckSweeney @elonmusk So you run a Twitter account that tracks Musk's jet purportedly because it is "of service" and "interesting", yet here you are offering to take it down if the amount they pay you is enough? I don't understand.
7
1
61
@natanielruizg
Nataniel Ruiz
2 years
@MIT_CSAIL If I constrain to 5 words it says: "A neural network is a" lol
1
1
64
@natanielruizg
Nataniel Ruiz
9 months
@wangzjeff Haha well not drinking does not mean you have to be boring!
3
0
62
@natanielruizg
Nataniel Ruiz
4 years
@JamesTodaroMD @elonmusk James, there has been a lot of criticism of the Santa Clara study and it might overestimate positive cases because of the biased sample and the false positive rate of antibody tests. The IFR that I computed of 0.1% with that data would mean prevalence of more than 100% in NYC.
3
1
56
@natanielruizg
Nataniel Ruiz
6 months
Thank you @_akhaliq for the retweet! 🙏
@natanielruizg
Nataniel Ruiz
6 months
With collaborators @Google we're announcing 💫 ZipLora 💫! Merging LoRAs has been a big thing in the community, but tuning can be an onerous process. ZipLora allows us to easily combine any subject LoRA with any style LoRA! Easy to reimplement 🥳 link:
Tweet media one
33
237
1K
0
6
55
@natanielruizg
Nataniel Ruiz
11 months
But here is a result I really didn't expect. What surprises me is how well it handles the translation of ideas into arbitrary styles, changing the object shape to fit the style - and following stylistic flourishes and geometrical style components.
Tweet media one
2
3
57
@natanielruizg
Nataniel Ruiz
1 month
Improved how
@OpenAI
OpenAI
1 month
Majorly improved GPT-4 Turbo model available now in the API and rolling out in ChatGPT.
447
755
5K
14
0
55
@natanielruizg
Nataniel Ruiz
5 months
DreamBooth SDXL Turbo works it seems
@EMostaque
Emad
5 months
Yeah so SDXL Turbo tunes. Upcoming version tunes on consumer graphics cards nicely… Let’s just say things are just getting started…
8
5
129
2
3
54
@natanielruizg
Nataniel Ruiz
11 months
Congratulations to @kihyuk_sohn , @dilipkay and to all authors involved in this work! The list is long and can be found below. For more amazing examples go to the project page. paper: project webpage:
4
4
53
@natanielruizg
Nataniel Ruiz
2 years
Another great Google work on diffusion models! 🚀🚀
@_akhaliq
AK
2 years
Imagic: Text-Based Real Image Editing with Diffusion Models abs:
Tweet media one
22
422
2K
1
7
50
@natanielruizg
Nataniel Ruiz
4 months
Some incredibly cool work by Google. Don’t miss it!
@_akhaliq
AK
4 months
Google announces PALP Prompt Aligned Personalization of Text-to-Image Models paper page: Content creators often aim to create personalized images using personal subjects that go beyond the capabilities of conventional text-to-image models. Additionally,…
2
110
455
1
3
50
@natanielruizg
Nataniel Ruiz
5 months
Tweet media one
2
4
49
@natanielruizg
Nataniel Ruiz
1 year
DreamBooth featured at Google I/O 🥳 on an insane concept: a card game with 7+ Million unique generated characters! Amazing work by the I/O Flip team! 🤯 The first instance of such a card game? (clip linked)
2
8
49
@natanielruizg
Nataniel Ruiz
11 months
Yuanzhen Li presenting DreamBooth, DreamBooth3D and StyleDrop at #CVPR2023 !
Tweet media one
3
5
50
@natanielruizg
Nataniel Ruiz
2 years
One main difficulty in finetuning a diffusion model using few images is overfitting. We tackle this problem by presenting an autogenous class-specific prior preservation loss. More details in the paper. 9/N
Tweet media one
1
2
49
@natanielruizg
Nataniel Ruiz
9 months
In order to do so we propose an optimized, small, yet very powerful dataset named Open-Platypus, which is a curated subset of open datasets and focuses on enhancing LLMs' STEM and logic proficiency. We release this dataset to the public.
2
6
47
@natanielruizg
Nataniel Ruiz
11 months
Before diving into technical details, let's explore some impressive examples. StyleDrop can extract the color palette and overall style from this watercolor cat painting, and generate almost anything one can imagine in that same style.
Tweet media one
1
3
48
@natanielruizg
Nataniel Ruiz
8 months
Anyone can now use StyleDrop 🥳 - announced at the Google Cloud Next '23 event! link:
Tweet media one
5
9
47
@natanielruizg
Nataniel Ruiz
3 months
I think @Scenario_gg are pushing the limits of DreamBooth in crazy ways. They really are alchemists working with the original DreamBooth idea to make it much stronger and to be able to do more things with it.
@araminta_k
Mint
3 months
We just made creating your next Consistent Character waaaaaaay easier :D Workflow 1/3 I am sharing THREE workflows this week to using the new "Character Base" LoRAs that we just added to Scenario to: - Use as a consistent character - Create a new consistent character from -…
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
48
238
4
5
47
@natanielruizg
Nataniel Ruiz
10 months
🚀 NeRFs are getting better and faster
@jon_barron
Jon Barron
10 months
Our freshly minted ICCV2023 paper: The nice anti-aliasing of mip-NeRF 360, but with most of the speed of Instant NGP. Error rate reductions of 8%-77% compared to either prior technique, and 24x faster than the most accurate NeRF baseline we tried.
16
93
692
0
1
45
@natanielruizg
Nataniel Ruiz
2 years
We are able to alleviate overfitting using this approach. We show that finetuning without this loss term leads to accelerated overfitting of subject pose and appearance, or context. This decreases generation variability and incorrect scenes. 10/N
Tweet media one
2
4
44
@natanielruizg
Nataniel Ruiz
2 years
We also thank the Imagen team for lending us access to their incredible model. And we deeply thank all of the great people who helped with reviews and feedback (all acknowledged in the paper). Again, our project website is: 13/13 (END)
7
0
45
@natanielruizg
Nataniel Ruiz
1 year
Train CLIP using non-industrial scale resources! Some amazing work by a good friend of min @cihangxie !
@_akhaliq
AK
1 year
An Inverse Scaling Law for CLIP Training abs: paper page:
Tweet media one
3
39
220
0
13
44
@natanielruizg
Nataniel Ruiz
4 years
@afneil The study is hard to read. From what I saw it 1. is a retrospective study 2. treats patients that are severely ill, probably later in the course of the disease. HCQ has in vitro antiviral effects against SARS-CoV-2 and should be used EARLY. Not effective to use it late!
2
2
35
@natanielruizg
Nataniel Ruiz
11 months
This also happens with 3D styles. Here, even the isometric viewpoint is captured. As well as the rounded edges, layout and color palette.
Tweet media one
1
1
42
@natanielruizg
Nataniel Ruiz
1 year
@thatfollowed @sama It’s pretty awesome to have a CEO that is tethered to reality. A lot of respect!
0
0
42
@natanielruizg
Nataniel Ruiz
2 years
@Bryce_Nickels @R_H_Ebright @cshperspectives Quick question, is Markolin a portmanteau of Market and Pangolin?
6
5
39
@natanielruizg
Nataniel Ruiz
2 months
I need to retweet this again because I feel like it needs to get more attention. Awesome read
@robrombach
Robin Rombach
2 months
Party time! The SD3 paper made it to arxiv: Key takeaways: - flow matching is very nice. - back to work with @pess_r and a fantastic team ♥️ The paper is full of details on improved flow matching, scaling and engineering. Enjoy!
Tweet media one
10
38
267
2
2
39
@natanielruizg
Nataniel Ruiz
5 years
@TheAnnaGat @clairlemon Now I understand why the bay area has so many Libertarians
2
0
31
@natanielruizg
Nataniel Ruiz
4 years
@marwilliamson The sanctions were primarily targeted towards the regime (who wine and dine at expensive restaurants while the people starve). I just don’t agree with this specific example.
13
1
33
@natanielruizg
Nataniel Ruiz
4 years
@TectonixGEO @vdbDennis @xmodesocial What do you think about authoritarian government use of this technology?
5
0
30
@natanielruizg
Nataniel Ruiz
9 months
Collecting Google offices Google Austin ✅
Tweet media one
7
1
38
@natanielruizg
Nataniel Ruiz
11 months
I'll be revealing something super super cool tomorrow 🥳🤯
8
0
37
@natanielruizg
Nataniel Ruiz
3 years
@alexandrosM @R_H_Ebright This letter is pretty startling I have to say. As scientists, how could they have been so certain about the origins of the virus about a month after the news of the outbreak? It's always important to have a little bit of doubt when the evidence is not fully there yet
1
1
34
@natanielruizg
Nataniel Ruiz
4 months
Some truly beautiful work by colleagues at Google 🌹🌻🌸
@hila_chefer
Hila Chefer@ ICLR’24
4 months
TLDR: Meet ✨Lumiere✨ our new text-to-video model from @GoogleAI ! Lumiere is designed to create entire clips in just one go! Seamlessly opening up possibilities for many applications: Image-to-video 🖼️ Stylized generation 🖌️ Video editing 🪩 and beyond. See 🧵👇
77
207
954
5
0
37
@natanielruizg
Nataniel Ruiz
8 months
Google Paris ✅
7
1
36
@natanielruizg
Nataniel Ruiz
1 year
Wow. No wayyy. Super cool!
@_akhaliq
AK
1 year
New simple @Gradio dreambooth web ui by @mirage_ml on @huggingface Spaces upload images, add concept prompt and click start training demo:
Tweet media one
3
29
157
1
4
35
@natanielruizg
Nataniel Ruiz
10 months
The core idea is to train a HyperNetwork that predicts weight deltas for the diffusion model in order to make it personalized. This initialization is strong enough that, given fast finetuning, we can achieve great identity preservation with impressive editability and variety 🔥
Tweet media one
2
3
33
@natanielruizg
Nataniel Ruiz
6 months
We observe impressive subject detail preservation with a lot of fidelity to the user-provided style. I find these very cool. 🧙‍♂️
Tweet media one
1
3
35
@natanielruizg
Nataniel Ruiz
1 month
Some insane work by colleagues at Google. Really impressive
@_akhaliq
AK
1 month
Google presents ObjectDrop Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion Diffusion models have revolutionized image editing but often generate images that violate physical laws, particularly the effects of objects on the scene, e.g.,
6
140
616
2
1
34
@natanielruizg
Nataniel Ruiz
2 years
Thank you for your time. And thank you to all of my collaborators @AbermanKfir , Yuanzhen Li, @jampani_varun , @MikiRubinstein , Yael Pritch. I had an amazing time working on this with you and am looking forward to future uses of this technology and more research! 12/N
1
0
33
@natanielruizg
Nataniel Ruiz
3 months
I’m kinda done with people posting screenshots of a single example of a failed LLM query and going “absolutely trash model, XYZ model is much better” Have seen this happen for every single popular model out there. You would think they’re all bad and never give good answers
5
3
32
@natanielruizg
Nataniel Ruiz
8 months
We have something very cool coming out soon 🤩🤩🤩 (suspense)
1
0
33
@natanielruizg
Nataniel Ruiz
2 years
These are the input images for our character Anselmo. We generate a fully-fledged comic with Anselmo in new poses, with different accessories and even with text and speech bubbles automatically drawn by the diffusion model! (just prompt for "a [V] cartoon saying XYZ"!) 2/N
Tweet media one
2
1
31
@natanielruizg
Nataniel Ruiz
10 months
Ok big thing coming out tomorrow 🚀
3
0
30
@natanielruizg
Nataniel Ruiz
11 months
How does it work? We use MUSE, a masked Transformer for Text-to-Image Synthesis. (project: ). MUSE seems to have some properties that make it excel at learning and reproducing style.
Tweet media one
1
1
30
@natanielruizg
Nataniel Ruiz
6 months
Insane speedups (and available for use) by @neuralmagic and @mgoin_ Very impressive… we need this for diffusion models.
@_akhaliq
AK
6 months
Fast Llama 2 on CPUs With Sparse Fine-Tuning and DeepSparse by @neuralmagic with @Gradio demo demo: run with docker: duplicate space for private use: blog:
Tweet media one
3
52
206
0
2
29
@natanielruizg
Nataniel Ruiz
11 months
Imagic 🤝 DreamBooth @bahjat_kawar #CVPR2023
Tweet media one
1
1
30
@natanielruizg
Nataniel Ruiz
10 months
The first key idea is Lightweight DreamBooth (LiDB), a customized model that is only ~100KB instead of more than 1GB for a typical Stable Diffusion model. This makes it 10k times smaller.
Tweet media one
2
3
28
@natanielruizg
Nataniel Ruiz
1 year
I think I have to start to mute any person that tweets things like “X is dead, Z is the new thing that will replace it/them”
6
1
30
@natanielruizg
Nataniel Ruiz
5 months
Great conversations w/ @ajayj_ 🙌
Tweet media one
1
1
30
@natanielruizg
Nataniel Ruiz
4 months
leaving.
Tweet media one
Tweet media two
Tweet media three
1
3
30