AK Profile Banner
AK Profile
AK

@_akhaliq

309,582
Followers
2,571
Following
15,970
Media
30,810
Statuses

AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face:

Joined April 2014
Don't wanna be here? Send us removal request.
@_akhaliq
AK
11 months
ChatGPT playing rock paper scissors
Tweet media one
189
4K
75K
@_akhaliq
AK
1 year
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold paper page:
346
6K
22K
@_akhaliq
AK
11 months
Fixing things with AI
155
2K
19K
@_akhaliq
AK
11 months
ChatGPT, Bro just kept going?
Tweet media one
222
2K
17K
@_akhaliq
AK
7 months
Tweet media one
78
2K
17K
@_akhaliq
AK
11 months
Hey ChatGPT, finish this building...
118
1K
14K
@_akhaliq
AK
11 months
chatgpt gives a random youtube link
Tweet media one
103
390
14K
@_akhaliq
AK
11 months
Fake Apple Products, midjourney AI 1. Apple Jetpack
Tweet media one
152
1K
13K
@_akhaliq
AK
11 months
AI is taking over
178
1K
12K
@_akhaliq
AK
11 months
AI Generative fill with memes
40
2K
10K
@_akhaliq
AK
11 months
real life Simpsons, midjourney AI 1. Flanders
Tweet media one
387
821
9K
@_akhaliq
AK
11 months
Dogs being Human, midjourney AI 1. Golden Retriever
Tweet media one
149
869
8K
@_akhaliq
AK
10 months
Harry Potter Anime using stable diffusion by u/Inner-Reflections
96
2K
7K
@_akhaliq
AK
11 months
What did you do with generative AI?
Tweet media one
111
306
6K
@_akhaliq
AK
2 years
stable diffusion img2img web UI + workflow video github: reddit thread:
40
1K
6K
@_akhaliq
AK
4 years
stylegan2 finetuning ffhq to metfaces
42
1K
5K
@_akhaliq
AK
3 years
ADOP: Approximate Differentiable One-Pixel Point Rendering abs:
71
1K
5K
@_akhaliq
AK
11 months
Celebrities if They Worked Normal Jobs, Midjourney AI 1. Tom Cruise
Tweet media one
113
323
5K
@_akhaliq
AK
2 years
Mubert-Text-to-Music 🎵🎵🎵 Colab notebooks demonstrating prompt-based music generation via Mubert API GitHub:
85
1K
5K
@_akhaliq
AK
11 months
6. Apple Orange
Tweet media one
48
476
4K
@_akhaliq
AK
4 years
Monster Mash: A Single-View Approach to Casual 3D Modeling and Animation pdf: project page:
40
1K
4K
@_akhaliq
AK
9 months
AI generative fill extending scenes, movies shot in portrait format by @Alex_Cerrato
108
807
4K
@_akhaliq
AK
9 months
Text to image with midjourney and image to video with gen2 by @commonstyle
77
823
4K
@_akhaliq
AK
10 months
Meme Legends, Photoshop generative fill AI by savvydone
30
657
4K
@_akhaliq
AK
1 year
Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields abs: project page:
99
644
4K
@_akhaliq
AK
1 year
BREAKING OpenAI released a implementation of Consistency Models consistency models, a new family of generative models that achieve high sample quality without adversarial training. They support fast one-step generation by design, while still allowing for few-step sampling to…
Tweet media one
31
810
3K
@_akhaliq
AK
1 year
Stable Diffusion AI Deepfake De-Aged Harrison Ford SD+ControlNet+EbSynth+Fusion reddit thread:
87
684
3K
@_akhaliq
AK
1 year
Scaling Transformer to 1M tokens and beyond with RMT Recurrent Memory Transformer retains information across up to 2 million tokens. During inference, the model effectively utilized memory for up to 4,096 segments with a total length of 2,048,000 tokens—significantly exceeding…
Tweet media one
98
823
3K
@_akhaliq
AK
10 months
midjourney version 5.2 zoom out feature: Unleashing the Potential of A Broader View
52
488
3K
@_akhaliq
AK
7 months
Training AI to Play Pokemon with Reinforcement Learning by @computerender github: youtube:
34
699
3K
@_akhaliq
AK
9 months
Celebrity Mortal Kombat, Midjourney AI + gen2 + ElevenLabs by u/fignewtgingrich
81
758
3K
@_akhaliq
AK
3 years
Eyes Tell All: Irregular Pupil Shapes Reveal GAN-generated Faces pdf: abs:
Tweet media one
29
790
3K
@_akhaliq
AK
2 years
DALL·E: Introducing Outpainting Extend creativity and tell a bigger story with DALL-E images of any size blog:
25
776
3K
@_akhaliq
AK
1 year
3D-aware Conditional Image Synthesis abs: project page:
19
693
3K
@_akhaliq
AK
10 months
Tweet media one
51
343
3K
@_akhaliq
AK
10 months
Midjourney AI recreating the Original 151 Pokémon - Part 1: The Starters by u/OfficialKnockout 1. Bulbasaur #001
Tweet media one
36
292
3K
@_akhaliq
AK
11 months
5. Apple Teleport
Tweet media one
29
149
3K
@_akhaliq
AK
2 years
MDM: Human Motion Diffusion Model abs: project page:
17
632
3K
@_akhaliq
AK
10 months
Another Meme Legends, Photoshop generative fill AI by @SavvyDone
28
467
3K
@_akhaliq
AK
4 months
Apple announces LLM in a flash: Efficient Large Language Model Inference with Limited Memory paper page: Large language models (LLMs) are central to modern natural language processing, delivering exceptional performance in various tasks. However, their…
Tweet media one
33
502
3K
@_akhaliq
AK
2 years
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding project page: sota FID(7.27 on COCO), without ever training on COCO, human raters find Imagen samples to be on par with the COCO data itself in image-text alignment
Tweet media one
31
684
3K
@_akhaliq
AK
1 year
Riffusion, real-time music generation with stable diffusion @huggingface model: project page:
Tweet media one
64
630
3K
@_akhaliq
AK
2 months
Microsoft presents The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single…
Tweet media one
53
625
3K
@_akhaliq
AK
2 months
Google presents Genie Generative Interactive Environments introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual…
81
540
2K
@_akhaliq
AK
1 year
Generative Agents: Interactive Simulacra of Human Behavior abs: project page:
Tweet media one
62
531
2K
@_akhaliq
AK
1 year
Track Anything: Segment Anything Meets Videos Track-Anything is a flexible and interactive tool for video object tracking and segmentation suitable for: - Video object tracking and segmentation with shot changes. - Visualized development and data annnotation for video object…
33
491
2K
@_akhaliq
AK
2 years
Block-NeRF: Scalable Large Scene Neural View Synthesis abs: project page:
30
555
2K
@_akhaliq
AK
3 months
TikTok presents Depth Anything Unleashing the Power of Large-Scale Unlabeled Data paper page: demo: Depth Anything is trained on 1.5M labeled images and 62M+ unlabeled images jointly, providing the most capable Monocular Depth…
39
416
2K
@_akhaliq
AK
19 days
Apple presents Ferret-UI Grounded Mobile UI Understanding with Multimodal LLMs Recent advancements in multimodal large language models (MLLMs) have been noteworthy, yet, these general-domain MLLMs often fall short in their ability to comprehend and interact effectively with
Tweet media one
33
409
2K
@_akhaliq
AK
2 years
stylegan3-projector Mario github:
64
386
2K
@_akhaliq
AK
2 months
Alibaba presents EMO: Emote Portrait Alive Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions tackle the challenge of enhancing the realism and expressiveness in talking head video generation by focusing on the dynamic and nuanced…
87
581
2K
@_akhaliq
AK
1 year
alpaca-lora: Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware github:
Tweet media one
25
488
2K
@_akhaliq
AK
9 months
Meta releases Llama 2: Open Foundation and Fine-Tuned Chat Models paper: blog: develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion…
Tweet media one
37
576
2K
@_akhaliq
AK
2 years
everyone on ML twitter right now
Tweet media one
22
165
2K
@_akhaliq
AK
4 years
Vid2Player: Controllable Video Sprites that Behave and Appear like Professional Tennis Players pdf: abs: project page:
40
556
2K
@_akhaliq
AK
2 years
make-a-video: text-to-video generation without text-video data paper: project page:
32
487
2K
@_akhaliq
AK
11 months
AI will take over the world?
Tweet media one
65
301
2K
@_akhaliq
AK
3 years
stylegan3 is out github:
7
407
2K
@_akhaliq
AK
11 months
10. Marge
Tweet media one
225
88
2K
@_akhaliq
AK
1 year
One is Midjourney 5.1, the other is real. Which one is which? reddit thread:
Tweet media one
478
232
2K
@_akhaliq
AK
7 months
Language Modeling Is Compression paper page: It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community has focused on training…
Tweet media one
46
391
2K
@_akhaliq
AK
9 months
Tweet media one
39
369
2K
@_akhaliq
AK
8 months
Tracking Anything with Decoupled Video Segmentation paper page: Training data for video segmentation are expensive to annotate. This impedes extensions of end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary settings. To…
23
425
2K
@_akhaliq
AK
11 months
6. Apu
Tweet media one
22
56
2K
@_akhaliq
AK
4 years
#StyleGAN2 interps
19
395
2K
@_akhaliq
AK
2 years
. @Gradio Demo for AnimeGANv2 Face Portrait v2 now on @huggingface Spaces demo: github:
Tweet media one
48
324
2K
@_akhaliq
AK
11 months
3. Apple Jeans
Tweet media one
20
103
2K
@_akhaliq
AK
1 year
Dreamix: Video Diffusion Models are General Video Editors abs: project page: present diffusion-based method that is able to perform text-based motion and appearance editing of general videos
35
451
2K
@_akhaliq
AK
8 months
Got married 💍
Tweet media one
228
21
2K
@_akhaliq
AK
4 months
JPMorgan announces DocLLM A layout-aware generative language model for multimodal document understanding paper page: Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the…
Tweet media one
25
369
2K
@_akhaliq
AK
2 years
A implementation of text-to-3D dreamfusion, powered by stable diffusion github:
24
434
2K
@_akhaliq
AK
2 years
Imagic: Text-Based Real Image Editing with Diffusion Models abs:
Tweet media one
22
423
2K
@_akhaliq
AK
2 years
DreamFusion: Text-to-3D using 2D Diffusion paper: abs: project page: DeepDream on a pretrained 2D diffusion model enables text-to-3D synthesis
28
421
2K
@_akhaliq
AK
11 months
Meta just released MusicGen, a simple and controllable model for music generation MusicGen is a single stage auto-regressive Transformer model trained over a 32kHz EnCodec tokenizer with 4 codebooks sampled at 50 Hz. Unlike existing methods like MusicLM, MusicGen doesn't not…
46
430
2K
@_akhaliq
AK
11 months
Chatgpt, only respond in one word until I say the word no
Tweet media one
44
52
2K
@_akhaliq
AK
1 year
GeoCode: Interpretable Shape Programs abs: project page:
18
292
2K
@_akhaliq
AK
4 years
ganario v2
35
370
2K
@_akhaliq
AK
11 months
7. Apple Ship
Tweet media one
11
68
2K
@_akhaliq
AK
8 months
MVDream: Multi-view Diffusion for 3D Generation paper page: propose MVDream, a multi-view diffusion model that is able to generate geometrically consistent multi-view images from a given text prompt. By leveraging image diffusion models pre-trained on…
19
428
2K
@_akhaliq
AK
11 months
4. Patty & Selma
Tweet media one
26
58
2K
@_akhaliq
AK
2 months
Google announces Stealing Part of a Production Language Model We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the…
Tweet media one
42
338
2K
@_akhaliq
AK
11 months
2. Homer
Tweet media one
49
53
2K
@_akhaliq
AK
10 months
OpenLLaMA 13B Released model: present a permissively licensed open source reproduction of Meta AI's LLaMA large language model. We are releasing 3B, 7B and 13B models trained on 1T tokens. We provide PyTorch and JAX weights of pre-trained OpenLLaMA…
Tweet media one
20
419
2K
@_akhaliq
AK
4 months
Tencent announces AppAgent Multimodal Agents as Smartphone Users paper page: Recent advancements in large language models (LLMs) have led to the creation of intelligent agents capable of performing complex tasks. This paper introduces a novel LLM-based…
51
394
2K
@_akhaliq
AK
1 year
BloombergGPT: A Large Language Model for Finance a 50 billion parameter language model that is trained on a wide range of financial data. Construct a 363 billion token dataset based on Bloomberg’s extensive data sources, perhaps the largest …
Tweet media one
40
315
2K
@_akhaliq
AK
3 years
GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation (works for videos too!) pdf: abs: github:
12
427
2K
@_akhaliq
AK
4 years
Kiki's Delivery Service 3d photo inpainting
7
551
2K
@_akhaliq
AK
1 year
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions abs: project page:
18
389
2K
@_akhaliq
AK
11 months
7. Groundskeeper Willy
Tweet media one
23
33
2K
@_akhaliq
AK
4 months
Alibaba announces DreaMoving: A Human Video Generation Framework based on Diffusion Models @Gradio demo: github:
24
331
2K
@_akhaliq
AK
11 months
4. Apple Toilet
Tweet media one
21
101
2K
@_akhaliq
AK
10 months
zeroscope_v2 XL, A watermark-free Modelscope-based video model capable of generating high quality video at 1024 x 576 Model on @huggingface : This model was trained with offset noise using 9,923 clips and 29,769 tagged frames at 24 frames, 1024x576…
47
339
2K
@_akhaliq
AK
2 years
Instant Neural Graphics Primitives with a Multiresolution Hash Encoding paper: project page: github:
21
360
2K
@_akhaliq
AK
4 months
Alibaba releases DreaMoving demo on Hugging Face A Human Video Generation Framework based on Diffusion Models demo:
23
360
2K
@_akhaliq
AK
10 months
Pika labs releases image-conditioned video generation, upload an image, and the model will animate the image, prompt is “a girl in the wind”
21
349
2K
@_akhaliq
AK
6 months
Skull-pting spooky spaces in 360° by @BlockadeLabs
11
341
2K
@_akhaliq
AK
7 months
Open AI releases GPT-4V(ision) system card paper: GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities…
Tweet media one
17
372
2K
@_akhaliq
AK
1 year
GPT-4 Technical Report pdf: blog:
Tweet media one
16
495
2K