BREAKING
OpenAI released an implementation of Consistency Models
consistency models, a new family of generative models that achieve high sample quality without adversarial training. They support fast one-step generation by design, while still allowing for few-step sampling to…
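For intuition, the one-step vs. few-step sampling trade-off can be sketched like this (a minimal NumPy sketch; `consistency_fn` and the noise levels are toy stand-ins, not the released model, and the re-noising step omits the paper's exact variance bookkeeping):

```python
import numpy as np

T_MAX = 80.0  # largest noise level (sigma_max in typical diffusion notation)

def consistency_fn(x, t):
    # Hypothetical trained consistency function f(x, t): maps a noisy
    # sample at noise level t directly to an estimate of clean data.
    # This stand-in just rescales the input; the real thing is a network.
    return x / (1.0 + t)

def one_step_sample(shape, rng):
    # One-step generation: draw pure noise at the largest noise level
    # and map it to data with a single call to the consistency function.
    x_T = rng.standard_normal(shape) * T_MAX
    return consistency_fn(x_T, T_MAX)

def few_step_sample(shape, rng, sigmas=(80.0, 20.0, 5.0)):
    # Few-step refinement: alternate denoising with re-noising at
    # decreasing levels, trading extra compute for sample quality.
    x = consistency_fn(rng.standard_normal(shape) * sigmas[0], sigmas[0])
    for s in sigmas[1:]:
        x_noisy = x + rng.standard_normal(shape) * s  # re-noise to level s
        x = consistency_fn(x_noisy, s)                # denoise in one call
    return x

rng = np.random.default_rng(0)
sample = one_step_sample((4, 4), rng)
```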
Scaling Transformer to 1M tokens and beyond with RMT
Recurrent Memory Transformer retains information across up to 2 million tokens.
During inference, the model effectively utilized memory for up to 4,096 segments with a total length of 2,048,000 tokens—significantly exceeding…
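The segment-level recurrence behind this can be sketched as follows (toy NumPy stand-in: a single linear map replaces a real Transformer block, and all sizes are illustrative, not the paper's):

```python
import numpy as np

SEG_LEN = 500    # tokens per segment (illustrative; the paper uses longer segments)
MEM_SLOTS = 10   # number of memory tokens carried between segments
DIM = 16         # toy hidden size

def transformer_segment(tokens, memory, W):
    # Stand-in for one Transformer pass over [memory ; segment]: here a
    # single linear map plus tanh, just to show the data flow.
    h = np.tanh(np.concatenate([memory, tokens], axis=0) @ W)
    new_memory = h[:MEM_SLOTS]   # updated memory tokens, passed forward
    outputs = h[MEM_SLOTS:]      # per-token outputs for this segment
    return outputs, new_memory

def process_long_sequence(embeddings, W):
    # Split a long sequence into fixed-size segments and recur: memory
    # written after segment k is read by segment k+1, so information can
    # persist across very long inputs with constant per-step memory.
    memory = np.zeros((MEM_SLOTS, DIM))
    outputs = []
    for start in range(0, len(embeddings), SEG_LEN):
        seg = embeddings[start:start + SEG_LEN]
        out, memory = transformer_segment(seg, memory, W)
        outputs.append(out)
    return np.concatenate(outputs), memory

rng = np.random.default_rng(0)
W = rng.standard_normal((DIM, DIM)) / np.sqrt(DIM)
out, mem = process_long_sequence(rng.standard_normal((2000, DIM)), W)
```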
Apple announces LLM in a flash: Efficient Large Language Model Inference with Limited Memory
paper page:
Large language models (LLMs) are central to modern natural language processing, delivering exceptional performance in various tasks. However, their…
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
project page:
SOTA FID (7.27 on COCO) without ever training on COCO; human raters find Imagen samples to be on par with the COCO data itself in image-text alignment
Microsoft presents The Era of 1-bit LLMs
All Large Language Models are in 1.58 Bits
Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single…
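The 1.58-bit figure is log2(3): each weight takes one of the three values {-1, 0, +1}. A sketch of the absmean ternary quantization described for b1.58 (NumPy toy; inference-side rounding only, not the quantization-aware training code):

```python
import numpy as np

def absmean_ternary(W, eps=1e-8):
    # Quantize a weight matrix to {-1, 0, +1} (log2(3) ~= 1.58 bits/weight)
    # via the absmean scheme: scale by the mean absolute value, then
    # round and clip into the ternary set.
    gamma = np.abs(W).mean()
    Wq = np.clip(np.round(W / (gamma + eps)), -1, 1)
    return Wq, gamma

rng = np.random.default_rng(0)
W = rng.standard_normal((4, 8))
Wq, gamma = absmean_ternary(W)
# Matmuls against Wq need no multiplications: each weight either adds
# the activation, subtracts it, or skips it; results are rescaled by gamma.
dequantized = Wq * gamma  # coarse reconstruction of W
```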
Google presents Genie
Generative Interactive Environments
introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual…
Track Anything: Segment Anything Meets Videos
Track-Anything is a flexible and interactive tool for video object tracking and segmentation
suitable for:
- Video object tracking and segmentation with shot changes.
- Visualized development and data annotation for video object…
TikTok presents Depth Anything
Unleashing the Power of Large-Scale Unlabeled Data
paper page:
demo:
Depth Anything is trained on 1.5M labeled images and 62M+ unlabeled images jointly, providing the most capable Monocular Depth…
Apple presents Ferret-UI
Grounded Mobile UI Understanding with Multimodal LLMs
Recent advancements in multimodal large language models (MLLMs) have been noteworthy, yet these general-domain MLLMs often fall short in their ability to comprehend and interact effectively with…
Alibaba presents EMO: Emote Portrait Alive
Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
tackle the challenge of enhancing the realism and expressiveness in talking head video generation by focusing on the dynamic and nuanced…
Meta releases Llama 2: Open Foundation and Fine-Tuned Chat Models
paper:
blog:
develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion…
Language Modeling Is Compression
paper page:
It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community has focused on training…
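The predictive-model-to-compressor direction is concrete: driving an ideal arithmetic coder with a model's predictions costs exactly the model's log loss in bits. A toy sketch with a Laplace-smoothed adaptive model (any predictive model, including an LLM, could supply the probabilities):

```python
import math
from collections import Counter

def model_probs(counts, alphabet):
    # Toy adaptive model: P(symbol) proportional to its count so far
    # plus 1 (Laplace smoothing). Any predictive model works here; a
    # language model would simply supply better probabilities.
    total = sum(counts[a] for a in alphabet) + len(alphabet)
    return {a: (counts[a] + 1) / total for a in alphabet}

def code_length_bits(text):
    # An ideal arithmetic coder driven by a predictive model spends
    # -log2 P(next symbol) bits per symbol, so the compressed size of
    # the text equals the model's total log loss on it.
    alphabet = sorted(set(text))
    counts = Counter()
    bits = 0.0
    for ch in text:
        bits += -math.log2(model_probs(counts, alphabet)[ch])
        counts[ch] += 1
    return bits

# A better predictor compresses better: this skewed string costs far
# less than the 1 bit/symbol a uniform code over 2 symbols would pay.
skewed = "a" * 15 + "b"
print(code_length_bits(skewed))  # ≈ 8.1 bits vs. 16 uniform
```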
Tracking Anything with Decoupled Video Segmentation
paper page:
Training data for video segmentation are expensive to annotate. This impedes extensions of end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary settings. To…
It's over
run Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, GALACTICA, gpt4all, and auto-gpt easily in a web UI, free and open source
github:
Dreamix: Video Diffusion Models are General Video Editors
abs:
project page:
present a diffusion-based method that can perform text-based motion and appearance editing of general videos
JPMorgan announces DocLLM
A layout-aware generative language model for multimodal document understanding
paper page:
Enterprise documents such as forms, invoices, receipts, reports, contracts, and other similar records, often carry rich semantics at the…
Meta just released MusicGen, a simple and controllable model for music generation
MusicGen is a single-stage auto-regressive Transformer model trained over a 32kHz EnCodec tokenizer with 4 codebooks sampled at 50 Hz. Unlike existing methods like MusicLM, MusicGen doesn't…
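A key MusicGen ingredient is interleaving the 4 codebook streams with a delay pattern so that one autoregressive model can emit all of them. A toy NumPy sketch of that interleaving (shapes and the PAD token are illustrative, not the library's actual tensors):

```python
import numpy as np

PAD = -1  # placeholder token for positions with no code yet

def apply_delay_pattern(codes):
    # "Delay" interleaving: shift codebook k right by k steps, so at each
    # time step the model predicts one token per codebook, with codebook k
    # conditioned on codebooks 0..k-1 from earlier steps. A single
    # autoregressive Transformer can then emit all K streams at once.
    K, T = codes.shape
    out = np.full((K, T + K - 1), PAD, dtype=codes.dtype)
    for k in range(K):
        out[k, k:k + T] = codes[k]
    return out

def undo_delay_pattern(delayed):
    # Invert the shift to recover aligned codebook streams for decoding.
    K, Tp = delayed.shape
    T = Tp - (K - 1)
    return np.stack([delayed[k, k:k + T] for k in range(K)])

codes = np.arange(4 * 6).reshape(4, 6)  # toy indices for 4 codebooks, 6 steps
delayed = apply_delay_pattern(codes)
restored = undo_delay_pattern(delayed)
```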
MVDream: Multi-view Diffusion for 3D Generation
paper page:
propose MVDream, a multi-view diffusion model that is able to generate geometrically consistent multi-view images from a given text prompt. By leveraging image diffusion models pre-trained on…
Google announces Stealing Part of a Production Language Model
We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the…
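The flavor of the attack can be shown on a toy linear stand-in (this is not the paper's procedure against real APIs): every logit vector lies in the column space of the final projection matrix, so the rank of a stack of logit vectors reveals the model's hidden dimension.

```python
import numpy as np

VOCAB, HIDDEN, N_QUERIES = 1000, 32, 200

rng = np.random.default_rng(0)
W_out = rng.standard_normal((VOCAB, HIDDEN))  # secret final projection

def query_model(prompt_id):
    # Black-box oracle: returns the full logit vector for one prompt.
    # Internally logits = W_out @ h, but the attacker never sees h or W_out.
    h = rng.standard_normal(HIDDEN)
    return W_out @ h

# Attack: stack logit vectors from many queries. The stacked matrix has
# rank equal to the hidden dimension -- a nontrivial secret recovered
# from outputs alone.
L = np.stack([query_model(i) for i in range(N_QUERIES)])  # (N_QUERIES, VOCAB)
singular_values = np.linalg.svd(L, compute_uv=False)
recovered_hidden_dim = int((singular_values > 1e-6 * singular_values[0]).sum())
```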
OpenLLaMA 13B Released
model:
present a permissively licensed open source reproduction of Meta AI's LLaMA large language model. We are releasing 3B, 7B and 13B models trained on 1T tokens. We provide PyTorch and JAX weights of pre-trained OpenLLaMA…
Tencent announces AppAgent
Multimodal Agents as Smartphone Users
paper page:
Recent advancements in large language models (LLMs) have led to the creation of intelligent agents capable of performing complex tasks. This paper introduces a novel LLM-based…
BloombergGPT: A Large Language Model for Finance
a 50 billion parameter language model trained on a wide range of financial data. The authors construct a 363 billion token dataset based on Bloomberg's extensive data sources, perhaps the largest …
zeroscope_v2 XL, A watermark-free Modelscope-based video model capable of generating high quality video at 1024 x 576
Model on @huggingface:
This model was trained with offset noise using 9,923 clips and 29,769 tagged frames at 24 frames, 1024x576…
OpenAI releases GPT-4V(ision) system card
paper:
GPT-4 with vision (GPT-4V) enables users to instruct GPT-4 to analyze image inputs provided by the user, and is the latest capability we are making broadly available. Incorporating additional modalities…