@Allison_Dupont
Engineers will use the concepts tested here all the time. Question 2 tests a formula which almost everyone forgets after taking calculus, but you can't take most college engineering classes without an understanding of the answers to questions 1, 3, 4, and 5.
@paulg
@ikirigin
That article contains a total of one specific example where the Hamas figures broadly matched reliable estimates by other sources.
The rest of it is just blandly supportive quotes from various NGOs. In the only other example cited (Al Ahli hospital bombing) we have the Hamas
Happy to share this paper, which was recently accepted to SIAM Journal on Control and Optimization.
Actor-critic methods are widely used in reinforcement learning but there is a significant gap between theory and practice...
1/3
You can play 20 questions with GPT-4 by asking it to give you the base64 encoding of the object you want at the beginning of the conversation: I've tried it and it works, see screenshot.
On the other hand, ChatGPT fails at this.
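To make the trick concrete: base64 works as a cheap commitment the model writes down up front and anyone can decode at the end. A minimal Python sketch of the idea (the object "giraffe" is just a made-up example):

```python
import base64

# The model "commits" to its object at the start of the game by
# printing the base64 encoding of its name:
secret = "giraffe"  # hypothetical object the model picked
commitment = base64.b64encode(secret.encode()).decode()
print(commitment)  # Z2lyYWZmZQ==

# After 20 questions, anyone can decode the commitment to check that
# the answer was fixed in advance rather than improvised:
print(base64.b64decode(commitment).decode())  # giraffe
```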
The fact ChatGPT can’t play 20 Questions reveals an important limitation vs. a human: it can’t keep secrets. It has nowhere to put a memory of an unspoken decision.
In effect, it’s like each token is chosen by a new person, guessing from prior context.
I disagree with this and want to explain why.
In the thread below,
@aryehazan
clarifies that his opinion that current LLMs don't understand is based on interactions with them -- and that he is not philosophically opposed to claiming LLMs understand something.
1/
I'll repeat for the (n+1)th time: current LLMs cannot be said to "understand" any topic, for any reasonable notion of "understand". That said, it's incredibly surprising and impressive (and downright amazing) what they *can* do. Turns out lots of "intelligent" tasks don't require
A recent paper on the distributed subgradient method (with exact gradient evaluations): I show that under a wide range of step-sizes, the distributed version has the linear speedup property, i.e., a network of n nodes is n times faster than a single node.
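For context, the method in question is the standard distributed subgradient update (notation mine, not necessarily the paper's): each node i averages with its neighbors and takes a subgradient step on its local objective f_i:

```latex
x_i(t+1) = \sum_{j=1}^{n} w_{ij}\, x_j(t) - \alpha(t)\, g_i(t),
\qquad g_i(t) \in \partial f_i\bigl(x_i(t)\bigr),
```

where W = [w_ij] is a doubly stochastic mixing matrix supported on the network and alpha(t) is the step-size.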
@michael_nielsen
I remember really enjoying this one as an undergraduate. Instead of developing Abel's theorem as a sequence of theorems and lemmas, it gives you a sequence of not-too-difficult exercises along with hints (and solutions in the back). In the process of solving all the exercises,
@RichardHanania
Arguably you already see a version of this dynamic in Lebanon. Opinion polls show that (i) overwhelming majorities of Lebanese have contempt for Israel and support the Oct 7 massacres, and (ii) a majority of Lebanese want to stay out of the current conflict between Israel and Hamas.
@DimitrisPapail
Along the same lines, a couple of years ago I found this document very helpful,
(as opposed to reading standard ML fare, which has the horrible tendency to describe things in words without just giving you all the equations).
My productivity hack for grinding out math: find a place to work with wifi that has bad wifi quality.
You need the wifi to look up results from the literature. But you also need bad wifi quality to prevent yourself from wasting time on the internet.
for a year when I was in school, I lived very close to a Somali coffee shop
Unlimited coffee for $1 and they were open till 2am
AND, the best part -- no wifi and poor cell reception
I studied more in that year than I have before or since
I wasn't fully satisfied with existing expositions of the policy gradient theorem -- I wanted a short proof I could present to undergrads, without mathematically dubious steps, and each step seeming well-motivated by what preceded it -- so I wrote this up:
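For reference, the statement being proved is the standard policy gradient theorem (this is the textbook form, not a quote from the write-up):

```latex
\nabla_\theta J(\theta)
= \mathbb{E}_{s \sim d^{\pi_\theta},\, a \sim \pi_\theta(\cdot \mid s)}
\bigl[ \nabla_\theta \log \pi_\theta(a \mid s)\, Q^{\pi_\theta}(s, a) \bigr],
```

where d^{pi_theta} is the (discounted) state visitation distribution under pi_theta.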
I have noticed the same, and I think this is strong evidence against the analogy between LLM hallucinations and compression artifacts proposed recently by Ted Chiang (see ).
Chiang's starting point was that LLMs are effectively compressing a vast amount
One odd thing about ChatGPT: I may be the one hallucinating, but simply telling it to please not make up paper references seems to substantially improve performance
@wfithian
There's a general pattern prevalent on Twitter and other social networks where people love a pile-on:
(1) Someone gives advice which works in our world but not in an ideal world
(2) Said advice actually involves tradeoffs that wouldn't exist in an ideal world
(3) The
Excited to be giving a tutorial at Allerton in a couple of weeks:
I'll talk about some recently elaborated connections between reinforcement learning and gradient descent. If you'll be at Allerton this year, I hope to see you there.
But if I beg it to tell the truth and tell it that it would pain me to hear false information, the hallucinated information gets discarded and I get a 100% correct reply.
3/3
@lreyzin
If you had told me one month ago that soon there'd be something called a "B-word" that people are not spelling out in reviews, there's about a zero percent probability I would have guessed what it turned out to be.
Together with Julien Hendrickx, I'm teaching a week-long course on "Dynamics and Algorithms on Networks" at Université Paris-Saclay in June 2022. This is aimed at grad students who want an introduction to recent developments in the area. Registration is at
@darengb
@TheStalwart
I've tried that and it didn't work: it seems too hard for it to produce a valid hash on the examples I tried. On the other hand, asking it to write the number in base64 seems to work:
Several of us are organizing a special issue of TCNS on Social Networks. Please consider sending us your work: we welcome both papers with a methodological contribution and interdisciplinary papers containing experimental research.
@goodside
Doesn't it seem intuitive that the prompt could be improved by asking for the justification *first*, and only then the yes/no/unknown?
All my intuition from playing around with language models suggests that asking for the answer first and only then the reasoning will occasionally lead
I wrote a short post about using Metropolis weights in consensus -- a very simple trick for avoiding bad network scaling that seems to be underused in the multi-agent control and distributed optimization communities.
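The rule itself fits in a few lines. A minimal sketch (mine, not from the post): put weight 1/(1 + max(d_i, d_j)) on each edge and the leftover mass on the diagonal, which yields a symmetric, doubly stochastic matrix:

```python
import numpy as np

def metropolis_weights(adj):
    """Metropolis weight matrix for an undirected graph.

    adj: symmetric 0/1 adjacency matrix without self-loops.
    Returns a symmetric, doubly stochastic matrix W.
    """
    n = adj.shape[0]
    deg = adj.sum(axis=1)  # node degrees
    W = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if adj[i, j]:
                # weight on edge (i, j): 1 / (1 + larger endpoint degree)
                W[i, j] = 1.0 / (1.0 + max(deg[i], deg[j]))
        W[i, i] = 1.0 - W[i].sum()  # remaining mass stays at node i
    return W

# Example: path graph on 3 nodes
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])
print(metropolis_weights(A))
```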
@kamilkazani
And yet at one time both Egypt and Jordan maintained that Israel has no right to exist and sought to end it through military force. A peaceful settlement only followed repeated military failures on the part of those nations, and others, to destroy Israel.
@y0b1byte
The definition in terms of inner products is already geometric:
Given a linear map A, one defines A* to be the unique map with the property that,
⟨Ax, y⟩ = ⟨x, A*y⟩ for all x and y.
@thesasho
Makes sense in a way: if you spend the vast majority of your day writing code or reading/commenting on other people's code, then making sense of code will be second nature and feel instantaneous to you in a way that parsing equations is not.
@Ike_Saul
Simon posted a convincing rebuttal to just one of the assertions you made ("Israel is unwilling..."). Israel is clearly willing, and has tried on several occasions to create a viable peace plan, only to be rejected. Your response here doesn't really defend what you originally
@RealDianeYap
As others in your replies have said, this story is likely false. There is now a video of the explosion in question and it does not appear to be consistent with an airstrike. Some stills are at
but you can also find lots of high-quality discussion on
I'm many months late to the party here, but image generation from text feels amazing. Here's the output after putting in "the ocean at dusk | unreal engine" into VQGAN + CLIP:
Agree with this.
When AlphaZero learns it needs to keep its King safe before it launches an attack, I couldn't care less if it "really" understands chess or merely simulates such understanding. Either way it kicks my ass.
I cannot tell you on what date deeply superhuman AGI systems will appear, but when they do, I can guarantee that a considerable fraction of the chattering classes will dismiss them as trickery or say “we don’t even have a good definition of intelligence.”
While I'm not an expert in this area, I thought this post was interesting and hope it stimulates a discussion.
One of the strengths of ML as a scientific field is the willingness of people to offer criticisms in public. In other areas I work in, people tend to keep their
New blog post: Yet Another ICML Award Fiasco
The story of the
@icmlconf
2023 Outstanding Paper Award to the D-Adaptation paper with worse results than the ones from 9 years ago
Please share it to start a needed conversation on mistakenly granted awards
Apropos of nothing, here is a horror story that happened 4-5 years ago when I was a reviewer on a COLT paper. Fortunately for me, this story happened to someone else.
The paper that I was a reviewer for was reasonable but not amazing. I thought it was neither a clear accept nor
@Osinttechnical
@oryxspioenkop
There is probably strong sampling bias in these numbers because Ukrainians are more likely to put out the videos on which this data is based (both to drum up morale and because of the Ukrainian civilians who like to make videos of damaged/abandoned Russian equipment).
@shortstein
Having worked in both, I felt much more satisfied with work in journal-driven fields: because there is no deadline, there's not an incentive to send out the paper before it is fully finished, completely to your satisfaction, with every last bit polished and revised as needed.
A writeup of some recent research from my group on finding a lockdown that minimizes job losses while holding down the reproduction number of an epidemic. Results turned out to be really counter-intuitive: the best lockdown was sometimes harshest in places with few infections.
New research by CISE Affiliated Faculty
@alexolshevsky1
(ECE) attempts to minimize job losses due to COVID-19 lockdowns. Olshevsky hopes to influence policymakers in future pandemic lockdowns. Learn more about the research here:
#COVID19
#OptimalLockdown
Very excited to share this paper
with Haoxing Tiang and
@YPaschalidis
which is scheduled to appear in ICLR 2023 in Kigali. Quick summary of the result below. (1/6)
Many professors are reporting GPT-4 is getting good grades on their exams.
Well, I tried giving it a midterm from my graduate RL class and it performed abominably.
Below, see two attempts by GPT-4 to argue that a symmetric, stochastic matrix is nonexpansive.
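For reference, one standard argument (mine, not GPT-4's): a symmetric matrix has an orthonormal eigenbasis with real eigenvalues, and stochasticity pins those eigenvalues inside [-1, 1] by Gershgorin, so the map cannot expand Euclidean norms:

```latex
% A stochastic: each Gershgorin disc is centered at a_{ii} \ge 0 with
% radius 1 - a_{ii}, so every eigenvalue satisfies |\lambda_i| \le 1.
% A symmetric: write x = \sum_i c_i v_i in an orthonormal eigenbasis. Then
\|Ax\|_2^2 = \sum_i \lambda_i^2 c_i^2 \le \sum_i c_i^2 = \|x\|_2^2 .
```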
@natfriedman
100% accurate summarization: GPT-4 can one-shot book summaries with almost 100% accuracy while GPT-3.5 gets confused whenever it's not obvious what information needs to be kept in and what should be left out.
Where this comes up: I'm using GPT-4 to generate 1-3 sentence book
GPT-4 seems to have improved a lot in the last month, especially on math problems.
It used to be that asking it for a proof that gradient descent converges under some standard assumptions produced nonsense. Now you more or less get the correct standard analysis for non-convex
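For reference, the "correct standard analysis" here is presumably the usual smooth non-convex rate (my rendering): for L-smooth f with step-size 1/L, the descent lemma gives per-step progress, and summing over T steps yields the O(1/T) rate on squared gradient norms:

```latex
f(x_{t+1}) \le f(x_t) - \tfrac{1}{2L}\,\|\nabla f(x_t)\|^2
\quad\Longrightarrow\quad
\min_{0 \le t < T} \|\nabla f(x_t)\|^2 \le \frac{2L\,\bigl(f(x_0) - f^\ast\bigr)}{T}.
```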
I respect institutions like Georgia Tech which have a policy against taking institutional positions on controversial issues. I believe all universities need to adopt this approach.
But...
This is a thousand percent correct. Personally:
-- The $20 a month I pay for access to GPT-4 (through ChatGPT+) just might be the best money I've ever spent.
-- There is a large gap between the abilities of ChatGPT and GPT-4. ChatGPT will occasionally act like a "stochastic
There's a weird divergence right now where people skeptical of LLMs of course don't pay $20 to access chatgpt4, think chatgpt3/bard are state of the art, feeding into their low opinion of LLMs
vs those who paid the $20 and hence have a completely different experience with LLMs
@EugeneVinitsky
One change I'd love to see: evaluate the meaningfulness of a citation using some NLP, with more meaningful citations counting more (a toy sketch follows the examples below). For example,
"Related works include [1]-[23]..."
should count differently from
"Our main result is an extension of a theorem from [7]."
The University of Toronto
@UofT
math department
@UofTMath
held an "Equity Forum" on October 31. Faculty attendance was conditional on attendees signing a petition denouncing Israel...
Zhong had a 3.97 unweighted & 4.42 weighted GPA, scored 1590 out of 1600 on the SATs, founded his own startup, but was rejected by 16 colleges. They include MIT, Carnegie Mellon, UC Berkeley, Cal Poly SLO. But Google called. Watch interview w/
@abc7kristensze
:
@florian_dorfler
@aanna_mit
I wish all the control journals would adopt the LCSS model: if you submit by a certain date and the first round of review comes back sufficiently positive, the reviews are then forwarded to a conference, and the conference will typically invite you to give a presentation.
I don't support cancel culture -- I don't support firing people for their controversial beliefs -- unless their "controversial beliefs" are that people of a certain ethnicity who may be living in a certain region need to die, in which case I absolutely support cancel culture.
What should we conclude from this? That the model doesn’t understand causality?
No -- the follow-up contradicts this -- rather, GPT-4 misreads
@yudapearl
question in exactly the same way a typical person on the internet would.
7/
@DegenRolf
Not quite in the same category, but
argues (as far as I've been able to make out) that governments are neglecting to investigate UFOs because aliens could undermine the notion of "anthropocentric sovereignty" on which modern state power is based.
Spent some more time this weekend playing around with one of the public VQGAN + CLIP notebooks. I had to use several prompts sequentially to generate this image, with the final one being "Storm on the Sea of Galilee by John Constable | Unreal Engine | Matte Painting"
@aryehazan
Sadly, the kind of papers that you are not crazy about will often sail through the review process: often reviewers will be intimidated by all the technicalities needed to make things work in the super-general setting.
@ESYudkowsky
@skdh
We really should be talking about "passing the (n, x)-Turing test" where:
-- x is a quantification of the expert level of the opponent (e.g., 0 for a random person who doesn't know anything about AI, 0.85 for someone who does research on LLMs, 1.0 for the principal inventors of the
@srchvrs
I've noticed a similar phenomenon with the standard Lion/Goat/Grass puzzle where you have to ferry all three across the river.
GPT-4 solves the puzzle perfectly, but if you rename the requirements (e.g., "can't leave Lion and Grass alone"), it will produce nonsense even though
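For contrast, the underlying search problem is trivial for a machine. A minimal brute-force sketch (naming and encoding mine), which also makes it easy to swap in renamed constraints:

```python
from collections import deque

ITEMS = ("lion", "goat", "grass")
UNSAFE = [{"lion", "goat"}, {"goat", "grass"}]  # pairs that can't be left alone

def safe(bank):
    # 'bank' holds the items left on a shore WITHOUT the farmer
    return not any(pair <= bank for pair in UNSAFE)

def solve():
    # State: (items on the left bank, farmer's side); everyone starts left.
    start, goal = (frozenset(ITEMS), "L"), (frozenset(), "R")
    queue, seen = deque([(start, [])]), {start}
    while queue:
        (left, farmer), path = queue.popleft()
        if (left, farmer) == goal:
            return path
        here = left if farmer == "L" else frozenset(ITEMS) - left
        for cargo in [None, *here]:  # cross alone or with one item
            new_left = set(left)
            if cargo:
                (new_left.remove if farmer == "L" else new_left.add)(cargo)
            state = (frozenset(new_left), "R" if farmer == "L" else "L")
            behind = state[0] if state[1] == "R" else frozenset(ITEMS) - state[0]
            if safe(behind) and state not in seen:
                seen.add(state)
                queue.append((state, path + [cargo or "alone"]))

print(solve())  # a 7-move plan, e.g. goat, alone, lion, goat, grass, alone, goat
```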
I have no respect for certain university presidents who have never had problems taking institutional stances but discovered the virtues of neutrality immediately after the largest single-day slaughter of Jews since the Holocaust.
Likewise, there are two reasons to get a question wrong: lack of understanding, and poor alignment toward giving correct answers.
If you get a bad answer, you never know which of the two is at fault. So you can't go from bad answers to lack of understanding
12/
For example, asking ChatGPT for control theorists at Boston University gives an answer that is 50% accurate.
Items 1 and 4 in the list below are not correct.
2/3
Wish this sort of thing was more common. The default norms in every scientific community I've been a part of favor puffing up papers to match page limits. Personally, I find "short and sweet" papers infinitely preferable.
All this runs counter to our intuition as professors: we examine students all the time and while students can emulate some of the things we do in class, the only way to get the right answers consistently is genuine understanding.
So we look for that consistency.
9/
We've mostly forgotten about this but 8 years ago every political webzine had their own scientists building election models, and most of these geniuses made predictions by assuming differences between polls and election results were independent across all 50 states.
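The mistake is quantitatively huge. A toy Monte Carlo (all numbers are made up for illustration): give every state a 2-point poll lead, then compare an independent-errors model against one with a shared national polling miss:

```python
import numpy as np

rng = np.random.default_rng(0)
n_states, n_sims, lead = 50, 100_000, 0.02  # 2-point lead in every state

# Independent model: each state's polling miss is its own draw.
indep = rng.normal(0, 0.03, (n_sims, n_states))
# Correlated model: a shared national miss plus smaller state-level noise.
corr = rng.normal(0, 0.025, (n_sims, 1)) + rng.normal(0, 0.015, (n_sims, n_states))

# Probability that at least 40 of the 50 "safe" states flip:
for name, miss in [("independent", indep), ("correlated", corr)]:
    print(name, ((miss > lead).sum(axis=1) >= 40).mean())
# Under independence a broad systematic miss is essentially impossible;
# the correlated model gives it a very real probability.
```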
I'm very frustrated reading a proof that says "the conclusion is immediate from equation (XX)", when in fact the conclusion is not obvious from the stated equation.
What's worse is that the author of the paper is me, 2 years ago
Very much hope this proof stands up to scrutiny: it would be the most spectacular refutation of the Hardy quote from A Mathematician's Apology:
"No mathematician should ever allow himself to forget that mathematics, more than any other art or science, is a young man's game. ...
100% this.
To those who didn't follow this controversy, it seems to have occurred because the editor in chief of a journal linked to a (satirical) Onion article entitled "Dying Gazans Criticized For Not Using Last Words To Condemn Hamas."
Completely ridiculous to suggest
I have seen the tweet and I don’t understand what the basis for this investigation is.
If the
@eLife
code of conduct can be construed as forbidding affiliated scientists from publicly and disagreeably expressing unpopular political opinions, then it should be revised.
This is true but there are notable methods which are fancy and work well. I was shocked by how many bells and whistles went into PPO (this ICLR blog post was traumatic reading: ) and yet PPO generalizes to new domains better than many competing methods.
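For context, the core PPO objective itself is only one line (the standard clipped-surrogate form); the trauma is all in the implementation details stacked on top of it:

```latex
L^{\mathrm{CLIP}}(\theta)
= \hat{\mathbb{E}}_t\!\left[
    \min\!\Bigl( r_t(\theta)\,\hat{A}_t,\;
    \mathrm{clip}\bigl(r_t(\theta),\, 1-\epsilon,\, 1+\epsilon\bigr)\,\hat{A}_t \Bigr)
  \right],
\qquad
r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{\mathrm{old}}}(a_t \mid s_t)}.
```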
To sum up: *if* you believe “understanding” is a term which can, in principle, be reasonably applied to an LLM, you should reason from its successes to conclude that, at least in some cases, it is capable of genuine understanding.
18/18
@aminkarbasi
@thegautamkamath
Doron Zeilberger, a champion of computer-aided mathematics, has put his PC as a co-author on over 30 papers:
I can't decide if this is really, really stupid or A+ trolling (though I guess the two are not mutually exclusive).
Best paper awards make sense in "static" areas where what is considered important changes slowly. I imagine no one in math would object to an award to a paper that made important progress towards a resolution of the Riemann Hypothesis.
But in fields where what is considered
It’s obvious to every thinking person in the ML community that we should kill the “best paper” award as an institution. It’s an impossible task. Our highest aspiration for it is “don’t embarrass ourselves”. Kill it. I wouldn’t even put it to a vote.
I want to share this letter put together by some of my colleagues which I signed in the aftermath of the Hamas massacre.
I am a little late on this, but it is as relevant today as it was two weeks ago.
The letter is open to signatures from members of BU faculty.
An open letter to the
@BU_Tweets
community on the massacres in Israel.
TL;DR: Historical context and nuance are important, but *nothing* serves as a justification for Hamas's acts against humanity.
Link to sign is within for the interested BU faculty.
Intriguing research by
@CollinBurns
suggests that you can sometimes tell whether the model is telling the truth by looking at its activations directly. In other words, the model can “know” it’s making stuff up:
13/
How can we figure out if what a language model says is true, even when human evaluators can’t easily tell?
We show () that we can identify whether text is true or false directly from a model’s *unlabeled activations*. 🧵
@lreyzin
I enjoy watching human players more than AIs. Humans have understandable goals and plans and make occasional blunders. Watching Stockfish is not fun and even Alpha/Leela Zero, which have more exciting playing styles, have plans that are typically beyond human comprehension.
@florian_dorfler
As others have said, the videos by
@brianbdouglas
are great. There is also this set of lectures from the 1980s,
which looks like a good resource. Finally, I've also recommended this set of lectures
to students in the past.
@SebastienBubeck
Every time I see people on Twitter talk about how Bard is much improved following some update, I give it the same test and it fails every time (see screenshot).
Interesting case where a surprising amount of anger was directed at the creator of a website summarizing books with statistics on writing style (frequency of adjectives, proportion of passive voice, etc).
Not sure if this is an answer to
@thegautamkamath
's challenge, but
Interesting to see how polarizing this was, from Twitter reactions:
- authors were strongly against the site, insisting that it be shut down and the data deleted
- CS/ML folks were astounded by such a gross overreaction
Anyone want to argue contrary to their group?
Wow -- this obtains a 4/5 on the AP Calculus BC test (whereas GPT-3 scored close to the zeroth percentile).
How long until this thing is better than me at proving theorems?
We’re releasing GPT-4 — a large multimodal model (image & text in, text out) which is a significant advance in both capability and alignment.
Still limited in many ways, but passes many qualification benchmarks like the bar exam & AP Calculus:
@aryehazan
....but to get to that real solution, you have imaginary terms that cancel from the formula (e.g., something like 2+3i + (2-3i) = 4); and there was no known way to get to the real solution using steps that didn't have square roots of negative numbers.
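The classic worked example (Bombelli's; adding it here for concreteness):

```latex
x^3 = 15x + 4 \ \text{has the real root } x = 4,
\text{ yet Cardano's formula gives}
\quad
x = \sqrt[3]{2 + \sqrt{-121}} + \sqrt[3]{2 - \sqrt{-121}}
  = (2 + i) + (2 - i) = 4,
```

using (2 ± i)^3 = 2 ± 11i; every intermediate step passes through complex numbers even though both the problem and the answer are real.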