A festival of truth-seeking, optimization, and blogging. We'll have writing workshops, rationality classes, puzzle hunts, and thoughtful conversations across a sprawling fractal campus of nooks and whiteboards.
Ilya Sutskever and Jan Leike have resigned. They led OpenAI's alignment work. Superalignment will now be led by John Schulman, it seems. Jakub Pachocki replaced Sutskever as Chief Scientist.
Reasons are unclear (as usual when safety people leave OpenAI).
The NYT piece and others I've seen don't really have details. Archive of NYT if you want to read it anyway.
OpenAI announced Sutskever's departure in a blogpost.
In my opinion, a class action filed by all employees allegedly prejudiced by the NDAs and gag orders (I say "allegedly", reserving the right to revise that wording if new information arises), seeking to terminate these agreements, would be extremely effective.
An arbitral tribunal, rather than a court or internal bargaining, would be far more likely to grant compensation to ex-employees.
See Trump's NDA termination.
Crosspost from my blog.
If you spend a lot of time in the blogosphere, you’ll find a great many people expressing contrarian views. If you hang out in the circles that I do, you’ll probably have heard Yudkowsky say that dieting doesn’t really work, Guzey say that sleep is overrated, Hanson argue that medicine doesn’t improve health, various people argue for the lab leak, others argue for hereditarianism, and Caplan argue that mental illness is mostly just aberrant preferences and that education doesn’t work. Often, very smart people, like Robin Hanson, will write long posts defending these views, other people will respond with criticisms, and it all becomes such a tangled mess that you don’t really know what to think.
For...
Do you happen to have a copy of it that you can share?
I have liked music very much since I was a teenager. I spent many hours late at night in Soulseek chat rooms talking about and sharing music with my online friends. So, I tend to just have some music floating around in my head on any given day. But, I never learned to play any instrument, or use any digital audio software. It just didn't catch my interest.
My wife learned to play piano as a kid, so we happen to have a keyboard sitting around in our apartment. One day I was bored so I decided to just see whether I could figure out how to play some random song that I was thinking about right then. I found I was easily able to reconstitute a piano...
The first two reasons that come to my mind are (1) other instruments have much more career incentive to do so (in that there are many more jobs for classical violinists or violin ensembles than for classical guitarists), and (2) it’s possible to have a much more successful career as a guitarist knowing only chord positions and not having a more detailed understanding of the fretboard, than it is with other instruments where a knowledge of how to play complicated melodies is required.
This is a D&D.Sci scenario: a puzzle where players are given a dataset to analyze and an objective to pursue using information from that dataset.
Duke Arado’s obsession with physics-defying architecture has caused him to run into a small problem. His problem is not – he affirms – that his interest has in any way waned: the menagerie of fantastical buildings which dot his territories attest to this, and he treasures each new time-bending tower or non-Euclidean mansion as much as the first. Nor – he assuages – is it that he’s having trouble finding talent: while it’s true that no individual has ever managed to design more than one impossible structure, it’s also true that he scarcely goes a week without some architect arriving at his door, haunted...
Looks like architects apprenticed under B. Johnson or P. Stamatin always make impossible structures.
Architects apprenticed under M. Escher, R. Penrose or T. Geisel never do.
Self-taught architects sometimes do and sometimes don't. It doesn't initially look promising to predict which of this group will or won't: there are many cases of similar proposals sometimes succeeding and sometimes failing.
Fortunately, we do have 5 architects (D, E, G, H, K) apprenticed under B. Johnson or P. Stamatin, so we can pick the 4 of them likely to have the lowest-cost proposals.
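The filter-then-pick-cheapest logic above can be sketched in pandas. This is a minimal illustration with made-up data: the column names (`architect`, `mentor`, `cost`) and the costs are hypothetical stand-ins for whatever the actual scenario dataset uses.

```python
import pandas as pd

# Toy stand-in for the scenario dataset; real column names and values differ.
df = pd.DataFrame({
    "architect": ["D", "E", "G", "H", "K", "X"],
    "mentor": ["B. Johnson", "P. Stamatin", "B. Johnson",
               "P. Stamatin", "B. Johnson", "M. Escher"],
    "cost": [120, 95, 150, 80, 110, 60],
})

# Keep only architects whose mentors have a perfect record of
# producing impossible structures, then take the four cheapest proposals.
reliable = df[df["mentor"].isin(["B. Johnson", "P. Stamatin"])]
picks = reliable.nsmallest(4, "cost")["architect"].tolist()
print(picks)  # -> ['H', 'E', 'K', 'D']
```

Note that X is excluded despite having the cheapest proposal, since an M. Escher apprenticeship predicts failure in this (toy) rule set.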
Cos
Here’s a conception that I have about sacredness, divinity, and religion.
There’s a sense in which love and friendship didn’t have to exist.
If you look at the animal kingdom, you see all kinds of solitary species, animals that only come together for mating. Members of social species – such as humans – have companionship and cooperation, but many species do quite well without being social.
In theory, you could imagine a world with no social species at all.
In theory, you could imagine a species of intelligent humanoids akin to Tolkien’s orcs. Looking out purely for themselves, willing to kill anyone else if they got away with it and it benefited them.
And then in another sense, some versions of love and friendship do have to exist.
Social species evolved for a...
Thank you for your thoughts.
I often reflect that, in my attempts to model life on this planet from all that I have observed, experienced, read, and reflected on, it seems like there is a persistent "force" that is supporting life at ever greater levels of organization and complexity. The fields, circumstances, and conditions of this planet seem to give chances to any strategy for organizing on top of what has already been organized. Trillions of chances over billions of years, with almost as many failures. Almost.
I'm not the most science-y, but it seems t...
I expect it would be useful when developing an understanding of the language used on LW.
We don't have a live count, but we have a one-time analysis from late 2023: https://www.lesswrong.com/posts/WYqixmisE6dQjHPT8/2022-and-all-time-posts-by-pingback-count
My guess is not much has changed since then, so I think that's basically the answer.
Epistemic status: I wrote this in August 2023, got some feedback I didn't manage to incorporate very well, and then never published it. There's been less discussion of overhang risk recently but I don't see any reason to keep sitting on it. Still broadly endorsed, though there's a mention of a "recent" hardware shortage which might be a bit dated.
I think arguments about the risks of overhangs are often unclear about what type of argument is being made. Various types of arguments that I've seen include:
This seems to be arguing that the big labs are doing some obviously-inefficient R&D in terms of advancing capabilities, and that government intervention risks accidentally redirecting them towards much more effective R&D directions. I am skeptical.
...
- If such training runs are not dangerous then the AI safety group loses credibility.
- It could give a false sense of security when a different arch requiring much less training appears and is much more dangerous than the largest LLM.
- It removes the chance to learn alignment and safety details.
This is the fourth in a sequence of posts taken from my recent report: Why Did Environmentalism Become Partisan?
This post has more of my personal opinions than previous posts or the report itself.
Other movements should try to avoid becoming as partisan as the environmental movement. Partisanship did not make environmentalism more popular; it made legislation more difficult to pass, and it resulted in fluctuating executive action. Looking at the history of environmentalism can give insight into what to avoid in order to stay bipartisan.
Partisanship was not inevitable. It occurred as the result of choices and alliances made by individual decision makers. If they had made different choices, environmentalism could have ended up being a bipartisan issue, like it was in the 1980s and is in some countries...
Thank you!
The links to the report are now fixed.
The 4 blog posts cover most of the same ground as the report. The report goes into more detail, especially in sections 5 & 6.
Authors: David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum
Abstract:
...Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these approaches is that they aim to produce AI systems which are equipped with high-assurance quantitative safety guarantees. This is achieved by the interplay of three core components:
I wrote up some of my thoughts on Bengio's agenda here.
TLDR: I'm excited about work on trying to find any interpretable hypothesis which can be highly predictive on hard prediction tasks (e.g. next token prediction).[1] From my understanding, the bayesian aspect of this agenda doesn't add much value.
I might collaborate with someone to write up a more detailed version of this view which engages in detail and is more clearly explained. (To make it easier to argue against and to exist as a more canonical reference.)
As far as Davidad, I think the "manually bui...