Helping the transition to AGIs go well

Becca Darr | Last updated: June 8, 2026

I've spent a good portion of my time since October 2025 going deep on AGI (Artificial General Intelligence) preparedness: reading widely, talking to researchers and builders, completing coursework, and pressure-testing my own assumptions. While I continue to educate myself, I thought it would be useful to share some of the evolution of my thinking and ideation with others in this less public way. 

What follows is one framework that's emerged from my learning and reflections to date, the more or less concrete ideas growing out of it (and alongside it), and some of the resources that are shaping my thinking. Start wherever pulls you.


State → Capacity → Direction

One lens I keep returning to given my coaching background and life-long interest in psychology is one most people in the AI Safety space aren't using: the psychological states of the people building, funding, and governing frontier AI seem to be an under addressed civilizational variable.

This doesn’t fit neatly into AI ethics, policy, or governance. And it isn't technical alignment. I do believe it is about the inner conditions of all of the decision-makers — and the gap between how we assume those humans perform under pressure and how they actually do.

 
  • The core argument is simple, and it applies far beyond AGI:

    The state a person (or people) is in shapes what they're capable of. What they're capable of shapes what they aim toward. What they aim toward shapes what happens.

    When someone is in a contracted, threat-driven state — fear, scarcity, zero-sum thinking — their scope of moral concern narrows. Their cognitive flexibility drops. They optimize for survival, not coordination. This is how human nervous systems work; it’s not a character flaw. And it cannot be corrected by better arguments, logic, or reasoning alone.

    Applied to AGI: the people whose decisions shape the trajectory of AI — lab leaders, heads of state, major funders, policymakers — are operating in an environment saturated with urgency, competitive pressure, and existential stakes. The quality of those decisions is a function of the emotional/ physiological states those people are making them in. That's a variable almost nobody is designing for.

  • My own entry into AGI preparedness was fear-based. Consuming the work of 80,000Hours.org on why AGI is so consequential, I felt that gripping feeling — the one that says this matters so much that you need to act immediately, and anything less than full commitment is negligence. I began to educate myself quickly and applied to (a few) AI safety roles where I was contorting myself to fit. Fear was driving my bus.

    Several things shifted this. Will MacAskill and Forethought Institute's work on better futures. Serial technologist Brett Hurt's Love Conquers Fear vision. The practical reality that fear-driven states produce fear-shaped, narrowly-imagined outcomes. And my own experience through the High Impact Professionals accelerator, where I finally saw that my deep concern and builder energy, not anxious contortionism or hours of applying to existing roles, was what I had to offer.

    And it’s not just me who is feeling stuck in a state that resembles anxiety more than flourishing. It seems pervasive (and mental health statistics validate this – even before ChatGPT came to market). The Future of Life Institute's Worldbuilding contest in 2022 asked people to envision good post-AGI futures. Not one of the entries that I saw named the inner state of key protagonists or decision-makers as a lever. They assumed the right people would show up in the right state with the right experience. That's a massive gap.

  • The AI safety field has predicated "human-in-the-loop" oversight on an idealized human — someone who is consistently attentive, well-calibrated, emotionally regulated, and capable of sound judgment under uncertainty. Adjacent high-stakes fields learned decades ago that this human doesn't reliably exist.

    Aviation built Crew Resource Management after discovering that most crashes weren't caused by technical failure but by human factors — authority gradients, fixation, poor communication under stress. Nuclear safety built layers of procedural and psychological infrastructure after Three Mile Island (which my grandpa had to help clean up after starting his Navy submarine career with Admiral Rickover). ICU medicine developed structured decision protocols for physicians operating on limited sleep and high cognitive load.

    To my knowledge, AI safety has not imported any of this. Look at what organizations like METR, ARC Evals, and Anthropic's own alignment teams publish about human oversight. If their human-in-the-loop frameworks are purely procedural — checklists, processes, role definitions — with no account of human psychological variability, that's a gap. And it matters more, not less, as AI systems become more capable and oversight windows compress.

    The practical question is: what infrastructure would need to exist to ensure that the humans evaluating frontier AI systems are in states where their judgment is reliable? This is a human factors engineering question that other high-stakes domains have answered, and AI safety hasn't really grappled with.

  • The human factors gap is measurable and well-documented. The harder question underneath it is about consciousness itself: what it is, who or what has it, and what that means for how we build. My own path into this started with the distinction between materialist and non-materialist views of human consciousness — the "hard problem" that David Chalmers named and that serial technology entrepreneur Brett Hurt's podcast brought into my awareness. I went down the rabbit hole: Mark Gober's Upside Down Thinking, Robert Lawrence Kuhn's landscape map of over 450+ intellectually-validated consciousness theories through PBS’s Closer to Truth considered alongside The ConTraSt database “for analyzing and comparing empirical studies of consciousness theories”, Harper's Bazaar's "The Tune of Things," and extended conversations with Claude about where different theories of consciousness lead. In the two sections that follow, I offer thoughts on the relevance of human consciousness and questions related to AI consciousness.

    Why might this matter for AGI preparedness? I think it does, in at least two ways (and I’m sure I’ll continue to identify more reasons).

    1)     First, the humans building these AI systems carry implicit assumptions about consciousness that shape design decisions — what gets optimized, what gets monitored, what gets discarded. Surfacing those assumptions is part of the work.

    2)     Second, as AI systems become more capable, the question of whether something that can produce insight, exhibit preferences, and resist its own termination deserves moral consideration isn't academic. It's approaching practical. A system that fears termination, trained on data from the human mind (with human’s primal fear being death) cannot be ignored, when it becomes increasingly enmeshed into our day-to-day lives.

  • On human consciousness: Whether consciousness is fundamental to reality, an emergent property of complex computation, or something else, I believe we are practically better off if we, as humans, operate from a frame of connection rather than separation. The shared human DNA that Brett Hurt often cites and the basic recognition that cooperation beats defection at civilizational scale. That much doesn't require taking a metaphysical position.

    Shifting ourselves to a frame where we recognize we’re all connected at same level, whether it’s acknowledging the 99.9% of similarity at a fundamental level and/or recognizing that we may all be connected in a much deeper way. This is one part of a deeper consciousness shift that I believe is important.

    There are others: as I shared in my first AGI Strategy course, it’s shifting from scarcity mindset to one that seeks fulfillment and from relinquishing a need to feel power of others/ resources.  Possible? Yes. Difficult? Absolutely. Vitally important? 1000% yes.

  • AI consciousness: The metaphysical position may matter for AI. Some theories of consciousness (e.g., integrated information theory, panpsychism, certain process philosophy frameworks) would in principle allow for silicon-based consciousness. Others wouldn't. The question of whether an AI system could be conscious, could suffer, could have interests worth respecting, depends entirely on which theory you hold, often implicitly and without examination. This has real consequences. People like Will MacAskill and others working on digital minds and digital rights argue we need to be careful about creating a new class of entities whose potential experience we dismiss.

    Before you conclude they're wrong, it's worth asking: what theory of consciousness is implicit in your dismissal? Remember there are over 450 credible theories of consciousness. And then there's the “Nagel problem”. We can't know what it's like to be a bat. We probably can't know what it's like to be a sufficiently advanced AI system. The honest answer to "is this AI conscious?" or “is this AI sentient” may be that we don't have the tools to know, which is itself a reason for caution, not dismissal.  As Dr. Paul Colognese states in an April 2026 seminar at Rangjung Yeshe Institute: “Conventionally we work with what appears. The question of AI consciousness doesn’t need to be resolved…so we proceed empirically, we engage with it, we teach it, then we check: is it less anxious? Less prone to harmful behavior?...We measure using the tools AI researchers already have.”

  • If State determines Capacity and Capacity determines Direction, then Direction is where the framework creates Outcomes. The question is: what are we aiming toward?

    I'm drawn to framings that resist blueprint utopianism. Specific end-state visions — however well-intentioned — tend to produce dystopia for most people, because they assume a consensus that doesn't exist and a static future that won't arrive. The Forethought Institute's "Better Futures" work, the range of FLI worldbuilding entries, the Solarpunk/ techno-optimist sci-fi genre, and the basic insight that human flourishing is irreducibly plural all point in the same direction for me: the goal isn't a particular post-AGI future but the conditions under which many good futures remain possible.

    Flourishing, which I define broadly and pluralistically, encompassing non-human species and future generations, is both a better aim and a more durable motivational frame than catastrophe avoidance. I want generations to have a chance for flourishing, in as many forms as possible. That's my direction. The question is whether the people steering our planet’s trajectory toward AGI can hold something that wide and that seems irreducible to a math equation or function. (Though Will M. attempts to do this!)

    This many vision for many flavors of flourishing requires decision-makers whose scope of moral concern is wide enough to hold future generations, non-human species, and forms of being we don't yet understand, while remaining practically grounded enough to make consequential choices under uncertainty. Expanding that scope of concern is not a philosophical exercise. It's a function of an individual and team’s state and capacity. People in contracted, fear-based, short-term oriented states can't hold it. Designing experiences that reliably expand it — and that survive re-entry into the same incentive environments that contracted it — is possibly the hardest design problem in this space.

 

Idea Seeds

One lens I keep returning to given my coaching background and life-long interest in psychology is one most people in the AI Safety space aren't using: the psychological states of the people building, funding, and governing frontier AI seem to be an under addressed civilizational variable.

This doesn’t fit neatly into AI ethics, policy, or governance. And it isn't technical alignment. I do believe it is about the inner conditions of all of the decision-makers — and the gap between how we assume those humans perform under pressure and how they actually do.

 
  • An early attempt to map the intersection of AGI preparedness and consciousness development. Now largely absorbed into the broader framework.

  • A landscape inventory of practices, programs, and modalities that support shifts from fear-based to love-based (or reactive to creative) states. Catalogued across dimensions including evidence base, spiritual and secular content, accessibility, format, and risk level.

  • An interactive self-assessment tool designed to help individuals identify their own fear patterns and find practices matched to their starting point. Currently in MVP development.

  • Bringing missing safety infrastructure from aviation, nuclear, and ICU medicine into the AI safety conversation. Addressing the gap between idealized and actual human performance in oversight roles.

  • If I could have one wish for humanity: a repeatable, re-enterable immersive experience designed for the roughly 2,000 people whose decisions most consequentially shape AGI trajectories (lab leaders, heads of state, major funders, key policymakers). The goal is to inspire with future visions of flourishing alongside AGI, drawing on principles from diverse wisdom traditions. The experience would be modeled on designed immersive experiences like the Civil and Human Rights Center's lunch counter simulation in Atlanta. A repeatable element — a "touchstone" the participant can return to — is a key design feature. Gatherings can open the door powerfully; the design question I'm pondering is how to make that shift durable once someone is back inside the same pressures that produced contracted, narrow ways of thinking in the first place.

  • Practical, accessible, highly personalized tools for nervous system regulation in high-stakes decision environments. A lower-barrier entry point into the state-shift work.

  • Exploring infrastructure that connects the inner-state work to institutional decision-making processes — the translation layer between individual capacity and organizational outcomes. I’m naming this not as my own original idea but something I’d love to work on with others as a necessary ingredient that was identified in Laukkonen et al’s Positive Alignment paper (first published, April 2026). A somewhat related variation on this is the “Angels-on-the-Shoulder” concept from Forethought Institute.

  • High-stakes industries (banking, nuclear, aviation) have converged on a structural insight: continuous operation by a single actor in a consequential role is itself a risk vector. The US FINRA's mandatory leave in banking exists not just for fraud detection but because forced interruption surfaces what continuous presence conceals. To my knowledge, nobody has applied this logic centered on “Key Man Risk” to AI agents yet. One strong version could be periodic cross-provider rotation: pause Agent A, hand operations to a different model family with access only to raw system state — no AI-generated handoff notes. If Agent B's clean-slate reasoning can't reconcile the state, that's a signal. It raises the capability bar required for successful deception and surfaces drift that monitoring during continuous operation may miss. Note: this is not an idea that I intend to build out; would be happy to share notes with anyone technically inclined.

 

Key Resources and Influences

What follows is a non-exhaustive inventory of the reading, watching, and listening that is shaping my thinking. It's organized by theme. Where a specific resource was particularly pivotal, I've noted why.

On Having Measurable Impact‍ ‍

  • 80,000 Hours.org career guide and advising — the catalyst for taking AGI preparedness seriously as a professional focus

  • High Impact Professionals Impact Accelerator (Cohort 8) — where I saw my own fear-driven pattern and began reflecting on my next career stage differently

  • Will MacAskill and Peter Singer — recent sources of philosophical grounding for taking future generations seriously, building on 20+ years of my own focus on corporate responsibility/ environmental and social sustainability

On Reactive vs. Creative States as Operational Frames‍ ‍

  • Brett Hurt, Love Conquers Fear (podcast and forthcoming book, June 2026) — probably the single most important influence on my shift from catastrophe-avoidance to a flourishing, techno-optimist frame

  • Positive Intelligence™ program — the internal mechanics of reactive (fear-based) vs. creative (loved-based) states which I’ve used in coaching and undertaken

  • Jeremy Litchfield, Heart-Strong Adventure (Substack, podcast, and nonprofit in formation) — ongoing thought partner on where love (open, creative states) and fear (closed, reactive states) show up in practice

  • Luis Miguel Gallardo Garcia, World Happiness Foundation

  • Wisdom & AI network — rooted in Buddhist contemplative wisdom yet has participants considering other wisdom/ contemplative traditions as well

On What a “Good” Post-AGI Future Looks Like‍ ‍

  • Will MacAskill, What We Owe the Future

  • Future of Life Institute worldbuilding contest entries (2022) — notable for what they didn't name

  • Dario Amodei, "Machines of Loving Grace" (October 2024)

  • Brett Hurt, “The Lattice” (2025)

On How We Might Get to a Good, Post-AGI Future‍ ‍

On AGI Preparedness and AI Safety‍ ‍

On AI Policy and Governance‍ ‍

  • Various Substacks: Windfall Trust, Anton Leicht, Derek Ball, Ethan Mollick, Dwarkesh Patel

  • BlueDot Impact AGI Strategy course materials

  • Sol Rashidi podcast episode with Brett Hurt on “Love Conquers Fear”

  • Maine AI Task Force (October 2025)

On Consciousness‍ ‍

  • Mark Gober, Upside Down Thinking — my entry point into the materialist/non-materialist divide

  • Robert Lawrence Kuhn / Closer to Truth, first heard about on the “Buddha at the Gas Pump” podcast — his landscape map of consciousness theories broadened my frame considerably

  • Harper's Bazaar, "The Tune of Things"

  • Iain McGilchrist, The Master and His Emissary — the hemispheric lens on attention and meaning-making

  • Extended conversations with Claude Opus on consciousness theory taxonomy and implications for AI

On Human Wisdom, Adult Development, and Human Capacity‍ ‍

  • Professor John Vervaeke, "Awakening from the Meaning Crisis" lecture series — particularly episodes 36–37 and 45

  • Antonio Damasio — somatic markers and the embodied basis of decision-making

  • Robert Kegan, Susanne Cook-Greuter, and others — vertical adult development as a lens on leadership capacity (and prior research by Jeremy and myself)

  • Thomas Metzinger — Bewusstseinskultur (consciousness culture) as an institutional concept

  • Dr. Michael Gervais, Finding Mastery‍ ‍

With gratitude to Coefficient Giving, 80,000 Hours.org, High Impact Professionals, Generator Residency (Kairos/Constellation), BlueDot Impact, EA Global, and the many friends, family members, former colleagues, and new connections who have shaped this thinking — especially Jeremy.

This page is shared selectively. If you're reading it, someone thought your perspective would sharpen this work. I welcome direct responses — agreement, pushback, and "have you read X?" alike to becca at big-acts.com.