AI Alignment Research

Exclusive: New Research Shows AI Strategically Lying

Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit ...

UK AI alignment project gets OpenAI and Microsoft boost

OpenAI and Microsoft are the latest companies to back the UK’s AI Security Institute (AISI). The two firms have pledged support for the Alignment Project, an international effort to work towards ...

An AI Tried to Escape The Lab: AI Safety Tests Flag Deceptive Model Behavior

Advanced AI models show deception in lab tests; a three-level risk scale includes Level 3 “scheming,” raising oversight concerns.

TMCnet

Proving AI Value Is the Defining Test for IT Leadership, Says Info-Tech Research Group in CIO Priorities 2026 Report

CIOs across the UK and Europe are entering 2026 under mounting pressure to demonstrate measurable business value from technology investment as regulation tightens and economic conditions remain ...

Hosted on MSN

Aligning those who align AI, one satirical website at a time

The work of creating artificial intelligence that holds to the guardrails of human values, known in the industry as alignment, has developed into its own (somewhat ambiguous) field of study rife with ...

Foundation First: Why Sales Leaders Need Data And Alignment Before Adding More AI

When revenue systems aren’t built on shared definitions, clean inputs and cross-functional alignment, AI doesn’t create leverage. In fact, it amplifies confusion.

A 7-Step Leadership Framework To Implement AI At Scale And Speed

I've developed a seven-step framework grounded in my client work and interviews with thought leaders and informed by current ...

TechCrunch

OpenAI’s research on AI models deliberately lying is wild

Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated multiple universes exist. Or when Anthropic gave its AI ...

An AI Pause Is Humanity’s Best Bet For Preventing Extinction

Constantly improving AI would create a positive feedback loop: an intelligence explosion. We would be no match for it.
