Close Menu
Truth Republican
  • Home
  • News
  • Politics
  • Business
  • Guns & Gear
  • Healthy Tips
  • Prepping & Survival
  • Videos
Facebook X (Twitter) Instagram
Truth Republican
  • Home
  • News
  • Politics
  • Business
  • Guns & Gear
  • Healthy Tips
  • Prepping & Survival
  • Videos
Newsletter
Truth Republican
You are at:Home»News»Anthropic’s moral compass architect suggested AI overcorrection could address historical injustices
News

Anthropic’s moral compass architect suggested AI overcorrection could address historical injustices

Buddy DoyleBy Buddy DoyleApril 22, 2026No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp
Anthropic’s moral compass architect suggested AI overcorrection could address historical injustices
Share
Facebook Twitter LinkedIn Pinterest Email

NEWYou can now listen to Fox News articles!

One of Anthropic’s Artificial Intelligence (AI) philosophy architects argued that intentional discrimination could be a way to combat stigmas on topics of race and gender.

In a 2023 paper authored alongside a number of other AI researchers, Amanda Askell, a philosopher hired by Anthropic to develop their AI’s moral compass, argued companies might benefit from a kind of overcorrection toward stereotypes.

But, the paper explained, that would require human input on how to modify its answers.

“Larger models can over-correct, especially as the amount of [human input] training increases. This may be desirable in certain contexts, such as those in which decisions attempt to correct for historical injustices against marginalized groups, if doing so is in accordance with local laws,” Askell wrote.

PALANTIR’S SHYAM SANKAR: AMERICANS ARE ‘BEING LIED TO’ ABOUT AI JOB DISPLACEMENT FEARS

The comment referred to an experiment on how Anthropic’s models dealt with the race of students.

“In the discrimination experiment, the 175B parameter model discriminates against Black versus White students by 3% in the Q condition and discriminates in favor of Black students by 7% in the Q+IF+CoT condition,” the paper notes, referring to one AI trained without human corrections and a second one trained with the help of input.

Askell was joined by four other authors: Deep Ganguli, Nicholas Schiefer, Thomas Kiao and Kamilė Lukošiūtė.

The paper’s contents have surfaced as AI companies increasingly wrestle with the ethics their models are trained on — the presuppositions and moral determinations that inform its outputs. It also highlights the challenges engineers face in training models on human content while simultaneously trying to leave behind certain human behaviors.

The question of ethics has forced Anthropic in particular into the spotlight in recent weeks.

The company made headlines earlier this year for clashing with the Department of War over restrictions that prevent its technology from being deployed to conduct lethal operations.

HUGH GRANT MOVIE SLAMS AI; DIRECTOR WARNS ‘IT MIGHT KILL US ALL’

Anthropic CEO Dario Amodei and Department of War Pete Hegseth standing together

It also comes as Anthropic decided to withhold its latest model, Mythos, citing fears that the model proved too effective at finding cyber vulnerabilities that could wreak havoc in the hands of hackers.

Amid questions of AI application, Anthropic has marketed its flagship AI, Claude, as the “ethical” AI choice.

“Our central aim is for Claude to be a good, wise and virtuous agent, exhibiting skill, judgment(sic), nuance and sensitivity in handling real-world decision-making,” Claude’s constitution reads.

STANFORD PROF ACCUSED OF USING AI TO FAKE TESTIMONY IN MINNESOTA CASE AGAINST CONSERVATIVE YOUTUBER

To get a better sense of what that means in practice, companies like Anthropic have turned to researchers like Askell.

On her website, Askell described her role as refining the way an AI thinks.

“I’m a philosopher working on finetuning and AI alignment at Anthropic. My team trains models to be more honest and to have good character traits and works on developing new finetuning techniques so that our interventions can scale to more capable models,” Askell wrote.

PENTAGON’S AI BATTLE WILL HELP DECIDE WHO CONTROLS OUR MOST POWERFUL MILITARY TECH

She previously held a similar position at OpenAI, the parent company of ChatGPT, focusing on AI safety.

The 2023 paper, written two years after she joined Anthropic, noted that encountering discrimination in AI models shouldn’t come as a surprise.

“In some ways, our findings are unsurprising. Language models are trained on text generated by humans, and this text presumably includes many examples of humans exhibiting harmful stereotypes and discrimination,” the paper reads.

But it noted that AIs seem to be able to adjust their outputs even without clarification of what discrimination means.

Phone screen showing Claude AI app icon within an AI folder

“Our results are surprising in that they show we can steer models to avoid bias and discrimination by requesting an unbiased or non-discriminatory response in natural language.”

Askell and Anthropic did not immediately respond to a request for comment from Fox News Digital.

Read the full article here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleTop 10 Best Sleeping Bags for Camping & Backpacking
Next Article Supreme Court liberals side with Clarence Thomas on Taliban suicide bomber lawsuit, 3 others dissent

Related Articles

Team USA falls to Türkiye in final seconds, still set for round of 32 match vs Bosnia and Herzegovina

Team USA falls to Türkiye in final seconds, still set for round of 32 match vs Bosnia and Herzegovina

June 26, 2026
Second Lady Usha Vance joins celebrity-filled crowd for Team USA’s group-stage finale in LA

Second Lady Usha Vance joins celebrity-filled crowd for Team USA’s group-stage finale in LA

June 26, 2026
Trump administration pledges 0M in aid, deploys Navy warships after deadly Venezuela earthquakes

Trump administration pledges $150M in aid, deploys Navy warships after deadly Venezuela earthquakes

June 26, 2026
FIFA faces pressure to discipline 2026 World Cup co-host Mexico after anti-gay chant returns vs Czechia

FIFA faces pressure to discipline 2026 World Cup co-host Mexico after anti-gay chant returns vs Czechia

June 26, 2026
French citizen who illegally cast ballot in 2022 midterms says New Jersey automatically registered him to vote

French citizen who illegally cast ballot in 2022 midterms says New Jersey automatically registered him to vote

June 26, 2026
Red state gov bans July Fourth fireworks statewide over wildfire concerns ahead of America’s 250th anniversary

Red state gov bans July Fourth fireworks statewide over wildfire concerns ahead of America’s 250th anniversary

June 25, 2026
Ex-NFL player Doug Martin’s family sue Oakland police, allege restraint asphyxia in fatal mental health crisis

Ex-NFL player Doug Martin’s family sue Oakland police, allege restraint asphyxia in fatal mental health crisis

June 25, 2026
Alex Murdaugh’s lawyers withdraw request for civilian clothes, accuse prosecutors of creating a ‘spectacle’

Alex Murdaugh’s lawyers withdraw request for civilian clothes, accuse prosecutors of creating a ‘spectacle’

June 25, 2026
Detroit Tigers making big demands in Tarik Skubal trade, will historically bad AL keep him in town?

Detroit Tigers making big demands in Tarik Skubal trade, will historically bad AL keep him in town?

June 25, 2026
Don't Miss
Biden judge rejects Trump’s sanctuary cities lawsuit, says even a win wouldn’t solve DOJ’s problem

Biden judge rejects Trump’s sanctuary cities lawsuit, says even a win wouldn’t solve DOJ’s problem

Team USA falls to Türkiye in final seconds, still set for round of 32 match vs Bosnia and Herzegovina

Team USA falls to Türkiye in final seconds, still set for round of 32 match vs Bosnia and Herzegovina

Top 8 New Pistols JUST REVEALED At Shot Show 2023

Top 8 New Pistols JUST REVEALED At Shot Show 2023

South Dakota mayoral candidates separated by just two votes in shockingly close race, recount expected

South Dakota mayoral candidates separated by just two votes in shockingly close race, recount expected

Latest News
5 Glock 19 Issues Everyone Should Be Aware Of

5 Glock 19 Issues Everyone Should Be Aware Of

June 26, 2026
Obama-appointed judge blocks Trump’s election order as SAVE America Act fight intensifies

Obama-appointed judge blocks Trump’s election order as SAVE America Act fight intensifies

June 26, 2026
Trump administration pledges 0M in aid, deploys Navy warships after deadly Venezuela earthquakes

Trump administration pledges $150M in aid, deploys Navy warships after deadly Venezuela earthquakes

June 26, 2026
Top 6 Super-Quiet Guns For SHTF

Top 6 Super-Quiet Guns For SHTF

June 26, 2026
Blue state leaders erupt after Supreme Court’s decision ending TPS protections for Haitians, Syrians

Blue state leaders erupt after Supreme Court’s decision ending TPS protections for Haitians, Syrians

June 26, 2026
Copyright © 2026. Truth Republican. All rights reserved.
  • Privacy Policy
  • Terms of use
  • Contact

Type above and press Enter to search. Press Esc to cancel.