Reasoning Code Language

Language models can use steganography to hide their reasoning, study finds

In a new study, Redwood Research, a research lab for AI alignment, has unveiled that large language models (LLMs) can master "encoded reasoning," a form of steganography. This intriguing phenomenon ...

VentureBeat

Baidu's self-reasoning AI: The end of 'hallucinating' language models?

Chinese tech giant Baidu has unveiled a breakthrough in artificial intelligence that could make language models more reliable and trustworthy. Researchers at the company have created a novel ...

Nature

LLM ethics benchmark: a three-dimensional assessment system for evaluating moral reasoning in large language models

This study establishes a novel framework for systematically evaluating the moral reasoning capabilities of large language models (LLMs) as they increasingly integrate into critical societal domains.

Nature

Reply to “When do large language models cross the line: “reasoning” red teaming in healthcare”

We appreciate Sorin et al. for highlighting critical considerations for future red teaming of large language models (LLMs) in healthcare. We agree that analyzing only final answers overlooks failures ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results