Researchers develop new methods for training robots safely in risky environments.
― 4 min read
Cutting edge science explained simply
Research highlights safety neurons' role in enhancing LLM safety and responsibility.
― 6 min read
A new approach to enhance model safety through prediction rejection.
― 6 min read
Research on magnetic islands enhances plasma stability and disruption prevention in tokamaks.
― 6 min read
This article discusses methods for improving AI alignment with various cultures.
― 6 min read
A new method helps identify weak points in deep learning models quickly.
― 5 min read
Research reveals language models struggle with false reasoning, raising safety concerns.
― 6 min read
Research focuses on managing plasma disruptions to improve fusion reactor safety.
― 4 min read
CCL ensures neural networks maintain accuracy while learning new tasks.
― 6 min read
A tool to analyze and reduce errors in computer image recognition.
― 7 min read
UNRealNet enhances robot navigability in tough terrain using advanced techniques.
― 5 min read
InferAct improves decision-making safety for AI agents in various tasks.
― 6 min read
A new method improves robot walking safety and efficiency.
― 7 min read
Researching how robots work together in shared spaces for safe interactions.
― 5 min read
A new method to enhance safety in critical systems using language models.
― 6 min read
This study analyzes the performance of neural network circuits and their reliability.
― 4 min read
A new method improves understanding of safety constraints in robotics.
― 8 min read
Examining how language models can refuse to answer for improved safety.
― 5 min read
This article reviews how vector quantization affects the interpretability of decisions in reinforcement learning systems.
― 4 min read
Learn how program verification ensures software reliability in critical industries.
― 5 min read
A new method enhances RL agents' resilience against harmful input changes.
― 7 min read
Improving fault detection and diagnostics in nuclear reactors using deep learning techniques.
― 6 min read
This method improves safety in image generation while maintaining quality.
― 6 min read
A new framework enhances robot safety and efficiency in unpredictable settings.
― 7 min read
A new method improves safety in decision-making for machines.
― 7 min read
A new approach to enhance how robots understand and respond to users.
― 7 min read
LEVIS helps find safe input spaces for reliable neural network outputs.
― 5 min read
Caution-Aware Transfer enhances safety and performance in reinforcement learning applications.
― 6 min read
A new method improves detection of harmful prompts in language models.
― 6 min read
This work focuses on explaining decision-making in AI using Monte Carlo Tree Search.
― 6 min read
Introducing CBF-LLM: a method for safer text generation in LLMs.
― 5 min read
A study on false refusals in language models and their impact on user experience.
― 6 min read
A new method combines reinforcement learning and safety to enhance robot tasks.
― 7 min read
A framework for ensuring robots perform safely and effectively in human interactions.
― 7 min read
This article discusses ways to enhance safety in RL using language models.
― 5 min read
A method to assess AI agents' evaluations for safety and reliability.
― 8 min read
A new method enhances detection of unexpected data in machine learning models.
― 6 min read
RADER system improves robotic learning through safe demonstrations in extended reality.
― 6 min read
Examining how training data impacts language model outputs and safety measures.
― 6 min read
New training method improves LLM safety and performance.
― 7 min read