Xiaoyuan Yi’s Post

View profile for Xiaoyuan Yi, graphic

Microsoft Research Asia - Senior Researcher

The second paper of our Value Compass project, Value FULCRA, is accepted by NAACL (main conference)! 😁 We propose a novel alignment paradigm based on Schwartz's Theory of Basic Human Values with a new dataset and a corresponding BaseAlign algorithm. Better harmfulness reduction, more flexible alignment targets, reflecting true human values and covering multiple AI safety issues with only 20% data compared to the HH dataset! https://lnkd.in/gnF5Hpw4 Dataset coming soon! 👍 Jing Yao Xing Xie Xiting Wang

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics