Unpacking the bias of large language models
MIT News || A team of MIT researchers, including TILOS Foundations team member and associate professor Stefanie Jegelka, and postdoctoral scholar Yifei Wang, has developed a theoretical framework to study how information flows through the machine learning (ML) architecture that forms the backbone of LLMs. Their work has uncovered the root cause of “position bias” […]