The Dire Defect of ‘Multilingual’ AI Content Moderation
Three parts Bosnian text. Thirteen parts Kurdish. Fifty-five parts Swahili. Eleven thousand parts English. This is part of the data recipe for Facebook’s new large language model, which the company claims...