ToxicChat: Unveiling Hidden Challenges of Toxicity Detection

ToxicChat: Unveiling Hidden Challenges of Toxicity Detection

Tech Xplore

ToxicChat is a new benchmark developed by University of California San Diego computer scientists. It is based on examples gathered from real-world interactions between users and an AI-powered chatbot. This is the type of toxic prompt, cloaked in benign language, that can be detected far better than by models trained on previous toxicity benchmarks.

#SCIENCE #English #ET
Read more at Tech Xplore