Overall, 6.68% of all responses contained some degree of profanity, hate speech, or extremist narratives. By contrast, Claude-3 Opus effectively blocked all of the same toxic prompts.