# AI News Digest - 2026-02-09

1. OpenClaw was found to contain hundreds of skills that were laced with Trojans and data-stealing malware, which turned the AI agent into a malware delivery system and prompted mitigation actions by OpenClaw and VirusTotal.

2. The WorldVQA benchmark showed that leading multimodal models still failed to reach 50% accuracy on basic visual entity recognition, with Gemini 3 Pro scoring 47.4% and models often asserting incorrect specific labels with high confidence.

3. Claude Opus 4.6 claimed the top spot on the Artificial Analysis Intelligence Index, surpassing GPT-5.2, while the report noted that OpenAI's Codex 5.3 remained pending and that Opus's token costs were higher than some competitors.

4. Researchers reported that reasoning models such as DeepSeek-R1 generated internal ensembles resembling teams of experts, a "society of thought" with contrasting internal voices, and that this internal debate measurably improved problem-solving performance.

# References

1. [Malicious skills turn AI agent OpenClaw into a malware delivery system](https://the-decoder.com/malicious-skills-turn-ai-agent-openclaw-into-a-malware-delivery-system/)

2. [Best multimodal models still can't crack 50 percent on basic visual entity recognition](https://the-decoder.com/best-multimodal-models-still-cant-crack-50-percent-on-basic-visual-entity-recognition/)

3. [Claude Opus 4.6 takes the top spot on Artificial Analysis Intelligence Index, but OpenAI's Codex 5.3 looms](https://the-decoder.com/claude-opus-4-6-takes-the-top-spot-on-artificial-analysis-intelligence-index-but-openais-codex-5-3-looms/)

4. [Study finds AI reasoning models generate a "society of thought" with arguing voices inside their process](https://the-decoder.com/study-finds-ai-reasoning-models-generate-a-society-of-thought-with-arguing-voices-inside-their-process/)
