AI News Digest - February 20, 2026

1.

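Microsoft has published a blueprint for proving the authenticity of online content. After evaluating 60 combinations of verification methods against a range of failure scenarios, it recommends a technical standard that combines provenance manifests, machine-readable watermarks, and cryptographic fingerprints.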

2.

Google has released Gemini 3.1 Pro, an updated model whose performance on a demanding reasoning benchmark has more than doubled compared with its predecessor.

3.

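Google DeepMind has published research calling for rigorous evaluation of moral reasoning in large language models. To distinguish genuine moral competence from superficial responses, it proposes techniques such as robustness testing, chain-of-thought monitoring, and mechanistic interpretability.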

4.

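David Silver is raising $1 billion in a seed round for Ineffable Intelligence, his London-based startup, which is pursuing a reinforcement-learning-driven approach to a continually learning superintelligence that does not rely on large language models.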

5.

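OpenAI and Paradigm have released EVMbench, a benchmark that measures how well AI agents can find, fix, and exploit vulnerabilities in Ethereum smart contracts. The results indicate that agents can exploit most of the vulnerabilities autonomously.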

References

1.

https://www.technologyreview.com/2026/02/19/1133360/microsoft-has-a-new-plan-to-prove-whats-real-and-whats-ai-online/

Microsoft has a new plan to prove what’s real and what’s AI online

A new proposal calls on social media and AI companies to adopt strict verification, but the company hasn’t committed to following its own recommendations.

technologyreview.com

2.

https://the-decoder.com/google-releases-gemini-3-1-pro-with-improved-reasoning-capabilities/

Google releases Gemini 3.1 Pro with improved reasoning capabilities

With Gemini 3.1 Pro, Google wants to improve the core intelligence of its model family. On a demanding reasoning benchmark, performance has more than doubled compared to its predecessor. But benchmarks are just that: benchmarks.

the-decoder.com

3.

https://www.technologyreview.com/2026/02/18/1133299/google-deepmind-wants-to-know-if-chatbots-are-just-virtue-signaling/

Google DeepMind wants to know if chatbots are just virtue signaling

We need to better understand how LLMs address moral questions if we're to trust them with more important tasks.

technologyreview.com

4.

https://the-decoder.com/deepmind-veteran-david-silver-raises-1b-seed-round-to-build-superintelligence-without-llms/

Deepmind veteran David Silver raises $1B seed round to build superintelligence without LLMs

Long-time DeepMind researcher David Silver is raising one billion dollars for his London-based AI start-up Ineffable Intelligence, the largest seed round in European start-up history. Instead of training on internet text like today's LLMs, Silver is betting on reinforcement learning in simulated environments to build an "endlessly learning superintelligence."

the-decoder.com

5.

https://the-decoder.com/new-benchmark-shows-ai-agents-can-exploit-most-smart-contract-vulnerabilities-on-their-own/

New benchmark shows AI agents can exploit most smart contract vulnerabilities on their own

OpenAI and crypto investment firm Paradigm have built EVMbench, a benchmark that measures how well AI agents can find, fix, and exploit security vulnerabilities in Ethereum smart contracts.

the-decoder.com
