홈

소식

블로그

포트폴리오

구독

AI 뉴스 요약 - 2025년 12월 12일

범주

비어 있음

1.

OpenAI는 GPT-5.2를 출시했는데, 이는 AI 벤치마크에서 구글의 제미니 3를 능가했으며, GPT-5.1 출시 후 4주 만에 상당한 벤치마크 개선을 이루었습니다.

2.

구글은 업데이트된 딥 리서치 에이전트를 출시하고, 새로운 API를 통해 개발자들이 이를 사용할 수 있도록 개방했으며, 복잡한 웹 검색을 위한 오픈 소스 벤치마크를 공개했습니다.

3.

구글은 앤트로픽의 모델 컨텍스트 프로토콜(MCP)을 자사의 클라우드 인프라에 통합하여 MCP를 통해 AI 모델이 자사의 인프라에 접근할 수 있도록 했습니다.

4.

구글 딥마인드는 FACTS 벤치마크 결과를 발표했는데, 제미니 3 프로와 GPT-5.1을 포함한 최상위 모델조차도 사실성 측면에서 어려움을 겪는다는 것을 보여주었습니다.

참고 자료

1.

https://the-decoder.com/gpt-5-2-lands-to-top-googles-gemini-3-in-the-ai-benchmark-game-just-four-weeks-after-gpt-5-1/

GPT-5.2 lands to top Google's Gemini 3 in the AI benchmark game just four weeks after GPT-5.1

Just four weeks after releasing GPT-5.1, OpenAI is back with GPT-5.2 and some substantial benchmark improvements.

the-decoder.com

1.

https://the-decoder.com/google-opens-updated-deep-research-agent-to-developers-with-new-api/

Google opens updated Deep Research Agent to developers with new API

Google releases a more powerful version of its Deep Research Agent and opens it to developers for the first time. The company also introduces a new open-source benchmark for complex web searches.

the-decoder.com

1.

https://the-decoder.com/google-opens-its-infrastructure-for-ai-models-via-mcp/

Google opens its infrastructure for AI models via MCP

Google is integrating Anthropic's Model Context Protocol (MCP) directly into its cloud infrastructure.

the-decoder.com

1.

https://the-decoder.com/facts-benchmark-shows-that-even-top-ai-models-struggle-with-the-truth/

FACTS benchmark shows that even top AI models struggle with the truth

A new benchmark from Google Deepmind aims to measure AI model reliability more comprehensively than ever before. The results reveal that even top-tier models like Gemini 3 Pro and GPT-5.1 are far from perfect.

the-decoder.com

Slashpage로 제작됨