AI Tools Nav
HomeToolsCompareGuideNewsSkills
中
AI Tools Nav

Curated AI tools directory — from choosing to mastering, all in one place.

RSSAPI

Navigation

  • Home
  • Tools
  • Compare
  • Guide
  • News
  • Skills

Platform

  • Overview
  • API
  • RSS
  • Submit

About

  • About Us
  • Changelog
© 2026 AI Tools Nav - AI Tools Directory
AI News

AI News Brief | 2026-05-20

Google I/O dominates with Gemini 3.5, Gemini Spark, and proactive agentic apps. OpenAI adds C2PA/SynthID watermarks, Anthropic hires Andrej Karpathy, SpaceX plans to acquire Cursor for $60B, and China’s AI players clash with price cuts and new models.

2026-05-20

AI Industry Overview

Today’s AI landscape is shaped by two massive events: Google I/O’s barrage of Gemini announcements and OpenAI’s push for content provenance. Google unveiled its flagship Gemini 3.5 model and a new AI agent called Gemini Spark, while also making the Gemini app more proactive and announcing subscription tier updates. On the safety front, OpenAI began adding C2PA metadata and SynthID watermarks to all AI-generated images from ChatGPT and DALL·E 3, aiming to increase transparency. Additionally, the general AI news roundup highlighted a dozen other stories, from China’s price wars to startup valuations.


Tool-by-Tool Breakdown

Gemini

Google’s Gemini ecosystem dominated I/O. The company introduced Gemini 3.5, a flagship model with improved reasoning, and Gemini Spark, a new agent designed for autonomous task completion. The Gemini app is now “more agentic” – it can proactively offer help 24/7 and analyze images for AI-generated content provenance (CNBC, Google Blog). Google also updated its AI subscription plans (Google Blog) and Sundar Pichai’s keynote set the stage for a year of agentic AI (Washington Post).

ChatGPT / OpenAI

OpenAI announced Guaranteed Capacity – a new enterprise offering that lets customers reserve compute for inference and training (CNBC). In parallel, the company began rolling out C2PA metadata and Google’s SynthID watermarks on all images generated by ChatGPT and DALL·E 3, making it easier to trace AI-generated content (The Next Web).

Claude / Anthropic

Anthropic landed a major hire: Andrej Karpathy, co-founder of OpenAI and former Tesla AI lead, joins the team (CNBC). Separately, Claude Managed Agents are now available on Cloudflare, bringing Anthropic’s models to edge workers (Cloudflare Blog). Claude Code also received an update to v2.1.145.

Cursor

In a stunning business move, SpaceX plans to acquire Cursor for $60 billion after the coding startup’s IPO later this year (The Next Web). Meanwhile, Cursor launched Composer 2, a major update to its AI code composer, and published new changelog entries (Cursor Blog).

Bolt.new

Bolt.new announced that Claude Sonnet 4.6 is now available in the platform, alongside a “Bolt for Teams” upgrade that improves collaborative coding (Bolt Blog, Bolt Blog). An industry analysis highlighted why vibe-coded prototypes still need a design system.

DeepSeek

DeepSeek rolled out image recognition capabilities ahead of the V4.1 model update. The company also slashed prices for its new model, positioning itself as a cost leader in China’s competitive AI market. Notably, DeepSeek’s funding is increasingly backed by state capital (Yahoo Finance).

Devin

Cognition released Devin 2.1, which includes new automation features and improved triage. The update introduces “Auto-Triage” for faster bug fixing. Cursor’s Composer 2.5 release was also noted in the same coding newsletter (Cognition Labs, CodeNewsletter).

Doubao

ByteDance’s Doubao is exploring paid subscription tiers to move toward profitability, while also cutting some features. The company launched Doubao-Seed-2.0-lite, a full-modal model that can listen, watch, and act. Forbes covered China’s AI self-correction cycle amid these shifts (Forbes, Chinadaily).

ElevenLabs

ElevenLabs partnered with Splice to create new AI audio products for music production. The company also released a changelog update on April 20, 2026. Google Gemini’s ability to detect AI-generated content was also linked to ElevenLabs’ detection work (Billboard, ElevenLabs Docs).

Flux

Flux AI launched CRAISEE Teams Enterprise, described as the first scalable AI solution for teams. The company also published general news and a changelog, focusing on enterprise deployment.

GitHub Copilot

GitHub announced that Gemini 3.5 Flash is now generally available for Copilot, offering a fast, cost-efficient model for simple coding tasks. The May 2026 changelog also included updates to Copilot Cloud Agent.

Kimi

Kimi released WebBridge, an open-source tool that turns AI into a local browser operator. Cerebras also brought Kimi K2.6 inference to enterprises, leveraging their wafer-scale chips. The startup is raising funds at a $20B valuation (Forbes).

Kling

Kuaishou’s AI video division Kling launched the 3.0 model, enabling users to create professional-grade videos with simple prompts. The division targets a $20B valuation, though a key founder moved to Alibaba (PRNewswire).

MetaSo AI

MetaSo published its roadmap and documentation, with a notable commit migrating extensions to a new plugin-sdk architecture. The project is positioning itself as a decentralized AI search and knowledge layer (GitHub).

Midjourney

Midjourney released V8 Alpha, its latest model version, and updated its release notes for May 2026. The release focuses on improved photorealism and style consistency.

Notion AI

Notion introduced its Developer Platform, allowing third-party AI agents and workflow automation to integrate deeply with Notion. Version 3.5 of the platform was released on May 13, 2026 (Notion Blog, InfoWorld).

Perplexity

Perplexity faced backlash after limiting usage for Pro accounts. The company blamed promo code abuse for the tightened limits, but users reported significant reductions. The incident sparked discussions about AI service sustainability (PCMag).

Pika

Pika raised $80M in Series B funding from investors including Stanford, positioning itself as a faster, more affordable alternative to OpenAI’s Sora. The startup is now valued at $470M and aims to make AI video creation consumer-first (Washington Post via Neuron Expert).

Qwen

Alibaba integrated its Qwen model with Taobao to create a conversational shopping experience. The company also published new model releases on Qwen Cloud and general news on the Qwen AI site.

Sora

OpenAI confirmed it will discontinue Sora, its AI video platform. The New York Times analyzed the decision as part of OpenAI’s strategy to focus on core products ahead of a potential IPO (NYT, CNN).

Consensus

Consensus AI showcased how it uses GPT-5 and the Responses API to complete weeks of research in minutes. The tool is also being adopted by nonprofits for research discovery.

DALL·E 3

As part of OpenAI’s content provenance push, DALL·E 3 now includes C2PA metadata and SynthID watermarks. A review notes that DALL·E 3 may be retired soon as OpenAI consolidates its image generation pipeline.

Descript

Descript launched its API in Open Beta, allowing developers to programmatically edit video and audio. The release roundup for April 2026 included workflow updates and bug fixes.

ERNIE (Baidu)

Baidu released ERNIE 5.1, claiming it tops multiple leaderboards and cuts pre-training costs to just 6% of industry peers using a novel “once-for-all” framework. The model is now available on Baidu Cloud.

Gamma

Gamma, the AI-powered presentation tool, launched Gamma Imagine to bring AI-native design to slides. The company previously raised $68M from investors, positioning itself as a PowerPoint replacement for the AI era.

Grok

Elon Musk’s xAI showcased Grok Agent Mode in a 4+ minute video, calling it a “major ability unlock.” Meanwhile, Grok experienced a temporary outage affecting users globally, and xAI deprecated older model versions on May 15 (Economic Times, Gate News).

Hailuo AI (MiniMax)

MiniMax’s Hailuo 2.0 added Start & End Frames capability, enabling precise video scene transitions. The company also released Hailuo 2.3 with better physics dynamics and showcased its AI vision at Cannes during WAIFF 2026.

HeyGen

HeyGen announced that its Avatar Agent is now live on Agent.ai, allowing conversational AI avatars to be deployed easily. The March and April 2026 product updates included new voice and lip-sync improvements.

Ideogram

Ideogram released Version 3.0, emphasizing realism, design, and consistent style generation. A Digital Trends article suggested it as a strong alternative to Gemini and ChatGPT for AI image generation.

iFlyrec (iFlytek)

iFlytek’s iFlyrec audio transcription tool surpassed 100 million users, becoming the company’s second core product at that scale. The tool is particularly popular among highly educated users, with iFLYTEK Spark (the underlying model) seeing high adoption in professional settings.


This brief was compiled from the latest AI news on May 19–20, 2026. Sources are linked inline.

Tools Mentioned

Bolt.new→ChatGPT→Claude→Cursor→DeepSeek→Devin→豆包→ElevenLabs→Flux→Gemini→GitHub Copilot→Kimi→可灵→秘塔 AI→Midjourney→Notion AI→Perplexity→Pika→通义千问→Sora→Consensus→DALL·E 3→Descript→文心一言→Gamma→Grok→海螺 AI→HeyGen→Ideogram→讯飞听见→
Featured
B
Freemium

Bolt.new

StackBlitz's browser-based AI full-stack app generator that creates runnable web apps from prompts with one-click deploy.

AgentCodingFull StackDeploy
Featured
C
Freemium

ChatGPT

OpenAI's conversational AI assistant supporting text generation, coding, creative writing, and more.

ChatChatWritingCoding
Featured
C
Freemium

Claude

Anthropic's AI assistant, excels at long-text analysis, code review, and complex reasoning.

ChatChatCodingAnalysis
Featured
C
Freemium

Cursor

VS Code-based AI-first code editor with powerful AI completions, inline editing, and codebase chat.

AgentEditorCodingCode Completion
Featured
D
Freemium

DeepSeek

High-performance LLM from DeepSeek achieving GPT-4-level performance at a fraction of the cost, supporting 128K context and deep reasoning.

ChatChatCodingReasoning
Featured
D
Paid

Devin

Cognition AI's fully autonomous AI software engineer that independently handles the full dev cycle from requirements to deployment.

AgentCodingAutomationFull Stack
Featured
D
Free

Doubao

ByteDance's AI assistant with text-to-image, voice chat, web search and other multimodal capabilities, excellent Chinese experience.

ChatChatMultimodalSearch
Featured
E
Freemium

ElevenLabs

Leading AI voice synthesis platform supporting multilingual text-to-speech and voice cloning.

AudioTTSVoice CloneMultilingual
Featured
F
Freemium

Flux

Image generation model by the original Stable Diffusion team at Black Forest Labs, with industry-leading image quality and text rendering.

ImageImage GenHigh QualityText Render
Featured
G
Freemium

Gemini

Google's multimodal AI model, deeply integrated with Google ecosystem, supporting text, image, and code understanding.

ChatChatMultimodalSearch
Featured
G
Freemium

GitHub Copilot

GitHub's AI coding assistant deeply integrated with VS Code, JetBrains, and other IDEs, supporting code completion and conversational coding.

AgentCode CompletionIDE IntegrationCoding
Featured
K
Freemium

Kimi

Moonshot AI's assistant known for ultra-long context (2M characters), excelling at document analysis, long-form summarization, and deep research.

ChatChatLong ContextAnalysis
Featured
K
Freemium

Kling

Kuaishou's AI video generation tool, creating high-quality short videos from text and images with realistic physics effects.

VideoVideo GenShort VideoPhysics
Featured
M
Free

MetaSo AI

AI search engine providing structured answers and mind maps, ideal for deep research and knowledge organization.

SearchSearchResearchMind Map
Featured
M
Paid

Midjourney

Top-tier AI image generation tool, renowned for artistic style and high-quality output.

ImageImage GenArtDesign
Featured
N
Paid

Notion AI

Notion's built-in AI features for writing assistance, summarization, translation, and brainstorming.

OfficeWritingSummarizationCollaboration
Featured
P
Freemium

Perplexity

AI search engine combining LLM with real-time web search, providing accurate answers with citations and deep research mode.

SearchSearchResearchReal-time
Featured
P
Freemium

Pika

AI video generation tool by Pika Labs, supporting text/image-to-video with rich creative effects and easy operation.

VideoVideo GenCreativeText-to-Video
Featured
Q
Freemium

Qwen

Alibaba's LLM series covering chat, coding, multimodal, and more, supporting long context and complex reasoning.

ChatChatCodingMultimodal
Featured
S
Paid

Sora

OpenAI's text-to-video model, capable of generating high-quality videos up to one minute long.

VideoVideo GenText-to-Video
C
Freemium

Consensus

AI academic search engine that extracts and summarizes findings directly from research papers.

SearchAcademicPapersResearch
D
Freemium

DALL·E 3

OpenAI's image generation model, excels at understanding complex prompts and creating accurate, creative images.

ImageImage GenCreative
D
Freemium

Descript

AI-powered audio/video editor that lets you edit podcasts and videos like documents, with auto-transcription, filler removal, and AI voiceover.

AudioPodcastVideo EditTranscription
E
Freemium

ERNIE

Baidu's knowledge-enhanced LLM, deeply integrated with Baidu Search, excelling in Chinese language understanding and knowledge Q&A.

ChatChatKnowledgeChinese
G
Freemium

Gamma

AI-powered presentation and document creation tool for quickly generating polished slides and web docs.

OfficeSlidesDocsPPT
G
Freemium

Grok

xAI's AI assistant featuring real-time X platform data access, humorous conversational style, and deep thinking mode.

ChatChatReal-timeReasoning
H
Freemium

Hailuo AI

MiniMax's AI voice and creation platform, supporting voice synthesis, AI music, and intelligent conversation.

AudioTTSMusicChat
H
Paid

HeyGen

AI avatar video generation platform supporting virtual hosts, video translation, and lip-sync, widely used in marketing and education.

VideoAvatarVideo TranslateLip Sync
I
Freemium

Ideogram

AI image generation tool specializing in text rendering, capable of accurately generating clear text and logos within images.

ImageImage GenText RenderLogo
i
Freemium

iFlyrec

iFlytek's speech-to-text platform, supporting real-time voice recognition, meeting transcription, and multi-language translation.

AudioSpeech-to-TextMeetingTranslation
← Back to AI News