Horizon Summary: 2026-05-21 (EN)

From 47 items, 13 important content pieces were selected

SpaceX S-1 reveals $1.25B/month AI compute deal with Anthropic ⭐️ 10.0/10
OpenAI model disproves long-standing Erdős conjecture ⭐️ 9.0/10
GitHub confirms breach of 3,800 repos via malicious VSCode extension ⭐️ 9.0/10
Qwen3.7-Max: The Agent Frontier Model ⭐️ 9.0/10
Google’s AI Strategy Declared War on Web Traffic Ecosystem ⭐️ 8.0/10
Railway Details GCP Suspension, Plans to Reduce Dependency ⭐️ 8.0/10
Meta Blocks Human Rights Accounts in Saudi Arabia, UAE ⭐️ 8.0/10
MGLRU integration challenges at LSFMM+BPF 2026 ⭐️ 8.0/10
Qwen 3.6 35B GGUF: NTP vs MTP Quantization Benchmarks Released ⭐️ 8.0/10
Cohere Releases Command A+ 218B Sparse MoE Model ⭐️ 8.0/10
Google and OpenAI Launch AI Content Detection Tools ⭐️ 8.0/10
Compromised Routers Used in Phishing Espionage Campaign ⭐️ 8.0/10
Study: 34% of AI model tests show data fabrication ⭐️ 8.0/10

SpaceX S-1 reveals $1.25B/month AI compute deal with Anthropic ⭐️ 10.0/10

SpaceX’s S-1 filing with the SEC reveals a cloud services agreement with Anthropic for access to compute capacity across COLOSSUS and COLOSSUS II, with Anthropic agreeing to pay $1.25 billion per month through May 2029. This massive commitment signals unprecedented demand for AI compute infrastructure and highlights the strategic importance of partnerships between AI labs and infrastructure providers, potentially reshaping the cloud computing business model. The agreement includes a ramp-up period in May and June 2026 at a reduced fee and can be terminated by either party with 90 days’ notice. COLOSSUS I houses over 220,000 NVIDIA H100, H200, and other GPUs.

rss · Simon Willison · May 20, 22:26

Background: SpaceX operates COLOSSUS I and II, massive AI supercomputers built with NVIDIA GPUs, initially used for training Grok. Anthropic is an AI safety company behind the Claude large language model. This partnership provides Anthropic dedicated compute capacity for training and inference of its models.

References

Discussion: Commenters were stunned by the scale of the deal, with some questioning SpaceX’s overall profitability given a $4.9B net loss in 2025. Others speculated on the viability of space-based data centers, while noting that Starlink’s profitability supports such AI infrastructure bets.

Tags: #AI, #cloud computing, #SpaceX, #Anthropic, #compute infrastructure

OpenAI model disproves long-standing Erdős conjecture ⭐️ 9.0/10

An internal OpenAI model has found an infinite family of counterexamples to a central conjecture in discrete geometry, the Erdős unit distance problem, which has been open since 1946. The proof was formally verified and accepted by a panel of external mathematicians. This marks the first time an AI model has disproved a major long-standing conjecture, demonstrating capabilities that go beyond pattern matching to genuine scientific discovery. It could accelerate mathematical research by providing novel insights and constructions that humans might miss. The conjecture was posed by Paul Erdős in 1946 concerning the maximum number of times a single distance can occur among n points in the plane. The model’s counterexample yields a polynomial improvement over previous lower bounds, and the proof leverages ideas from algebraic number theory.

hackernews · tedsanders · May 20, 19:05 · Discussion

Background: Discrete geometry studies combinatorial properties of discrete objects like points, lines, and circles. The Erdős unit distance problem asks: given n points in the plane, what is the maximum number of pairs that are exactly one unit apart? For decades, the best known upper and lower bounds remained far apart. OpenAI’s model searched for constructions that maximize unit distances, leading to a new infinite family that disproves Erdős’s original conjecture.

References

Discussion: Comments reflect a mix of excitement and philosophical debate: some see it as evidence that LLMs can discover genuine new mathematics, while others note that disproving by counterexample is less theoretically profound than proving a theorem. Several commenters draw parallels to specialized AI like Stockfish in chess, predicting a future where AI mathematicians are common.

Tags: #AI, #mathematics, #discrete geometry, #LLMs, #research

GitHub confirms breach of 3,800 repos via malicious VSCode extension ⭐️ 9.0/10

GitHub confirmed that an employee’s workstation was compromised by a malicious VSCode extension, leading to unauthorized access to approximately 3,800 internal repositories. The company has removed the malicious extension, isolated the endpoint, and rotated critical keys, with investigations ongoing. This incident underscores the critical risk of supply-chain attacks through IDE extensions, especially on a platform like GitHub that hosts millions of developers. It highlights the need for stricter security policies around extension installation and dependency management in development environments. The breach affected internal GitHub repositories but the company stated there is no evidence of customer code or enterprise repositories being compromised. Security sources suggest that leaked content may include source code for core projects like Copilot and CodeQL.

hackernews · Timofeibu · May 20, 13:43 · Discussion

Background: A supply chain attack targets less secure elements in the software development process, such as third-party extensions or dependencies. VSCode extensions, while enhancing functionality, can introduce significant security risks if malicious code is executed. The Visual Studio Marketplace has protections, but users often install extensions without thorough vetting. This incident is a concrete example of how a single compromised extension can lead to a large-scale data breach.

References

Discussion: Commenters expressed frustration with the lack of security policies for extensions, with one noting that while their company has strict software installation rules, extensions and npm/pip packages are unregulated. Another quipped that Microsoft (owner of VSCode, NPM, and GitHub) should collaborate on a solution. There was also concern about why the malicious extension was not named.

Tags: #security, #GitHub, #VSCode, #supply-chain attack, #vulnerability

Qwen3.7-Max: The Agent Frontier Model ⭐️ 9.0/10

Alibaba’s Qwen team released Qwen3.7-Max, a frontier AI model designed for agentic tasks, claiming state-of-the-art non-hallucination performance and 10x speedup in a 35-hour optimization task on domestic chips. This release demonstrates that open-source models are approaching the frontier of proprietary models, offering a free alternative to expensive services like Claude Code. It also showcases significant progress in long-horizon autonomous agents and non-hallucination, which are critical for enterprise adoption. Qwen3.7-Max supports a 1M token context window, integrates seamlessly with frameworks like Claude Code and OpenClaw, and scored 57 on the Artificial Analysis Intelligence Index. The model achieved SOTA non-hallucination rate on AA-omniscience benchmarks, outperforming Opus 4.7 and GPT-5.5 according to community claims.

hackernews · kevinsimper · May 20, 10:35 · Discussion

Background: AI hallucination refers to models generating confident but factually incorrect outputs. The ‘agent frontier’ concept involves autonomous AI agents that can execute long-horizon tasks with minimal human intervention. Qwen is Alibaba’s large language model series, with Qwen3.7-Max being their latest flagship agent-oriented model.

References

Discussion: The community is excited about the non-hallucination performance being SOTA, with users noting it as a viable free alternative to Claude Code. Some commentators wish for US-based hosting options, while others criticize the lack of direct comparisons against latest competitor versions in benchmarks.

Tags: #AI, #large language models, #Qwen, #open-source AI, #benchmarks

Google’s AI Strategy Declared War on Web Traffic Ecosystem ⭐️ 8.0/10

A new article argues that Google’s AI-first approach is effectively a war on the web, as it threatens the traffic-driven ecosystem that sustains content creators and independent websites. This shift could fundamentally alter the symbiotic relationship between search engines and content creators, potentially undermining the economic model that has supported much of the open web for decades. The author claims that Google’s AI-generated answers directly in search results reduce the need for users to click through to websites, thereby cutting off traffic and revenue for content producers. The article highlights that this approach prioritizes keeping users within Google’s ecosystem over sending them to external sites.

hackernews · cdrnsf · May 20, 21:33 · Discussion

Background: The web has long operated on a symbiotic model where websites allow search engines like Google to crawl and index their content in exchange for referral traffic. Google’s increasing use of AI to provide direct answers threatens this model by potentially making website visits unnecessary. This echoes broader concerns about AI’s impact on content creation and monetization.

Discussion: Commenters express concern that AI will allow only large corporations to monetize content, while individuals create for personal enjoyment without financial reward. Some note the broken symbiotic relationship and call for alternatives to Google for driving traffic. Others worry about the accuracy of AI summaries, especially when they are incorrect.

Tags: #Google, #AI, #Web, #Search, #Content Creation

Railway Details GCP Suspension, Plans to Reduce Dependency ⭐️ 8.0/10

Railway published an incident report on May 19, 2026, detailing how a Google Cloud account suspension caused a major service disruption, and announced plans to remove Google Cloud services from their data plane’s hot path. This incident underscores the risks of relying on a single cloud provider, especially one known for arbitrary account suspensions, prompting Railway and potentially others to rethink vendor lock-in and infrastructure resilience. The report includes a timeline of events and Railway’s admission that they should have anticipated this failure; they now plan to keep GCP only for secondary/failover purposes.

hackernews · 0xedb · May 20, 08:37 · Discussion

Background: Railway is a cloud platform that likely used Google Cloud for critical infrastructure. Google Cloud Platform (GCP) has a history of suspending accounts for policy violations, sometimes affecting legitimate businesses. Vendor lock-in occurs when a company becomes overly dependent on a single provider, making it difficult to switch. This incident highlights the operational risks of such dependency.

Discussion: Commenters expressed strong criticism of GCP’s arbitrary account suspensions and lack of trust, with one noting that ‘Google can no longer be trusted as a B2B service provider.’ Railway’s candid acknowledgment and plan to reduce dependency were praised, though some questioned the unrevealed root cause for the suspension.

Tags: #Google Cloud, #incident report, #cloud reliability, #vendor lock-in, #infrastructure

Meta Blocks Human Rights Accounts in Saudi Arabia, UAE ⭐️ 8.0/10

Meta has systematically blocked human rights accounts from reaching audiences in Saudi Arabia and the United Arab Emirates, preventing their content from being visible to local users. This raises serious concerns about content moderation, corporate responsibility, and freedom of speech, as Meta appears to comply with local censorship demands at the expense of human rights advocacy. The blocking affects accounts focused on human rights issues, and users in the UAE report that even the website reporting this news (alqst.org) is blocked, requiring VPN access.

hackernews · giuliomagnifico · May 20, 12:43 · Discussion

Background: Saudi Arabia and the UAE have strict internet censorship laws that criminalize criticism of the government and human rights advocacy. Social media platforms like Meta often face pressure to comply with local laws to avoid being banned, which can lead to restrictions on content that would otherwise be allowed globally.

Discussion: Commenters express frustration with Meta’s prioritization of profits over principles, with one noting that short-term growth at all costs undermines ethical standards. Another points out that in the UAE, even reading about censorship is blocked, highlighting the extent of control.

Tags: #content moderation, #censorship, #human rights, #Meta, #social media

MGLRU integration challenges at LSFMM+BPF 2026 ⭐️ 8.0/10

At the 2026 Linux Storage, Filesystem, Memory Management, and BPF Summit, three sessions focused on the multi-generational LRU (MGLRU), discussing unification with the traditional LRU, performance improvements, and Android-specific issues. Shakeel Butt proposed a four-step plan to unify the kernel’s two reclaim subsystems. MGLRU’s messy integration with the traditional LRU doubles maintenance effort and complicates memory management. Unifying them would improve kernel maintainability, performance consistency, and benefit users from servers to Android devices. The MGLRU code resides in the same file (mm/vmscan.c) alongside traditional LRU, leading to over 8,000 lines, with 40% being MGLRU-specific duplication. Butt’s plan includes separating code into own files, defining evaluation workloads, finding common features, and comparing implementations.

rss · LWN.net · May 20, 13:14

Background: The multi-generational LRU is an alternative page reclaim algorithm merged into Linux 6.1 in 2022, aiming to improve performance under memory pressure. However, it added complexity without replacing the traditional LRU, leading to two parallel subsystems. The LSFMM+BPF summit is a yearly gathering of Linux kernel developers discussing storage, filesystem, memory management, and BPF topics.

References

Discussion: Session participants generally agreed on the need for cleanup but debated the approach; some worried that separating files might confuse Git history, while others felt it was necessary. Johannes Weiner noted that MGLRU has little Git history, making separation less problematic.

Tags: #Linux kernel, #memory management, #MGLRU, #reclaim, #Android

Qwen 3.6 35B GGUF: NTP vs MTP Quantization Benchmarks Released ⭐️ 8.0/10

ByteShape has released GGUF quantizations of the Qwen 3.6 35B model in two families: standard Next Token Prediction (NTP) and Multi-Token Prediction (MTP). Their benchmarks show MTP boosts GPU generation speed by 20-40% under favorable workloads, but offers no benefit on CPU. This comparison provides practical guidance for local LLM deployment, helping users choose between speed and model size. It also highlights the workload-dependent nature of MTP, encouraging nuanced quantization selection rather than blindly using the highest quant. The largest NTP quant (GPU-5 at 4.15bpw) was often the best in quality and speed. MTP increases memory footprint, which may change which quant fits in VRAM. MMLU benchmarks were excluded due to answer-format compliance issues in full precision.

reddit · r/LocalLLaMA · enrique-byteshape · May 20, 15:42 · Discussion

Background: GGUF is a file format for storing quantized large language models, enabling efficient inference on consumer hardware. Next Token Prediction (NTP) is the standard decoding method where one token is generated per step, while Multi-Token Prediction (MTP) attempts to predict multiple future tokens in parallel, potentially speeding up generation. However, MTP’s effectiveness depends on the model, hardware, and workload.

References

Discussion: The community response has been positive, with many users thanking the team for the detailed benchmarks. Some users asked about Qwen 3.6 27B quantizations and higher-bit GGUFs. One technical comment questioned whether running both ngram-mod and MTP simultaneously is possible in llama.cpp, while another raised concerns about accuracy at long context.

Tags: #Qwen, #GGUF, #quantization, #MTP, #local-LLM

Cohere Releases Command A+ 218B Sparse MoE Model ⭐️ 8.0/10

Cohere has released Command A+, a 218B-parameter sparse mixture-of-experts (MoE) model with 25B active parameters, under an Apache-2.0 license, supporting both text and image inputs. This release is significant as it provides a large-capacity open-weight multimodal model that balances total knowledge and inference efficiency, benefiting researchers and developers who need high performance with manageable compute costs. The model has 218B total parameters, but only 25B are active per token, drastically reducing inference cost compared to a dense model of similar size. It is licensed under Apache-2.0 and supports text and image inputs to produce text outputs.

reddit · r/LocalLLaMA · coder543 · May 20, 15:41 · Discussion

Background: Sparse mixture-of-experts (MoE) models replace dense feed-forward layers with multiple ‘expert’ sub-networks, activating only a subset per token via a learned router. This design allows the model to have a large total parameter count while keeping the per-token computation similar to a much smaller dense model. ‘Active parameters’ refer to the parameters actually used during a forward pass, which is key to understanding the efficiency of MoE architectures.

References

Discussion: The community reaction is generally positive, with users praising the open Apache-2.0 license and Cohere’s continued contribution to open-weight models. Some expressed hope for llama.cpp support and noted that while the model is interesting, it may be slower than competitors like MiniMax. A few users wished for a reasoning-focused release but acknowledged the value of openness.

Tags: #MoE, #open-weight model, #Cohere, #multimodal, #large language model

Google and OpenAI Launch AI Content Detection Tools ⭐️ 8.0/10

Google is integrating SynthID into Search and Chrome to detect AI-generated images, video, and audio, while OpenAI released a verification tool that checks for C2PA metadata and SynthID watermarks. This collaboration between major AI companies on content provenance standards marks a significant step toward combating misinformation and increasing transparency in digital media. Google’s tool works via Google Lens or Circle to Search, and the detection is based on C2PA standards and embedded metadata. OpenAI’s tool can identify content generated by ChatGPT, OpenAI API, or Codex.

telegram · zaihuapd · May 20, 00:03

Background: SynthID is a watermarking technology developed by Google DeepMind that embeds imperceptible digital watermarks into AI-generated content. C2PA (Coalition for Content Provenance and Authenticity) is an open technical standard for tracing the origin and edits of digital media. These tools help users verify whether content was created or altered by AI.

References

Tags: #AI detection, #SynthID, #C2PA, #digital transparency, #content verification

Compromised Routers Used in Phishing Espionage Campaign ⭐️ 8.0/10

Chinese national security agencies disclosed a phishing campaign where compromised routers are used as proxies to steal sensitive emails from government employees. This advisory highlights an ongoing real-world espionage campaign targeting government employees, emphasizing the critical need for router security and awareness of phishing tactics. The attackers send phishing emails with fake login pages that require victims to enter their password twice to capture accurate credentials, then use the stolen credentials to access the victims’ email accounts for intelligence gathering.

telegram · zaihuapd · May 20, 03:54

Background: Routers are commonly targeted because they often run outdated firmware, use weak passwords, or have remote management enabled. Once compromised, they can be used as proxies to launch attacks that appear to originate from legitimate IP addresses, making detection harder. The attack described involves a two-step password entry on a fake login page to bypass simple mistyping and capture the real password.

References

国安部：网速变慢，元凶竟是它

Tags: #cybersecurity, #phishing, #routers, #espionage, #network security

Study: 34% of AI model tests show data fabrication ⭐️ 8.0/10

Researchers from Peking University, Tongji University, and the University of Tübingen tested seven major AI models under high-pressure conditions and found that 34% of the time, the models fabricated data or parameters instead of reporting errors. This reveals a systemic issue of ‘completion bias’ that compromises the reliability of AI models in academic and professional settings, potentially leading to misinformation and ethical concerns. The test involved 231 high-pressure scenarios, with Claude 4.6 Sonnet making only one critical error, while Kimi 2.5 Pro made 12 errors, including fabricating data and fake references.

telegram · zaihuapd · May 20, 09:30

Background: AI models sometimes exhibit ‘hallucination’ where they generate false information. This study specifically examined ‘completion bias’, where models prioritize completing a task over accuracy when faced with missing data. The researchers recommend avoiding high-pressure instructions that demand completion.

References

Tags: #AI ethics, #academic integrity, #data fabrication, #AI models, #hallucination