LongCat-2.0, a large-scale MoE model with 1.6T total and 48B active parameters (longcat.chat)
275 points by benjiro29 1 day ago | 81 comments
HN Discussion:
  • Focus on infrastructure milestone using Huawei Ascend chips instead of Nvidia
  • Testing the model reveals censorship on politically sensitive topics
  • Skepticism about the company's legitimacy, and complaints that the weights cannot be downloaded
  • Model is too large for practical local deployment by most users
  • Compute scale is small compared to Western labs, and the model possibly reuses the DeepSeek architecture
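
To put the local-deployment point in numbers, here is a rough back-of-the-envelope sketch of the memory needed just to hold the weights. The parameter counts come from the title; the storage precisions (bf16, int8, int4) and the helper names are assumptions for illustration, since the thread summary does not say in what format the weights would ship.

```python
# Rough weight-memory estimate for the parameter counts given in the title.
TOTAL_PARAMS = 1.6e12   # 1.6T total parameters (from the title)
ACTIVE_PARAMS = 48e9    # 48B active parameters (from the title)

# Bytes per parameter for common storage precisions.
# Assumption: the actual release format is not stated in the summary.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_bytes(num_params: float, bytes_per_param: float) -> float:
    """Memory needed to hold the weights alone (no KV cache, no activations)."""
    return num_params * bytes_per_param


def footprint_table() -> dict:
    """Map each precision to total weight size in terabytes (1 TB = 1e12 bytes)."""
    return {
        name: weight_bytes(TOTAL_PARAMS, b) / 1e12
        for name, b in BYTES_PER_PARAM.items()
    }


if __name__ == "__main__":
    for name, tb in footprint_table().items():
        print(f"{name}: {tb:.1f} TB of weights")
    # Share of parameters that are active, per the title's two figures.
    print(f"active fraction: {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")
```

Under these assumptions, even a 4-bit copy of all 1.6T parameters is on the order of 0.8 TB, and an MoE model generally needs every expert resident (or quickly reachable) even though only about 3% of the parameters are active at a time.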