{"id":998,"date":"2026-05-29T07:29:49","date_gmt":"2026-05-28T23:29:49","guid":{"rendered":"https:\/\/connectword.dpdns.org\/?p=998"},"modified":"2026-05-29T07:29:49","modified_gmt":"2026-05-28T23:29:49","slug":"liquid-ai-releases-lfm2-5-8b-a1b-an-on-device-moe-model-with-8-3b-total-and-1-5b-active-parameters","status":"publish","type":"post","link":"https:\/\/connectword.dpdns.org\/?p=998","title":{"rendered":"Liquid AI Releases LFM2.5-8B-A1B: An On-Device MoE Model With 8.3B Total and 1.5B Active Parameters"},"content":{"rendered":"<p class=\"wp-block-paragraph\">Liquid AI just shipped <strong>LFM2.5-8B-A1B<\/strong>. It is an on-device Mixture-of-Experts (MoE) model built for tool calling. The model holds 8.3B total parameters but activates only 1.5B per token. That sparsity is what lets it run on consumer hardware.<\/p>\n<p class=\"wp-block-paragraph\">The release follows LFM2-8B-A1B, which Liquid AI team published earlier. LFM2.5 is a new family of hybrid models for on-device deployment. This version adds a 128K context window, reasoning, and scaled-up training. <\/p>\n<h2 class=\"wp-block-heading\"><strong>What is LFM2.5-8B-A1B <\/strong><\/h2>\n<p class=\"wp-block-paragraph\">The model uses a sparse MoE design. It activates 1.5B of 8.3B total parameters per forward pass. That keeps each generated token cheap to compute.<\/p>\n<p class=\"wp-block-paragraph\">The architecture has 24 layers. Eighteen are double-gated LIV convolution blocks; six are GQA layers. It combines MoE, GQA, and gated short convolution blocks. The context length is 131,072 tokens. The model covers nine languages, including Arabic, Chinese, and Japanese.<\/p>\n<p class=\"wp-block-paragraph\">Liquid AI team recommends a temperature of 0.2, top_k of 80, and repetition_penalty of 1.05.<\/p>\n<p class=\"wp-block-paragraph\">Unlike its predecessor, LFM2.5-8B-A1B is a reasoning-only model. It produces an explicit chain of thought before its final answer. Liquid AI team chose this because MoE models run in compute-bound settings. A smaller active parameter count makes each reasoning token inexpensive.<\/p>\n<h2 class=\"wp-block-heading\"><strong>What Changed Since LFM2-8B-A1B<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">Liquid expanded the context window from 32,768 to 128,000 tokens. Pretraining scaled from 12T to 38T tokens. The vocabulary doubled from 65,536 to 128,000 tokens.<\/p>\n<p class=\"wp-block-paragraph\">The larger vocabulary tokenizes non-Latin scripts more efficiently. Liquid AI team reports the strongest compression gains in Hindi, Thai, Vietnamese, Indonesian, and Arabic. The rest of the architecture stays the same as LFM2-8B-A1B.<\/p>\n<h2 class=\"wp-block-heading\"><strong>How Liquid AI Trained It<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">Liquid AI team extended the tokenizer in place rather than retraining from scratch. It continued BPE merge training from the original merges on a multilingual corpus. New embedding rows initialize as the mean of their sub-token decompositions. A brief two-stage adaptation then recovers quality.<\/p>\n<p class=\"wp-block-paragraph\">Context extension came in two phases. A 2T token midtraining phase reached 32K, focused on reasoning, math, and tool use. Raising the RoPE base \u03b8, plus a 400B token stage, reached 128K.<\/p>\n<p class=\"wp-block-paragraph\">Two reinforcement learning stages target known failure modes. A preference optimization stage reduces \u2018doom loops\u2019 in long reasoning traces. It redistributes probability mass toward plausible alternatives. A separate RL shaping reward discourages loop-inducing restart words like \u2018Wait\u2026\u2019. Another RL stage uses an avg@k-based reward to cut hallucinations. The goal is abstention on queries beyond reliable knowledge.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1592\" height=\"1068\" data-attachment-id=\"80169\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/28\/liquid-ai-releases-lfm2-5-8b-a1b-an-on-device-moe-model-with-8-3b-total-and-1-5b-active-parameters\/screenshot-2026-05-28-at-4-29-30-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-28-at-4.29.30-PM-1.png\" data-orig-size=\"1592,1068\" data-comments-opened=\"0\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;,&quot;alt&quot;:&quot;&quot;}\" data-image-title=\"Screenshot 2026-05-28 at 4.29.30\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-28-at-4.29.30-PM-1-1024x687.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-28-at-4.29.30-PM-1.png\" alt=\"\" class=\"wp-image-80169\" \/><figcaption class=\"wp-element-caption\">https:\/\/www.liquid.ai\/blog\/lfm2-5-8b-a1b<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>The Benchmark Case<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">LFM2.5-8B-A1B improves over its predecessor across the board. The AA-Omniscience Non-Hallucination Rate jumped from 7.46 to 63.47. IFEval rose from 79.44 to 91.84. MATH500 climbed from 74.80 to 88.76. Tau\u00b2 Telecom rose from 13.60 to 88.07.<\/p>\n<p class=\"wp-block-paragraph\">Liquid AI team compared the model against dense and MoE alternatives. On instruction following, it matches Gemma-4-26B-A4B-IT on IFEval. It does so at a fraction of the active parameter count. On Tau\u00b2 Telecom, it scores 88.07, ahead of much larger models.<\/p>\n<p class=\"wp-block-paragraph\">The avg@k reward drives a much lower hallucination rate. Accuracy stays reasonable for the model\u2019s size. On agentic benchmarks, it remains competitive with bigger models.<\/p>\n<figure class=\"wp-block-table\">\n<table class=\"has-fixed-layout\">\n<thead>\n<tr>\n<th>Benchmark<\/th>\n<th>LFM2-8B-A1B<\/th>\n<th>LFM2.5-8B-A1B<\/th>\n<th>\u0394<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>AA-Omniscience Non-Hallucination Rate<\/td>\n<td>7.46<\/td>\n<td>63.47<\/td>\n<td>+56.01<\/td>\n<\/tr>\n<tr>\n<td>IFEval<\/td>\n<td>79.44<\/td>\n<td>91.84<\/td>\n<td>+12.40<\/td>\n<\/tr>\n<tr>\n<td>MATH500<\/td>\n<td>74.80<\/td>\n<td>88.76<\/td>\n<td>+13.96<\/td>\n<\/tr>\n<tr>\n<td>Tau\u00b2 Telecom<\/td>\n<td>13.60<\/td>\n<td>88.07<\/td>\n<td>+74.47<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/figure>\n<h2 class=\"wp-block-heading\"><strong>Running It: CPU, GPU, and Tooling<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">The model ships with day-one support across the inference ecosystem. Frameworks include llama.cpp, MLX, vLLM, and SGLang. ONNX and Liquid\u2019s LEAP edge platform are also supported.<\/p>\n<p class=\"wp-block-paragraph\">On CPU, it decodes 253 tokens\/s on an M5 Max. It reaches 146 tokens\/s on a Ryzen AI Max+ 395. It stays under 6 GB of memory throughout. On a phone, it holds about 30 tokens\/s.<\/p>\n<p class=\"wp-block-paragraph\">On a single NVIDIA H100 SXM5, output throughput hits 18.5K tokens per second. That is over 1.6B tokens per day at high concurrency.<\/p>\n<p class=\"wp-block-paragraph\">For tool use, LFM2.5 writes Pythonic function calls by default. They appear between the <code>&lt;|tool_call_start|&gt;<\/code> and <code>&lt;|tool_call_end|&gt;<\/code> special tokens. You can override this to JSON in the system prompt.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Strengths <\/strong><strong>and What to Watch<\/strong><\/h2>\n<h4 class=\"wp-block-heading\"><strong>Strengths:<\/strong><\/h4>\n<ul class=\"wp-block-list\">\n<li>Activates only 1.5B parameters, keeping inference cheap on edge hardware<\/li>\n<li>Competitive instruction-following and agentic scores for its size class<\/li>\n<li>128K context window and nine-language coverage<\/li>\n<li>Open-weight under the LFM1.0 license, with base and post-trained checkpoints<\/li>\n<\/ul>\n<h4 class=\"wp-block-heading\"><strong><\/strong><strong>What to Watch<\/strong>:<\/h4>\n<ul class=\"wp-block-list\">\n<li>Limited knowledge capacity from the small active parameter count<\/li>\n<li>Not a fit for heavy programming or knowledge-intensive QA without retrieval<\/li>\n<li>Reasoning-only output adds chain-of-thought tokens to every turn<\/li>\n<li>Text-only; this variant has no vision or audio input<\/li>\n<\/ul>\n<h2 class=\"wp-block-heading\"><strong>Marktechpost\u2019s Visual Explainer<\/strong><\/h2>\n<p><!-- ============================================================\n     LFM2.5-8B-A1B \u2014 WordPress Embeddable Slider Guide\n     Palette: Liquid AI (ink #0A0B0D \/ cream #ECEAE3 \/ teal #20D5C4)\n     Paste into a WordPress \"Custom HTML\" block. Self-contained.\n     ============================================================ --><\/p>\n<div role=\"region\" aria-label=\"LFM2.5-8B-A1B guide\">\n<div class=\"lf-progress\">\n<div class=\"lf-progress-fill\" data-lf=\"fill\"><\/div>\n<\/div>\n<div class=\"lf-stage\">\n<div class=\"lf-track\" data-lf=\"track\">\n<p>      <!-- SLIDE 1 \u2014 COVER --><\/p>\n<section class=\"lf-slide\" aria-label=\"1 of 8\">\n<div class=\"lf-eyebrow\">On-Device Model Guide<\/div>\n<h2 class=\"lf-cover\">LFM2.5-8B-A1B<\/h2>\n<p class=\"lf-lede\">Liquid AI\u2019s on-device Mixture-of-Experts model, built for tool calling and complex instruction following on consumer hardware.<\/p>\n<div class=\"lf-chips\">\n          <span class=\"lf-chip\"><b>8.3B<\/b> total params<\/span><br \/>\n          <span class=\"lf-chip\"><b>1.5B<\/b> active<\/span><br \/>\n          <span class=\"lf-chip\"><b>128K<\/b> context<\/span><br \/>\n          <span class=\"lf-chip\">reasoning\u2011only<\/span><br \/>\n          <span class=\"lf-chip\">open\u2011weight<\/span>\n        <\/div>\n<div class=\"lf-coverfoot\">Released <b>May 28, 2026<\/b> \u00a0\u00b7\u00a0 Liquid AI \u00a0\u00b7\u00a0 LFM1.0 license<\/div>\n<\/section>\n<p>      <!-- SLIDE 2 \u2014 WHAT IT IS --><\/p>\n<section class=\"lf-slide\" aria-label=\"2 of 8\">\n<div class=\"lf-eyebrow\">What It Is<\/div>\n<h2 class=\"lf-title\">A sparse MoE that activates 1.5B of 8.3B parameters per token<\/h2>\n<ul class=\"lf-list\">\n<li><b>24 layers<\/b> \u2014 18 double-gated LIV convolution blocks plus 6 GQA layers.<\/li>\n<li>Combines <b>MoE<\/b>, <b>GQA<\/b>, and <b>gated short convolution<\/b> blocks.<\/li>\n<li>Context length of <span class=\"k\">131,072<\/span> tokens; covers <b>9 languages<\/b>.<\/li>\n<li><b>Reasoning-only:<\/b> produces an explicit chain of thought before answering.<\/li>\n<li>Recommended params: <span class=\"k\">temperature 0.2<\/span>, <span class=\"k\">top_k 80<\/span>, <span class=\"k\">repetition_penalty 1.05<\/span>.<\/li>\n<\/ul>\n<\/section>\n<p>      <!-- SLIDE 3 \u2014 WHAT CHANGED --><\/p>\n<section class=\"lf-slide\" aria-label=\"3 of 8\">\n<div class=\"lf-eyebrow\">What Changed Since LFM2-8B-A1B<\/div>\n<h2 class=\"lf-title\">Bigger context, more training, a wider vocabulary<\/h2>\n<div class=\"lf-grid\">\n<div class=\"lf-card\">\n<div class=\"lf-cap\">Context window<\/div>\n<p class=\"big\">32,768 \u2192 128,000<\/p>\n<p>Processes longer documents and reasons for longer.<\/p>\n<\/div>\n<div class=\"lf-card\">\n<div class=\"lf-cap\">Pretraining tokens<\/div>\n<p class=\"big\">12T \u2192 38T<\/p>\n<p>Scaled-up pretraining plus large-scale RL.<\/p>\n<\/div>\n<div class=\"lf-card\">\n<div class=\"lf-cap\">Vocabulary size<\/div>\n<p class=\"big\">65,536 \u2192 128,000<\/p>\n<p>Tokenizes non-Latin scripts more efficiently.<\/p>\n<\/div>\n<div class=\"lf-card\">\n<div class=\"lf-cap\">Best compression gains<\/div>\n<p class=\"big\">5 languages<\/p>\n<p>Hindi, Thai, Vietnamese, Indonesian, Arabic.<\/p>\n<\/div>\n<\/div>\n<\/section>\n<p>      <!-- SLIDE 4 \u2014 HOW IT WAS TRAINED --><\/p>\n<section class=\"lf-slide\" aria-label=\"4 of 8\">\n<div class=\"lf-eyebrow\">How It Was Trained<\/div>\n<h2 class=\"lf-title\">Tokenizer extension, staged context growth, targeted RL<\/h2>\n<ul class=\"lf-list\">\n<li><b>Tokenizer:<\/b> extended in place, with continued BPE merge training on a multilingual corpus.<\/li>\n<li><b>Context:<\/b> a 2T-token midtraining phase to 32K, then RoPE base \u03b8 plus 400B tokens to 128K.<\/li>\n<li><b>Doom loops:<\/b> preference optimization redistributes probability mass toward plausible alternatives.<\/li>\n<li>A separate RL shaping reward discourages loop-inducing restart words like \u201cWait\u2026\u201d.<\/li>\n<li><b>Hallucinations:<\/b> an avg@k-based RL reward encourages abstention beyond reliable knowledge.<\/li>\n<\/ul>\n<\/section>\n<p>      <!-- SLIDE 5 \u2014 BENCHMARKS --><\/p>\n<section class=\"lf-slide\" aria-label=\"5 of 8\">\n<div class=\"lf-eyebrow\">Benchmarks vs LFM2-8B-A1B<\/div>\n<h2 class=\"lf-title\">Largest gains in non-hallucination and tool use<\/h2>\n<div class=\"lf-tablewrap\">\n<table class=\"lf-tbl\">\n<thead>\n<tr>\n<th>Benchmark<\/th>\n<th>LFM2<\/th>\n<th>LFM2.5<\/th>\n<th>\u0394<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>AA-Omniscience Non-Hallucination Rate<\/td>\n<td class=\"num\">7.46<\/td>\n<td class=\"num\">63.47<\/td>\n<td class=\"delta\">+56.01<\/td>\n<\/tr>\n<tr>\n<td>IFEval<\/td>\n<td class=\"num\">79.44<\/td>\n<td class=\"num\">91.84<\/td>\n<td class=\"delta\">+12.40<\/td>\n<\/tr>\n<tr>\n<td>MATH500<\/td>\n<td class=\"num\">74.80<\/td>\n<td class=\"num\">88.76<\/td>\n<td class=\"delta\">+13.96<\/td>\n<\/tr>\n<tr>\n<td>Tau\u00b2 Telecom<\/td>\n<td class=\"num\">13.60<\/td>\n<td class=\"num\">88.07<\/td>\n<td class=\"delta\">+74.47<\/td>\n<\/tr>\n<\/tbody>\n<\/table><\/div>\n<div class=\"lf-stepline\"><\/div>\n<p class=\"lf-body\">On IFEval it matches Gemma-4-26B-A4B-IT at a fraction of the active parameter count.<\/p>\n<\/section>\n<p>      <!-- SLIDE 6 \u2014 INFERENCE PERFORMANCE --><\/p>\n<section class=\"lf-slide\" aria-label=\"6 of 8\">\n<div class=\"lf-eyebrow\">Inference Performance<\/div>\n<h2 class=\"lf-title\">Fast on CPU and GPU, with day-one framework support<\/h2>\n<div class=\"lf-grid\">\n<div class=\"lf-card\">\n<div class=\"lf-cap\">CPU decode<\/div>\n<p class=\"big\">253 tok\/s<\/p>\n<p>M5 Max, under 6 GB memory. 146 tok\/s on a Ryzen AI Max+ 395.<\/p>\n<\/div>\n<div class=\"lf-card\">\n<div class=\"lf-cap\">On a phone<\/div>\n<p class=\"big\">~30 tok\/s<\/p>\n<p>Runs locally and privately on device.<\/p>\n<\/div>\n<div class=\"lf-card\">\n<div class=\"lf-cap\">GPU throughput<\/div>\n<p class=\"big\">18.5K tok\/s<\/p>\n<p>High concurrency, &gt;1.6B tokens\/day on a single H100.<\/p>\n<\/div>\n<div class=\"lf-card\">\n<div class=\"lf-cap\">Day-one support<\/div>\n<p>llama.cpp, MLX, vLLM, SGLang.<\/p>\n<p>Also ONNX and Liquid\u2019s LEAP.<\/p>\n<\/div>\n<\/div>\n<\/section>\n<p>      <!-- SLIDE 7 \u2014 TOOL USE --><\/p>\n<section class=\"lf-slide\" aria-label=\"7 of 8\">\n<div class=\"lf-eyebrow\">Tool Use &amp; Agents<\/div>\n<h2 class=\"lf-title\">Pythonic function calls, ready for on-device agents<\/h2>\n<ul class=\"lf-list\">\n<li>By default, writes <b>Pythonic function calls<\/b> between <span class=\"k\">&lt;|tool_call_start|&gt;<\/span> and <span class=\"k\">&lt;|tool_call_end|&gt;<\/span> tokens.<\/li>\n<li>You can override this to <b>JSON<\/b> function calls in the system prompt.<\/li>\n<li>The <b>LocalCowork<\/b> demo runs <span class=\"k\">67 tools<\/span> across <span class=\"k\">13 MCP servers<\/span>.<\/li>\n<li>It runs on one laptop \u2014 no cloud, no API keys, no data leaving the machine.<\/li>\n<\/ul>\n<\/section>\n<p>      <!-- SLIDE 8 \u2014 RUN IT --><\/p>\n<section class=\"lf-slide\" aria-label=\"8 of 8\">\n<div class=\"lf-eyebrow\">Run It<\/div>\n<h2 class=\"lf-title\">Serve in two lines, or load directly<\/h2>\n<pre class=\"lf-code\"><code><span class=\"c\"># Serve with vLLM (OpenAI-compatible API)<\/span>\npip install vllm\nvllm serve \"LiquidAI\/LFM2.5-8B-A1B\"\n\n<span class=\"c\"># Or load directly with Transformers<\/span>\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\nmodel_id = \"LiquidAI\/LFM2.5-8B-A1B\"\nmodel = AutoModelForCausalLM.from_pretrained(\n    model_id, device_map=\"auto\", dtype=\"bfloat16\")\ntokenizer = AutoTokenizer.from_pretrained(model_id)<\/code><\/pre>\n<div class=\"lf-mini\">Recommended for<\/div>\n<div class=\"lf-tagrow\">\n          <span class=\"lf-tag good\">Agentic workflows<\/span><br \/>\n          <span class=\"lf-tag good\">Tool use<\/span><br \/>\n          <span class=\"lf-tag good\">Structured outputs<\/span><br \/>\n          <span class=\"lf-tag good\">Multilingual assistants<\/span><br \/>\n          <span class=\"lf-tag good\">On-device assistants<\/span>\n        <\/div>\n<div class=\"lf-mini\">Less suited to<\/div>\n<div class=\"lf-tagrow\">\n          <span class=\"lf-tag\">Heavy programming<\/span><br \/>\n          <span class=\"lf-tag\">Knowledge-intensive QA without retrieval<\/span>\n        <\/div>\n<\/section><\/div>\n<p><!-- \/track -->\n  <\/p><\/div>\n<p><!-- \/stage --><\/p>\n<div class=\"lf-controls\">\n<div class=\"lf-nav\">\n      <button class=\"lf-btn\" data-lf=\"prev\" aria-label=\"Previous slide\">\u2190<\/button><br \/>\n      <button class=\"lf-btn\" data-lf=\"next\" aria-label=\"Next slide\">\u2192<\/button>\n    <\/div>\n<div class=\"lf-dots\" data-lf=\"dots\" role=\"tablist\" aria-label=\"Slide navigation\"><\/div>\n<div class=\"lf-counter\"><b data-lf=\"cur\">01<\/b> \/ <span data-lf=\"total\">08<\/span><\/div>\n<\/div>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h2>\n<ul class=\"wp-block-list\">\n<li>Liquid AI&#8217;s LFM2.5-8B-A1B holds 8.3B total parameters but activates only 1.5B per token.<\/li>\n<li>It is reasoning-only, with a 128K context window and nine-language coverage.<\/li>\n<li>Non-Hallucination Rate jumped from 7.46 to 63.47 over LFM2-8B-A1B; IFEval reached 91.84.<\/li>\n<li>It decodes 253 tok\/s on an M5 Max under 6 GB, and ~30 tok\/s on a phone.<\/li>\n<li>Day-one support spans llama.cpp, MLX, vLLM, and SGLang, with open base and post-trained weights.<\/li>\n<\/ul>\n<p class=\"wp-block-paragraph\">\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<\/p><p class=\"wp-block-paragraph\">\n<\/p><p class=\"wp-block-paragraph\">Check out\u00a0the\u00a0<strong><a href=\"https:\/\/huggingface.co\/LiquidAI\/LFM2.5-8B-A1B\" target=\"_blank\" rel=\"noreferrer noopener\">Model Weights<\/a>\u00a0<\/strong>and<strong>\u00a0<a href=\"https:\/\/www.liquid.ai\/blog\/lfm2-5-8b-a1b\" target=\"_blank\" rel=\"noreferrer noopener\">Technical details<\/a>.\u00a0<\/strong>Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">150k+ ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>. Wait! are you on telegram?\u00a0<strong><a href=\"https:\/\/t.me\/machinelearningresearchnews\" target=\"_blank\" rel=\"noreferrer noopener\">now you can join us on telegram as well.<\/a><\/strong><\/p>\n<p class=\"wp-block-paragraph\">Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.?\u00a0<strong><a href=\"https:\/\/forms.gle\/wbash1wF6efRj8G58\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Connect with us<\/mark><\/a><\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/05\/28\/liquid-ai-releases-lfm2-5-8b-a1b-an-on-device-moe-model-with-8-3b-total-and-1-5b-active-parameters\/\">Liquid AI Releases LFM2.5-8B-A1B: An On-Device MoE Model With 8.3B Total and 1.5B Active Parameters<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Liquid AI just shipped LFM2.5-&hellip;<\/p>\n","protected":false},"author":1,"featured_media":999,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-998","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/998","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=998"}],"version-history":[{"count":0,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/998\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/media\/999"}],"wp:attachment":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=998"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=998"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=998"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}