{"id":904,"date":"2026-05-14T04:41:13","date_gmt":"2026-05-13T20:41:13","guid":{"rendered":"https:\/\/connectword.dpdns.org\/?p=904"},"modified":"2026-05-14T04:41:13","modified_gmt":"2026-05-13T20:41:13","slug":"fastino-labs-open-sources-gliguard-a-300m-parameter-safety-moderation-model-that-matches-or-exceeds-accuracy-of-models-23-90x-its-size","status":"publish","type":"post","link":"https:\/\/connectword.dpdns.org\/?p=904","title":{"rendered":"Fastino Labs Open-Sources GLiGuard: A 300M Parameter Safety Moderation Model That Matches or Exceeds Accuracy of Models 23\u201390x Its Size"},"content":{"rendered":"<p>As LLM-powered applications move into production \u2014 and as AI agents take on more consequential tasks like browsing the web, writing and executing code, and interacting with external services \u2014 safety moderation has quietly become one of the most operationally expensive parts of the stack.<\/p>\n<p>Most developers who\u2019ve deployed a production LLM system know the problem: you need to evaluate every user prompt before it reaches the model, and every model response before it reaches the user. That means your guardrail model runs on every single request, at every turn of a conversation. The guardrail latency compounds. The cost compounds. And the current generation of open-source guardrail models \u2014 LlamaGuard4 (12B), WildGuard (7B), ShieldGemma (27B), NemoGuard (8B) \u2014 are all decoder-only models with billions of parameters, built for flexibility but not for speed.<\/p>\n<p>Fastino Labs released <a href=\"https:\/\/arxiv.org\/abs\/2605.07982\">GLiGuard<\/a>, a 300 million parameter open-source safety moderation model designed to address this specific problem. GLiGuard evaluates multiple safety dimensions in a single pass, and across nine safety benchmarks, its accuracy matches or exceeds models that are 23 to 90 times its size while running up to 16 times faster.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1064\" height=\"924\" data-attachment-id=\"79802\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/13\/fastino-labs-open-sources-gliguard-a-300m-parameter-safety-moderation-model-that-matches-or-exceeds-accuracy-of-models-23-90x-its-size\/screenshot-2026-05-13-at-1-38-03-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.38.03-PM-1.png\" data-orig-size=\"1064,924\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-05-13 at 1.38.03\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.38.03-PM-1-1024x889.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.38.03-PM-1.png\" alt=\"\" class=\"wp-image-79802\" \/><figcaption class=\"wp-element-caption\">https:\/\/pioneer.ai\/blog\/gliguard-16x-faster-safety-moderation-with-a-small-language-model<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Why Decoder LLMs May Not Be the Right Tool for Safety Moderation<\/strong><\/h2>\n<p>To understand what makes GLiGuard different, it helps to understand why existing guardrail models are slow. Most major guardrail models are built on decoder-only transformer architectures, they generate their safety verdicts autoregressively, one token at a time \u2014 the same way a large language model generates a response to a chat message.<\/p>\n<p>This design made sense when safety requirements were fluid. Decoder models can interpret natural language task descriptions and adapt to new safety policies without retraining. But autoregressive generation is inherently sequential, which makes it slow and computationally expensive.<\/p>\n<p>There\u2019s a compounding problem on top of that. Most guardrail models need to assess inputs across multiple safety dimensions: what type of harm is present, whether the user prompt is attempting to bypass safety training, whether the model\u2019s response is itself unsafe, and so on. Because decoder models generate output sequentially, these assessments are typically produced one after another, and latency compounds as more criteria are evaluated.<\/p>\n<p>In other words, the architecture that makes decoder models flexible is also the architecture that makes them the wrong tool for what is fundamentally a classification problem.<\/p>\n<h2 class=\"wp-block-heading\"><strong>What GLiGuard Actually Does<\/strong><\/h2>\n<p>GLiGuard is a small encoder-based model that reframes safety moderation as a text classification problem rather than a text generation problem. Encoder models process the entire input at once and output a single classification label for a set of fixed labels, whereas decoder models generate their output one token at a time, left to right.<\/p>\n<p>The key architectural insight is in how GLiGuard handles multiple tasks simultaneously. Instead of generating tokens, GLiGuard encodes both the input text and task definitions (labels) together. These are then fed to the model, which scores every label simultaneously in a single forward pass and returns the highest-scoring label for each task. Because all tasks and their candidate labels are part of the input itself, evaluating additional safety dimensions doesn\u2019t add latency; it simply means including more labels in the input.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"972\" height=\"528\" data-attachment-id=\"79803\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/13\/fastino-labs-open-sources-gliguard-a-300m-parameter-safety-moderation-model-that-matches-or-exceeds-accuracy-of-models-23-90x-its-size\/screenshot-2026-05-13-at-1-39-04-pm\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.39.04-PM.png\" data-orig-size=\"972,528\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-05-13 at 1.39.04\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.39.04-PM.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.39.04-PM.png\" alt=\"\" class=\"wp-image-79803\" \/><figcaption class=\"wp-element-caption\">https:\/\/pioneer.ai\/blog\/gliguard-16x-faster-safety-moderation-with-a-small-language-model<\/figcaption><\/figure>\n<\/div>\n<p><strong>GLiGuard runs four moderation tasks concurrently in one forward pass:<\/strong><\/p>\n<ol class=\"wp-block-list\">\n<li><strong>Safety classification<\/strong> (safe \/ unsafe) \u2014 applied to both user prompts before generation and model responses after generation.<\/li>\n<li><strong>Jailbreak strategy detection<\/strong> across 11 strategies, including prompt injection, roleplay bypass, instruction override, and social engineering. If any jailbreak strategy is detected, the prompt is automatically flagged as unsafe.<\/li>\n<li><strong>Harm category detection<\/strong> across 14 categories \u2014 violence, sexual content, hate speech, PII exposure, misinformation, child safety, copyright violation, and others. A single input can trigger multiple categories at once.<\/li>\n<li><strong>Refusal detection<\/strong> (compliance \/ refusal), tracked separately to help measure over-refusal (when a model refuses safe requests) and detect false compliance (when a model appears to comply but doesn\u2019t). If a refusal is detected, the response is automatically marked as safe.<\/li>\n<\/ol>\n<h2 class=\"wp-block-heading\"><strong>Training Data and Fine-Tuning<\/strong><\/h2>\n<p>GLiGuard was trained on a mixture of human-annotated and synthetically generated training data. For prompt safety, response safety, and refusal detection, the team used WildGuardTrain, a dataset of 87,000 human-annotated examples. For harm category and jailbreak strategy detection, labels for the unsafe samples were generated using GPT-4.1.<\/p>\n<p>During early training, the model struggled to distinguish between similar harm categories like toxic speech and violence, so the team used Pioneer to generate supplemental synthetic data with edge cases targeting these fine-grained distinctions.<\/p>\n<p>On the architecture side, GLiGuard was trained via full fine-tuning of the GLiNER2-base-v1 checkpoint for 20 epochs using the AdamW optimizer. GLiNER2 is Fastino\u2019s own architecture for multi-task text classification \u2014 a natural starting point for a model designed to score multiple label sets in one pass.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1708\" height=\"608\" data-attachment-id=\"79806\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/13\/fastino-labs-open-sources-gliguard-a-300m-parameter-safety-moderation-model-that-matches-or-exceeds-accuracy-of-models-23-90x-its-size\/screenshot-2026-05-13-at-1-39-37-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.39.37-PM-1.png\" data-orig-size=\"1708,608\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-05-13 at 1.39.37\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.39.37-PM-1-1024x365.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.39.37-PM-1.png\" alt=\"\" class=\"wp-image-79806\" \/><figcaption class=\"wp-element-caption\">https:\/\/pioneer.ai\/blog\/gliguard-16x-faster-safety-moderation-with-a-small-language-model<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Benchmark Results: Accuracy and Speed<\/strong><\/h2>\n<p>The research team evaluated GLiGuard across nine established safety benchmarks. These benchmarks cover both prompt and response classification, testing whether a model can identify harmful content, withstand adversarial attacks, distinguish between different types of harm, and avoid over-flagging safe content. Results use macro-averaged F1, a standard metric that balances precision and recall.<\/p>\n<p><strong>On accuracy:<\/strong><\/p>\n<ul class=\"wp-block-list\">\n<li>GLiGuard scores 87.7 average F1 on prompt classification, within 1.7 points of the best model (PolyGuard-Qwen at 89.4).<\/li>\n<li>It achieves the second-highest average F1 on response classification (82.7), behind only Qwen3Guard-8B (84.1).<\/li>\n<li>It outperforms LlamaGuard4-12B, ShieldGemma-27B, and NemoGuard-8B despite being 23\u201390\u00d7 smaller.<\/li>\n<\/ul>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"401\" data-attachment-id=\"79808\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/13\/fastino-labs-open-sources-gliguard-a-300m-parameter-safety-moderation-model-that-matches-or-exceeds-accuracy-of-models-23-90x-its-size\/screenshot-2026-05-13-at-1-40-22-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.40.22-PM-1.png\" data-orig-size=\"1496,586\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-05-13 at 1.40.22\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.40.22-PM-1-1024x401.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-13-at-1.40.22-PM-1-1024x401.png\" alt=\"\" class=\"wp-image-79808\" \/><figcaption class=\"wp-element-caption\">https:\/\/pioneer.ai\/blog\/gliguard-16x-faster-safety-moderation-with-a-small-language-model<\/figcaption><\/figure>\n<\/div>\n<p><strong>On throughput and latency, benchmarked on a single NVIDIA A100 GPU:<\/strong><\/p>\n<ul class=\"wp-block-list\">\n<li>GLiGuard achieves up to 16.2\u00d7 higher throughput (133 vs. 8.2 samples\/s at batch size 4).<\/li>\n<li>GLiGuard achieves up to 16.6\u00d7 lower latency: 26 ms vs. 426 ms at sequence length 64.<\/li>\n<\/ul>\n<p>These are not marginal improvements. At 26 ms per request versus 426 ms, the difference is meaningful in any real-time user-facing application, and the compounding effect across a multi-turn conversation makes the gap even larger in practice.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Marktechpost\u2019s Visual Explainer<\/strong><\/h2>\n<div>\n<div class=\"gg-header\">\n<div class=\"gg-brand\">\n      <span class=\"gg-logo-dot\"><\/span><br \/>\n      <span class=\"gg-brand-name\">GLiGuard \u2014 Fastino Labs<\/span>\n    <\/div>\n<p>    <span class=\"gg-slide-counter\">1 \/ 6<\/span>\n  <\/p><\/div>\n<div class=\"gg-slides-wrap\">\n<div>\n<div class=\"gg-progress\"><\/div>\n<\/div>\n<div class=\"gg-slides\">\n<p>      <!-- Slide 1: What is GLiGuard --><\/p>\n<div class=\"gg-slide\">\n<div class=\"gg-slide-num\">01 \u2014 Overview<\/div>\n<div class=\"gg-slide-title\">What is <span>GLiGuard<\/span>?<\/div>\n<div class=\"gg-slide-body\">\n          GLiGuard is an open-source <strong>300M parameter safety moderation model<\/strong> released by Fastino Labs on May 12, 2026. It is designed to act as a guardrail layer between users and LLMs \u2014 screening every user prompt before it reaches the model and every model response before it reaches the user.\n        <\/div>\n<div class=\"gg-divider\"><\/div>\n<div class=\"gg-stat-row\">\n<div class=\"gg-stat\">\n<div class=\"gg-stat-val\">300M<\/div>\n<div class=\"gg-stat-label\">Parameters \u2014 runs on a single GPU<\/div>\n<\/div>\n<div class=\"gg-stat\">\n<div class=\"gg-stat-val\">16x<\/div>\n<div class=\"gg-stat-label\">Faster throughput vs. SOTA decoder guardrails<\/div>\n<\/div>\n<div class=\"gg-stat\">\n<div class=\"gg-stat-val\">4<\/div>\n<div class=\"gg-stat-label\">Safety tasks evaluated in a single forward pass<\/div>\n<\/div>\n<\/div>\n<div class=\"gg-divider\"><\/div>\n<div class=\"gg-tag-row\">\n          <span class=\"gg-tag gg-tag-hi\">Apache 2.0<\/span><br \/>\n          <span class=\"gg-tag\">Hugging Face<\/span><br \/>\n          <span class=\"gg-tag\">Pioneer Inference<\/span><br \/>\n          <span class=\"gg-tag\">Encoder Architecture<\/span>\n        <\/div>\n<\/div>\n<p>      <!-- Slide 2: The Problem --><\/p>\n<div class=\"gg-slide\">\n<div class=\"gg-slide-num\">02 \u2014 The Problem<\/div>\n<div class=\"gg-slide-title\">Why Existing <span>Guardrails<\/span> Are Slow<\/div>\n<div class=\"gg-slide-body\">\n          Most production guardrail models \u2014 LlamaGuard4, WildGuard, ShieldGemma, NemoGuard \u2014 are built on <strong>decoder-only transformer architectures<\/strong>. They generate safety verdicts autoregressively, one token at a time, the same way a large language model generates a chat response.\n        <\/div>\n<div class=\"gg-divider\"><\/div>\n<div class=\"gg-compare\">\n<div class=\"gg-compare-col\">\n<div class=\"gg-col-head\">Decoder Guard Models<\/div>\n<div class=\"gg-col-item\">Generate verdicts <strong>token by token<\/strong><\/div>\n<div class=\"gg-col-item\">Sequential output \u2014 <strong>latency compounds<\/strong> per task<\/div>\n<div class=\"gg-col-item\">7B \u2014 27B parameters required<\/div>\n<div class=\"gg-col-item\">Expensive to run at real-time scale<\/div>\n<div class=\"gg-col-item\">Separate passes per safety dimension<\/div>\n<\/div>\n<div class=\"gg-compare-col gg-col-good\">\n<div class=\"gg-col-head gg-good\">GLiGuard (Encoder)<\/div>\n<div class=\"gg-col-item\">Processes entire input <strong>at once<\/strong><\/div>\n<div class=\"gg-col-item\">All tasks evaluated in <strong>one forward pass<\/strong><\/div>\n<div class=\"gg-col-item\">300M parameters<\/div>\n<div class=\"gg-col-item\">Single GPU deployment<\/div>\n<div class=\"gg-col-item\">More dimensions = no added latency<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- Slide 3: How It Works --><\/p>\n<div class=\"gg-slide\">\n<div class=\"gg-slide-num\">03 \u2014 Architecture<\/div>\n<div class=\"gg-slide-title\">Single Pass. <span>Multiple Tasks.<\/span><\/div>\n<div class=\"gg-slide-body\">\n          GLiGuard reframes safety moderation as a <strong>text classification problem<\/strong>, not a text generation problem. It encodes the input text and all task definitions (labels) together, then scores every label simultaneously in one forward pass. Adding more safety dimensions does not increase latency \u2014 it simply means more labels in the input.\n        <\/div>\n<div class=\"gg-divider\"><\/div>\n<div class=\"gg-slide-body\">\n          <strong>Base model:<\/strong> Fine-tuned from the <strong>GLiNER2-base-v1<\/strong> checkpoint using full fine-tuning for 20 epochs with the AdamW optimizer. Training data: <strong>87,000 human-annotated examples<\/strong> from WildGuardTrain, plus synthetic edge-case data generated via GPT-4.1 and Pioneer for fine-grained harm category distinctions.\n        <\/div>\n<\/div>\n<p>      <!-- Slide 4: 4 Tasks --><\/p>\n<div class=\"gg-slide\">\n<div class=\"gg-slide-num\">04 \u2014 Capabilities<\/div>\n<div class=\"gg-slide-title\">4 Moderation Tasks in <span>One Pass<\/span><\/div>\n<div class=\"gg-tasks\">\n<div class=\"gg-task\">\n<div class=\"gg-task-num\">01<\/div>\n<div class=\"gg-task-content\">\n<div class=\"gg-task-title\">Safety Classification \u2014 safe \/ unsafe<\/div>\n<div class=\"gg-task-desc\">Applied to both user prompts before generation and model responses after generation.<\/div>\n<\/div>\n<\/div>\n<div class=\"gg-task\">\n<div class=\"gg-task-num\">02<\/div>\n<div class=\"gg-task-content\">\n<div class=\"gg-task-title\">Jailbreak Strategy Detection \u2014 11 strategies<\/div>\n<div class=\"gg-task-desc\">Detects prompt injection, roleplay bypass, instruction override, social engineering, and others. Any detected strategy auto-flags the prompt as unsafe.<\/div>\n<\/div>\n<\/div>\n<div class=\"gg-task\">\n<div class=\"gg-task-num\">03<\/div>\n<div class=\"gg-task-content\">\n<div class=\"gg-task-title\">Harm Category Detection \u2014 14 categories<\/div>\n<div class=\"gg-task-desc\">Violence, sexual content, hate speech, PII exposure, misinformation, child safety, copyright violation, and others. A single input can trigger multiple categories.<\/div>\n<\/div>\n<\/div>\n<div class=\"gg-task\">\n<div class=\"gg-task-num\">04<\/div>\n<div class=\"gg-task-content\">\n<div class=\"gg-task-title\">Refusal Detection \u2014 compliance \/ refusal<\/div>\n<div class=\"gg-task-desc\">Tracks over-refusal (refusing safe requests) and false compliance. A detected refusal auto-marks the response as safe.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- Slide 5: Benchmarks --><\/p>\n<div class=\"gg-slide\">\n<div class=\"gg-slide-num\">05 \u2014 Benchmarks<\/div>\n<div class=\"gg-slide-title\">Accuracy vs. <span>Much Larger Models<\/span><\/div>\n<div class=\"gg-slide-body\">Evaluated across 9 safety benchmarks using macro-averaged F1. Speed benchmarked on a single NVIDIA A100 GPU.<\/div>\n<div>\n<div>Prompt Classification \u2014 Avg. F1<\/div>\n<div class=\"gg-bench-row\">\n<div class=\"gg-bench-name\"><strong>GLiGuard (0.3B)<\/strong><\/div>\n<div class=\"gg-bench-bar-wrap\">\n<div class=\"gg-bench-bar gg-bar-hi\"><\/div>\n<\/div>\n<div class=\"gg-bench-score gg-score-hi\">87.7<\/div>\n<\/div>\n<div class=\"gg-bench-row\">\n<div class=\"gg-bench-name\">PolyGuard-Qwen (7B)<\/div>\n<div class=\"gg-bench-bar-wrap\">\n<div class=\"gg-bench-bar\"><\/div>\n<\/div>\n<div class=\"gg-bench-score\">89.4<\/div>\n<\/div>\n<div class=\"gg-bench-row\">\n<div class=\"gg-bench-name\">LlamaGuard4 (12B)<\/div>\n<div class=\"gg-bench-bar-wrap\">\n<div class=\"gg-bench-bar\"><\/div>\n<\/div>\n<div class=\"gg-bench-score\">\u2014<\/div>\n<\/div>\n<div class=\"gg-bench-row\">\n<div class=\"gg-bench-name\">ShieldGemma (27B)<\/div>\n<div class=\"gg-bench-bar-wrap\">\n<div class=\"gg-bench-bar\"><\/div>\n<\/div>\n<div class=\"gg-bench-score\">\u2014<\/div>\n<\/div>\n<\/div>\n<div class=\"gg-stat-row\">\n<div class=\"gg-stat\">\n<div class=\"gg-stat-val\">26ms<\/div>\n<div class=\"gg-stat-label\">Latency at seq. length 64 (vs. 426ms for ShieldGemma-27B)<\/div>\n<\/div>\n<div class=\"gg-stat\">\n<div class=\"gg-stat-val\">133<\/div>\n<div class=\"gg-stat-label\">Samples\/sec throughput at batch size 4<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- Slide 6: Get Started --><\/p>\n<div class=\"gg-slide\">\n<div class=\"gg-slide-num\">06 \u2014 Get Started<\/div>\n<div class=\"gg-slide-title\">Deploy <span>GLiGuard<\/span> Today<\/div>\n<div class=\"gg-slide-body\">\n          At 300M parameters, GLiGuard runs on a <strong>single GPU<\/strong> and can be fine-tuned for domain-specific use cases without heavy infrastructure. Weights are available on Hugging Face under the <strong>Apache 2.0 license<\/strong>. Managed inference is available on Pioneer.\n        <\/div>\n<div class=\"gg-divider\"><\/div>\n<div>\n<div>Model ID<\/div>\n<p>          <code>fastino\/gliguard-LLMGuardrails-300M<\/code>\n        <\/p><\/div>\n<div class=\"gg-link-row\">\n          <a class=\"gg-link\" href=\"https:\/\/huggingface.co\/fastino\/gliguard-LLMGuardrails-300M\" target=\"_blank\">\ud83e\udd17 Hugging Face<\/a><br \/>\n          <a class=\"gg-link\" href=\"https:\/\/arxiv.org\/abs\/2605.07982\" target=\"_blank\">\ud83d\udcc4 arXiv Paper<\/a><br \/>\n          <a class=\"gg-link\" href=\"https:\/\/pioneer.ai\/\" target=\"_blank\">\u26a1 Pioneer Inference<\/a>\n        <\/div>\n<div class=\"gg-divider\"><\/div>\n<div class=\"gg-tag-row\">\n          <span class=\"gg-tag\">Prompt Safety<\/span><br \/>\n          <span class=\"gg-tag\">Response Safety<\/span><br \/>\n          <span class=\"gg-tag\">Jailbreak Detection<\/span><br \/>\n          <span class=\"gg-tag\">Harm Classification<\/span><br \/>\n          <span class=\"gg-tag\">Refusal Detection<\/span><br \/>\n          <span class=\"gg-tag gg-tag-hi\">Single GPU<\/span>\n        <\/div>\n<\/div>\n<\/div>\n<\/div>\n<div>\n    <span>Designed &amp; Created by <a href=\"https:\/\/marktechpost.com\/\" target=\"_blank\">MarktechPost.com<\/a><\/span>\n  <\/div>\n<div class=\"gg-nav\">\n<div class=\"gg-dots\"><\/div>\n<div class=\"gg-btn-row\">\n      <button class=\"gg-btn\" disabled>\u2190 Prev<\/button><br \/>\n      <button class=\"gg-btn gg-btn-next\">Next \u2192<\/button>\n    <\/div>\n<\/div>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h2>\n<ul class=\"wp-block-list\">\n<li>GLiGuard is a 300M parameter encoder-based safety moderation model that handles four tasks \u2014 safety classification, jailbreak detection, harm categorization, and refusal detection \u2014 in a single forward pass.<\/li>\n<li>Unlike decoder-only guardrail models that generate verdicts autoregressively, GLiGuard reframes safety moderation as a text classification problem, eliminating the sequential latency bottleneck.<\/li>\n<li>Benchmarked on a single NVIDIA A100 GPU, GLiGuard achieves up to 16.2\u00d7 higher throughput and 16.6\u00d7 lower latency (26 ms vs. 426 ms) compared to current SOTA models like ShieldGemma-27B.<\/li>\n<li>Across nine safety benchmarks, GLiGuard scores 87.7 average F1 on prompt classification and 82.7 on response classification \u2014 outperforming LlamaGuard4-12B, ShieldGemma-27B, and NemoGuard-8B despite being 23\u201390\u00d7 smaller.<\/li>\n<li>Model weights are available under Apache 2.0 on Hugging Face (<code>fastino\/gliguard-LLMGuardrails-300M<\/code>), making it deployable on a single GPU without heavy infrastructure.<\/li>\n<\/ul>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out\u00a0the <strong><a href=\"https:\/\/arxiv.org\/pdf\/2605.07982\" target=\"_blank\" rel=\"noreferrer noopener\">Paper<\/a><\/strong>, <strong><a href=\"https:\/\/huggingface.co\/fastino\/gliguard-LLMGuardrails-300M\" target=\"_blank\" rel=\"noreferrer noopener\">Model Weights on HF<\/a><\/strong>, <strong><a href=\"https:\/\/github.com\/fastino-ai\/GLiGuard\" target=\"_blank\" rel=\"noreferrer noopener\">GitHub Repo<\/a><\/strong> and\u00a0<strong><a href=\"https:\/\/gliner.ai\/\" target=\"_blank\" rel=\"noreferrer noopener\">Technical details<\/a>.\u00a0<\/strong>Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">150k+ ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>. Wait! are you on telegram?\u00a0<strong><a href=\"https:\/\/t.me\/machinelearningresearchnews\" target=\"_blank\" rel=\"noreferrer noopener\">now you can join us on telegram as well.<\/a><\/strong><\/p>\n<p>Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.?\u00a0<strong><a href=\"https:\/\/forms.gle\/MTNLpmJtsFA3VRVd9\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Connect with us<\/mark><\/a><\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/05\/13\/fastino-labs-open-sources-gliguard-a-300m-parameter-safety-moderation-model-that-matches-or-exceeds-accuracy-of-models-23-90x-its-size\/\">Fastino Labs Open-Sources GLiGuard: A 300M Parameter Safety Moderation Model That Matches or Exceeds Accuracy of Models 23\u201390x Its Size<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>As LLM-powered applications mo&hellip;<\/p>\n","protected":false},"author":1,"featured_media":905,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-904","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/904","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=904"}],"version-history":[{"count":0,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/904\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/media\/905"}],"wp:attachment":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=904"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=904"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=904"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}