{"id":799,"date":"2026-04-27T13:33:44","date_gmt":"2026-04-27T05:33:44","guid":{"rendered":"https:\/\/connectword.dpdns.org\/?p=799"},"modified":"2026-04-27T13:33:44","modified_gmt":"2026-04-27T05:33:44","slug":"the-lora-assumption-that-breaks-in-production","status":"publish","type":"post","link":"https:\/\/connectword.dpdns.org\/?p=799","title":{"rendered":"The LoRA Assumption That Breaks in Production\u00a0"},"content":{"rendered":"<p>LoRA is widely used for fine-tuning large models because it\u2019s efficient, but it quietly assumes that all updates to a model are similar. In reality, they\u2019re not. When you fine-tune for style (like tone, format, or persona), the changes are simple and concentrated in just a few dimensions \u2014 which LoRA handles well with low-rank updates. But when you try to teach the model new factual knowledge (like medical data or statistics), the information is spread across many dimensions. A low-rank setup (like rank-8) can\u2019t capture all of it, so the model may sound correct but give wrong or incomplete answers.<\/p>\n<p>Trying to fix this by increasing the rank introduces another problem: instability. As rank increases, the scaling used in standard LoRA causes the learning signal to weaken, making training ineffective. RS-LoRA solves this by slightly adjusting the scaling formula (changing from dividing by <em>r<\/em> to dividing by \u221ar), which stabilizes learning even at higher ranks. This small change allows the model to better retain complex, high-dimensional information without breaking training.<\/p>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"819\" height=\"361\" data-attachment-id=\"79331\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-464\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-56.png\" data-orig-size=\"819,361\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-56.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-56.png\" alt=\"\" class=\"wp-image-79331\" \/><\/figure>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"815\" height=\"433\" data-attachment-id=\"79327\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-461\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-54.png\" data-orig-size=\"815,433\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-54.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-54.png\" alt=\"\" class=\"wp-image-79327\" \/><\/figure>\n<p>In the code walkthrough below, we demonstrate this failure from first principles using NumPy \u2014 no training loops, no frameworks. We simulate two types of weight updates, measure exactly how much information survives at each rank, and expose the secondary failure: that naively increasing the rank to compensate triggers a scaling collapse that kills the learning signal entirely. We then show the fix \u2014 RS-LoRA\u2019s rank-stabilized scaling \u2014 and why a single character change in the denominator (r \u2192 \u221ar) is what makes high-rank adaptation stable.\u00a0<\/p>\n<h1 class=\"wp-block-heading\">Setting up the dependencies<\/h1>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\" no-line-numbers\"><code class=\" no-wrap language-php\">import numpy as np\nimport matplotlib.pyplot as plt\nimport matplotlib.gridspec as gridspec\n \nnp.random.seed(42)<\/code><\/pre>\n<\/div>\n<\/div>\n<h1 class=\"wp-block-heading\">The Setup \u2014 What are we simulating?<\/h1>\n<p>In this setup, we\u2019re simulating how fine-tuning affects a model\u2019s weight matrix by creating a simplified environment. We assume a pre-trained weight matrix of size 64\u00d764 and introduce two types of updates: low-rank \u201cstyle\u201d changes (like tone or formatting) and high-rank \u201cfact\u201d changes (like detailed cricket statistics). We then define two LoRA configurations \u2014 a small rank (r=4), which represents typical LoRA usage, and a larger rank (r=32), which is more suitable for capturing complex information as in RS-LoRA. This allows us to compare how well different ranks can recover these simulated updates and highlight where standard LoRA struggles.<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\" no-line-numbers\"><code class=\" no-wrap language-php\">d, k = 64, 64          # weight matrix dimensions\nr_low  = 4             # LoRA rank -- small (standard choice)\nr_high = 32            # LoRA rank -- large (RS-LoRA compatible)\n \nprint(f\"Weight matrix shape : ({d} x {k})\")\nprint(f\"Low  rank (standard): r = {r_low}\")\nprint(f\"High rank (RS-LoRA) : r = {r_high}\")\nprint(f\"Max possible rank   : {min(d, k)}\")<\/code><\/pre>\n<\/div>\n<\/div>\n<h1 class=\"wp-block-heading\">Simulate the \u201cTrue\u201d Update Matrices<\/h1>\n<p>Here, we simulate the two fundamentally different types of fine-tuning updates. The style update is intentionally constructed as low-rank: only a few singular values are large and the rest drop off quickly, meaning most of the important information is concentrated in just a handful of dimensions. This mirrors real-world behavior where tone or formatting changes don\u2019t require widespread modification of the model.<\/p>\n<p>In contrast, the fact update is high-rank: the singular values decay slowly, indicating that many dimensions contribute meaningful information. This reflects how factual knowledge (like statistics or domain data) is distributed across the model. The printed singular values make this clear \u2014 style updates show a sharp drop after the first few values, while fact updates remain consistently large across many dimensions, proving they cannot be easily compressed into a low-rank approximation.<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\" no-line-numbers\"><code class=\" no-wrap language-php\">def make_low_rank_delta(d, k, true_rank, noise=0.01):\n    \"\"\"Simulates a style update -- low intrinsic rank.\"\"\"\n    U = np.random.randn(d, true_rank)\n    S = np.linspace(5, 0.5, true_rank)   # fast-decaying singular values\n    V = np.random.randn(k, true_rank)\n    U, _ = np.linalg.qr(U)\n    V, _ = np.linalg.qr(V)\n    delta = (U[:, :true_rank] * S) @ V[:, :true_rank].T\n    delta += noise * np.random.randn(d, k)\n    return delta\n \ndef make_high_rank_delta(d, k, noise=0.01):\n    \"\"\"Simulates a fact\/knowledge update -- high intrinsic rank.\"\"\"\n    U = np.random.randn(d, d)\n    S = np.linspace(3, 0.5, min(d, k))   # slow-decaying -- many dimensions matter\n    V = np.random.randn(k, k)\n    U, _ = np.linalg.qr(U)\n    V, _ = np.linalg.qr(V)\n    delta = (U[:, :min(d,k)] * S) @ V[:, :min(d,k)].T\n    delta += noise * np.random.randn(d, k)\n    return delta\n \ndelta_style = make_low_rank_delta(d, k, true_rank=4)\ndelta_facts = make_high_rank_delta(d, k)\n \nprint(\"nStyle  update -- top 10 singular values:\", np.linalg.svd(delta_style, compute_uv=False)[:10].round(2))\nprint(\"Facts  update -- top 10 singular values:\", np.linalg.svd(delta_facts,  compute_uv=False)[:10].round(2))\nprint(\"nNotice: Style decays fast \u2192 low-rank. Facts decay slowly \u2192 high-rank.\")<\/code><\/pre>\n<\/div>\n<\/div>\n<h1 class=\"wp-block-heading\">LoRA Approximation (Standard Scaling: alpha\/r)<\/h1>\n<p>This part compares how well standard LoRA and RS-LoRA can reconstruct the original updates using different ranks. Both methods first use SVD to get the best possible rank-r approximation (i.e., compress the update into r dimensions), but they differ in how they scale the result: standard LoRA divides by r, while RS-LoRA divides by \u221ar. The table shows the reconstruction error \u2014 lower means better.<\/p>\n<p>The key takeaway is clear: for style updates, even small ranks (like 4 or 8) work well because the information is naturally low-rank, so the error quickly drops. But for fact updates, the error stays high at low ranks, proving that important information is being lost. Increasing the rank helps, but standard LoRA becomes unstable due to over-scaling (error doesn\u2019t consistently improve). RS-LoRA, with its \u221ar scaling, handles higher ranks more gracefully and reduces error more steadily, making it better suited for capturing complex, high-dimensional knowledge.<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\" no-line-numbers\"><code class=\" no-wrap language-php\">def lora_approx_standard(delta, r, alpha=16):\n    \"\"\"Approximate delta using rank-r LoRA with standard alpha\/r scaling.\"\"\"\n    U, S, Vt = np.linalg.svd(delta, full_matrices=False)\n    # Truncate to rank r\n    B = U[:, :r] * S[:r]          # shape (d, r)\n    A = Vt[:r, :]                  # shape (r, k)\n    scaling = alpha \/ r\n    delta_approx = scaling * (B @ A)\n    error = np.linalg.norm(delta - delta_approx, 'fro') \/ np.linalg.norm(delta, 'fro')\n    return delta_approx, error\n \ndef lora_approx_rslora(delta, r, alpha=16):\n    \"\"\"Approximate delta using rank-r LoRA with RS-LoRA sqrt(r) scaling.\"\"\"\n    U, S, Vt = np.linalg.svd(delta, full_matrices=False)\n    B = U[:, :r] * S[:r]\n    A = Vt[:r, :]\n    scaling = alpha \/ np.sqrt(r)   # &lt;-- the key change\n    delta_approx = scaling * (B @ A)\n    error = np.linalg.norm(delta - delta_approx, 'fro') \/ np.linalg.norm(delta, 'fro')\n    return delta_approx, error\n \nranks = [2, 4, 8, 16, 32, 48]\n \nstyle_errors_standard, facts_errors_standard = [], []\nstyle_errors_rslora,   facts_errors_rslora   = [], []\n \nfor r in ranks:\n    _, e = lora_approx_standard(delta_style, r);  style_errors_standard.append(e)\n    _, e = lora_approx_standard(delta_facts, r);  facts_errors_standard.append(e)\n    _, e = lora_approx_rslora(delta_style, r);    style_errors_rslora.append(e)\n    _, e = lora_approx_rslora(delta_facts, r);    facts_errors_rslora.append(e)\n \nprint(\"Rank | Style Err (std) | Facts Err (std) | Facts Err (RS-LoRA)\")\nprint(\"-\" * 60)\nfor i, r in enumerate(ranks):\n    print(f\"  {r:2d} |      {style_errors_standard[i]:.3f}      |      {facts_errors_standard[i]:.3f}      |      {facts_errors_rslora[i]:.3f}\")<\/code><\/pre>\n<\/div>\n<\/div>\n<h1 class=\"wp-block-heading\">Scaling Collapse Demo<\/h1>\n<p>This section explains why standard LoRA struggles at higher ranks. As the rank r increases, standard LoRA scales the update by \u03b1 \/ r, which shrinks rapidly \u2014 you can see it drop from 16 (at r=1) to just 0.25 (at r=64). This means that even though you\u2019re adding more dimensions (trying to capture more information), the overall update gets weaker and weaker, effectively suppressing the learning signal. The optimizer then has to compensate by pushing weights harder, which often leads to instability or poor convergence.<\/p>\n<p>RS-LoRA fixes this by changing the scaling to \u03b1 \/ \u221ar. Instead of shrinking too aggressively, the scale decreases more gradually \u2014 staying strong enough even at higher ranks (e.g., still 2.0 at r=64). This keeps the effective update magnitude meaningful, allowing the model to actually benefit from higher-rank representations without killing the signal. In simple terms: standard LoRA adds capacity but kills its impact, while RS-LoRA preserves both.<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\" no-line-numbers\"><code class=\" no-wrap language-php\">alpha = 16\nrs = np.arange(1, 65)\nstandard_scale = alpha \/ rs\nrslora_scale   = alpha \/ np.sqrt(rs)\n \nprint(\"nRank | Standard Scale (alpha\/r) | RS-LoRA Scale (alpha\/sqrt(r))\")\nprint(\"-\" * 55)\nfor r in [1, 4, 8, 16, 32, 64]:\n    print(f\"  {r:2d} |         {alpha\/r:.4f}          |         {alpha\/np.sqrt(r):.4f}\")\n \nprint(\"nStandard scaling vanishes as rank grows.\")\nprint(\"RS-LoRA scaling stays meaningful at high ranks.\")<\/code><\/pre>\n<\/div>\n<\/div>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"810\" height=\"433\" data-attachment-id=\"79326\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-460\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-53.png\" data-orig-size=\"810,433\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-53.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-53.png\" alt=\"\" class=\"wp-image-79326\" \/><\/figure>\n<h1 class=\"wp-block-heading\">Singular Value Spectrum<\/h1>\n<p>This section shows the core difference in how information is distributed between style and factual updates. For style, most of the important signal is concentrated in just a few dimensions \u2014 you can see that with rank 4, over 99% of the information is already captured. This is why low-rank methods like LoRA work so well for tone, format, or persona changes. There\u2019s a clear \u201celbow\u201d in the singular values \u2014 after a few components, the rest don\u2019t matter much.<\/p>\n<p>For facts, it\u2019s the opposite. The information is spread out across many dimensions \u2014 even at rank 8, you\u2019re only capturing about 28% of the total signal, which means most of the knowledge is still missing. This is the \u201clong tail\u201d problem: each additional dimension contributes something important. When LoRA truncates to a low rank, it cuts off this tail, leading to incomplete or incorrect knowledge. That\u2019s why the model may sound confident but still get factual details wrong.<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\" no-line-numbers\"><code class=\" no-wrap language-php\">sv_style = np.linalg.svd(delta_style, compute_uv=False)\nsv_facts  = np.linalg.svd(delta_facts,  compute_uv=False)\n \nprint(\"Cumulative variance captured by top-r components:n\")\nprint(f\"{'Rank':&gt;5} | {'Style (%)':&gt;10} | {'Facts (%)':&gt;10}\")\nprint(\"-\" * 32)\ntotal_style = np.sum(sv_style**2)\ntotal_facts  = np.sum(sv_facts**2)\nfor r in [2, 4, 8, 16, 32]:\n    cs = 100 * np.sum(sv_style[:r]**2) \/ total_style\n    cf = 100 * np.sum(sv_facts[:r]**2)  \/ total_facts\n    print(f\"  {r:3d} | {cs:9.1f}% | {cf:9.1f}%\")\n \nprint(\"nWith r=8, style is nearly fully captured.\")\nprint(\"With r=8, facts are still poorly captured -- the tail matters!\")<\/code><\/pre>\n<\/div>\n<\/div>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"809\" height=\"449\" data-attachment-id=\"79329\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-463\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-57.png\" data-orig-size=\"809,449\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-57.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-57.png\" alt=\"\" class=\"wp-image-79329\" \/><\/figure>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"808\" height=\"525\" data-attachment-id=\"79328\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-462\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-55.png\" data-orig-size=\"808,525\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-55.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-55.png\" alt=\"\" class=\"wp-image-79328\" \/><\/figure>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"807\" height=\"500\" data-attachment-id=\"79332\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-465\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-58.png\" data-orig-size=\"807,500\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-58.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-58.png\" alt=\"\" class=\"wp-image-79332\" \/><\/figure>\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"814\" height=\"586\" data-attachment-id=\"79333\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/image-466\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-59.png\" data-orig-size=\"814,586\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-59.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/04\/image-59.png\" alt=\"\" class=\"wp-image-79333\" \/><\/figure>\n<hr class=\"wp-block-separator aligncenter has-alpha-channel-opacity is-style-wide\" \/>\n<p>Check out\u00a0the\u00a0<strong><a href=\"https:\/\/github.com\/Marktechpost\/AI-Agents-Projects-Tutorials\/blob\/main\/Data%20Science\/LoRA_Assumption.ipynb\" target=\"_blank\" rel=\"noreferrer noopener\">Full Codes here<\/a><\/strong>.<strong>\u00a0<\/strong>Find 100s of ML\/Data Science\u00a0<strong><a href=\"https:\/\/github.com\/Marktechpost\/Machine-learning-Data-science-Tutorials\" target=\"_blank\" rel=\"noreferrer noopener\">Colab Notebooks here<\/a><\/strong>. Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">130k+ ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>. Wait! are you on telegram?\u00a0<strong><a href=\"https:\/\/t.me\/machinelearningresearchnews\" target=\"_blank\" rel=\"noreferrer noopener\">now you can join us on telegram as well.<\/a><\/strong><\/p>\n<p>Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.?\u00a0<strong><a href=\"https:\/\/forms.gle\/MTNLpmJtsFA3VRVd9\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Connect with us<\/mark><\/a><\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/04\/26\/the-lora-assumption-that-breaks-in-production\/\">The LoRA Assumption That Breaks in Production\u00a0<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>LoRA is widely used for fine-t&hellip;<\/p>\n","protected":false},"author":1,"featured_media":800,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-799","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/799","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=799"}],"version-history":[{"count":0,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/799\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/media\/800"}],"wp:attachment":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=799"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=799"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=799"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}