{"id":930,"date":"2026-05-19T05:26:04","date_gmt":"2026-05-18T21:26:04","guid":{"rendered":"https:\/\/connectword.dpdns.org\/?p=930"},"modified":"2026-05-19T05:26:04","modified_gmt":"2026-05-18T21:26:04","slug":"meet-memprivacy-an-edge-cloud-framework-that-uses-local-reversible-pseudonymization-to-protect-user-data-without-breaking-memory-utility","status":"publish","type":"post","link":"https:\/\/connectword.dpdns.org\/?p=930","title":{"rendered":"Meet MemPrivacy: An Edge-Cloud Framework that Uses Local Reversible Pseudonymization to Protect User Data Without Breaking Memory Utility"},"content":{"rendered":"<p>As LLM-powered agents move from research to production, one design tension is becoming harder to ignore: the more useful cloud-hosted memory becomes, the more private user data it exposes. Researchers from MemTensor (Shanghai), HONOR Device and Tongji University have introduced <strong>MemPrivacy<\/strong>, a framework that attempts to resolve this tension without sacrificing the utility that makes personalized memory worthwhile in the first place.<\/p>\n<h2 class=\"wp-block-heading\"><strong>The Core Problem With Cloud Memory<\/strong><\/h2>\n<p>When you interact with an AI agent, your conversation often contains sensitive details like health conditions, email addresses, financial figures, passwords, and more. In a typical edge-cloud deployment, the user\u2019s device (the edge) handles input, while computation-heavy memory management and reasoning happen in the cloud. This architecture is efficient, but it means raw, unfiltered user data travels to and persists in cloud systems.<\/p>\n<p>The risk is not theoretical. Prior studies show that multi-turn memory attacks can induce privacy violations with success rates up to 69%, and leakage attacks against memory systems can reach 75% success. Indirect prompt injection can even manipulate agents into actively eliciting private information from users. Once sensitive content enters cloud logs, vector databases, or external memory stores, it can remain accessible through subsequent storage, retrieval, and reuse stages well beyond the original interaction.<\/p>\n<p>Prior works have tried to address this with masking \u2014 replacing sensitive values with tokens like <code>***<\/code>. The problem is that masking destroys semantics. If a user asks an agent to draft a doctor\u2019s email and their blood pressure reading and email address are both replaced with <code>***<\/code>, the cloud model cannot complete the task meaningfully. More principled techniques such as differential privacy and cryptographic protection offer stronger guarantees but are difficult to integrate into interactive memory pipelines without degrading response quality.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1382\" height=\"730\" data-attachment-id=\"79955\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/18\/meet-memprivacy-an-edge-cloud-framework-that-uses-local-reversible-pseudonymization-to-protect-user-data-without-breaking-memory-utility\/screenshot-2026-05-18-at-2-14-57-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-18-at-2.14.57-PM-1.png\" data-orig-size=\"1382,730\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-05-18 at 2.14.57\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-18-at-2.14.57-PM-1-1024x541.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-18-at-2.14.57-PM-1.png\" alt=\"\" class=\"wp-image-79955\" \/><figcaption class=\"wp-element-caption\">https:\/\/arxiv.org\/pdf\/2605.09530v2<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>What MemPrivacy Does Differently<\/strong><\/h2>\n<p>Rather than masking private content, MemPrivacy replaces it with <em>typed placeholders<\/em> \u2014 structured tokens like <code>&lt;Health_Info_1&gt;<\/code> or <code>&lt;Email_1&gt;<\/code> \u2014 before the input leaves the local device. The cloud model receives semantically intact text and can reason and store memories normally; it just never sees the actual values. When the cloud returns a response containing placeholders, the local device looks up the originals from a secure local database and substitutes them back in. The user sees a fully coherent, personalized response.<\/p>\n<p>This design is called<strong> <em>local reversible pseudonymization<\/em><\/strong>, and the full pipeline operates in three stages. <strong>Stage 1 (Uplink Desensitization):<\/strong> A lightweight on-device model identifies privacy-sensitive spans in the input, classifies each by type and sensitivity level, and replaces them with typed placeholders. The original-to-placeholder mappings are stored locally and persist across sessions so the same value always gets the same placeholder. <strong>Stage 2 (Cloud Processing):<\/strong> The sanitized input is sent to the cloud agent or memory system. The typed placeholders preserve enough semantic structure for memory formation and retrieval to function correctly. <strong>Stage 3 (Downlink Restoration):<\/strong> The cloud response, which may contain placeholders, is restored locally via lightweight database lookup and string substitution, adding negligible latency.<\/p>\n<h2 class=\"wp-block-heading\"><strong>A Four-Level Privacy Taxonomy<\/strong><\/h2>\n<p><strong>A key contribution by the research team is a four-level privacy taxonomy (PL1\u2013PL4) that defines what gets protected and at what threshold:<\/strong><\/p>\n<ul class=\"wp-block-list\">\n<li><strong>PL1<\/strong> covers general preferences, habits, and stylistic choices that do not identify a person and carry low risk. These are not protected by default.<\/li>\n<li><strong>PL2<\/strong> includes identifiable PII \u2014 real names, phone numbers, email addresses, detailed addresses, account usernames, and combinations that could identify or trace a specific individual.<\/li>\n<li><strong>PL3<\/strong> covers highly sensitive PII: government document numbers, financial account details, health records, precise location and trajectory data, biometrics, raw communication content, and sensitive identity attributes such as religious beliefs or ethnicity.<\/li>\n<li><strong>PL4<\/strong> is the highest tier \u2014 credentials and secrets that are immediately exploitable: passwords, PINs, verification codes, session tokens, API keys, private keys, seed phrases, and undisclosed business materials. Exposure at this level can directly result in account takeover, financial loss, or large-scale data exfiltration.<\/li>\n<\/ul>\n<p>Users can configure the masking threshold for example, protecting only PL3 and PL4, or applying full protection across PL2\u2013PL4 \u2014 giving granular control over the privacy\u2013utility trade-off.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"1340\" height=\"642\" data-attachment-id=\"79957\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/05\/18\/meet-memprivacy-an-edge-cloud-framework-that-uses-local-reversible-pseudonymization-to-protect-user-data-without-breaking-memory-utility\/screenshot-2026-05-18-at-2-15-34-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-18-at-2.15.34-PM-1.png\" data-orig-size=\"1340,642\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-05-18 at 2.15.34\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-18-at-2.15.34-PM-1-1024x491.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/05\/Screenshot-2026-05-18-at-2.15.34-PM-1.png\" alt=\"\" class=\"wp-image-79957\" \/><figcaption class=\"wp-element-caption\">https:\/\/arxiv.org\/pdf\/2605.09530v2<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>MemPrivacy-Bench and Model Training<\/strong><\/h2>\n<p>To train and evaluate their approach, the research team constructed <strong>MemPrivacy-Bench<\/strong>, a dataset covering 200 synthetic user profiles and over 155,000 privacy instances (125,776 training, 29,967 test) across balanced Chinese and English dialogue, spanning 7 high-level scenario categories and 23 fine-grained subcategories. The test set contains 615 question-answer pairs across six memory task types: basic memory, temporal reasoning, adversarial questioning, dynamic updating, implicit inference, and information aggregation. Annotations were first generated by a dual-model pipeline using Gemini-3.1-Pro and GPT-5.2, then verified by six human annotators, achieving a final annotation accuracy of 98.08%.<\/p>\n<p>The MemPrivacy extraction models are fine-tuned from Qwen3 base models at 0.6B, 1.7B, and 4B parameter scales using supervised fine-tuning (SFT) followed by reinforcement learning with Group Relative Policy Optimization (GRPO). GRPO estimates advantages based on relative rewards across multiple sampled outputs per input, using F1 score as the reward signal, avoiding the computational overhead of a separately trained critic. Training used 160 users for the training split and 40 users for the test split.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Experimental Results<\/strong><\/h2>\n<p>On MemPrivacy-Bench, the best-performing model \u2014 MemPrivacy-4B-RL \u2014 achieves an F1 score of 85.97%, compared to 78.41% for Gemini-3.1-Pro, the strongest general-purpose model tested. Even the smallest model, MemPrivacy-0.6B-SFT, reaches 83.09% F1, outperforming all general-purpose models evaluated. On the out-of-distribution PersonaMem-v2 benchmark, MemPrivacy-4B-RL achieves 94.48% F1, compared to 92.18% for DeepSeek-V3.2-Think, the best general model on that set.<\/p>\n<p>OpenAI\u2019s recently released Privacy-Filter, a bidirectional token-classification model for PII detection open-sourced. It achieves 35.50% F1 on MemPrivacy-Bench, a gap of over 50 percentage points behind the best MemPrivacy model, though it operates at significantly lower latency (0.34s versus roughly 2s for MemPrivacy models on MemPrivacy-Bench).<\/p>\n<p>On downstream memory utility, MemPrivacy was tested across three widely used memory systems: <strong>LangMem<\/strong>, <strong>Mem0<\/strong>, and <strong>Memobase<\/strong>. When protecting all PL2\u2013PL4 content, accuracy drops on MemPrivacy-Bench are contained to 0.73%\u20131.30% and 0.71%\u20131.60% on PersonaMem-v2, relative to no-protection baselines. By contrast, irreversible masking causes accuracy drops of 16.99%\u201341.87% on MemPrivacy-Bench, while untyped placeholder masking causes drops of 4.72%\u20136.67% on MemPrivacy-Bench and 2.67%\u20138.71% on PersonaMem-v2.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h2>\n<ul class=\"wp-block-list\">\n<li>MemPrivacy replaces sensitive user data with semantically typed placeholders (e.g., <code>&lt;Health_Info_1&gt;<\/code>) on-device before cloud transmission, so the cloud memory system never receives raw private values.<\/li>\n<li>The framework introduces a four-level privacy taxonomy (PL1\u2013PL4) ranging from general preferences to immediately exploitable credentials, with user-configurable masking thresholds.<\/li>\n<li>MemPrivacy-4B-RL achieves 85.97% F1 on MemPrivacy-Bench and 94.48% on PersonaMem-v2, outperforming GPT-5.2 (68.99%) and Gemini-3.1-Pro (78.41%) on privacy span extraction.<\/li>\n<li>Across LangMem, Mem0, and Memobase, applying MemPrivacy at the PL2\u2013PL4 level limits memory utility loss to within 1.6%, compared to accuracy drops of up to 41.87% with irreversible masking.<\/li>\n<li>Models range from 0.6B to 4B parameters, with per-message inference under two seconds, making the framework suitable for on-device deployment without noticeable latency.<\/li>\n<\/ul>\n<h2 class=\"wp-block-heading\"><strong>Marktechpost\u2019s Visual Explainer<\/strong><\/h2>\n<div>\n<div class=\"mp-header\">\n<div class=\"mp-logo\">\n      <span class=\"mp-logo-dot\"><\/span><br \/>\n      <span class=\"mp-logo-text\">MemPrivacy<\/span>\n    <\/div>\n<p>    <span class=\"mp-badge\">Developer Guide<\/span>\n  <\/p><\/div>\n<div class=\"mp-progress\"><\/div>\n<div class=\"mp-slider\">\n<div class=\"mp-track\">\n<p>      <!-- SLIDE 1: What is MemPrivacy --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">01 \/ 07 \u00a0\u2014\u00a0 Overview<\/span>\n<div class=\"mp-slide-title\">What is <span>MemPrivacy<\/span>?<\/div>\n<div class=\"mp-body\">\n<p>MemPrivacy is a privacy-preserving personalized memory management framework for <strong>edge-cloud LLM agents<\/strong>, developed by MemTensor, HONOR, and Tongji University.<\/p>\n<p>In a standard edge-cloud agent, your raw input \u2014 including sensitive data like health info, emails, and passwords \u2014 gets sent directly to the cloud for memory processing. MemPrivacy stops that.<\/p>\n<hr class=\"mp-divider\" \/>\n<div class=\"mp-flow\">\n<div class=\"mp-flow-box\">\n              <span class=\"mp-flow-box-label\">User Input<\/span><br \/>\n              <span class=\"mp-flow-box-val\">Raw text with private values<\/span>\n            <\/div>\n<div class=\"mp-flow-arrow\">\u2192<\/div>\n<div class=\"mp-flow-box\">\n              <span class=\"mp-flow-box-label\">On-Device<\/span><br \/>\n              <span class=\"mp-flow-box-val\">Detect &amp; replace with typed placeholders<\/span>\n            <\/div>\n<div class=\"mp-flow-arrow\">\u2192<\/div>\n<div class=\"mp-flow-box\">\n              <span class=\"mp-flow-box-label\">Cloud<\/span><br \/>\n              <span class=\"mp-flow-box-val\">Sees only placeholders, reasons normally<\/span>\n            <\/div>\n<div class=\"mp-flow-arrow\">\u2192<\/div>\n<div class=\"mp-flow-box\">\n              <span class=\"mp-flow-box-label\">Restore<\/span><br \/>\n              <span class=\"mp-flow-box-val\">Original values reinserted locally<\/span>\n            <\/div>\n<\/div>\n<div class=\"mp-callout\">\n            <span class=\"mp-callout-label\">Key Idea<\/span>\n<p>Privacy protection is decoupled from semantic destruction. The cloud gets enough structure to reason \u2014 but never the actual private values.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- SLIDE 2: Why It Matters --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">02 \/ 07 \u00a0\u2014\u00a0 The Problem<\/span>\n<div class=\"mp-slide-title\">Why existing approaches <span>fall short<\/span><\/div>\n<div class=\"mp-body\">\n<p>Most cloud memory systems receive your raw input in plaintext. Once that data enters cloud logs or vector databases, it can persist indefinitely and be retrieved later.<\/p>\n<hr class=\"mp-divider\" \/>\n<ul class=\"mp-steps\">\n<li>\n              <span class=\"mp-step-num\">!<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Multi-turn memory attacks<\/strong> can extract user data with up to <strong>69% success rate<\/strong> according to published research.<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">!<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Memory leakage attacks<\/strong> against cloud memory systems reach up to <strong>75% success<\/strong> in documented studies.<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">!<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Full masking<\/strong> (replacing values with <code>***<\/code>) protects privacy but destroys the semantic cues the model needs to complete tasks.<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">!<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Differential privacy &amp; cryptography<\/strong> offer strong guarantees but are hard to integrate into interactive memory pipelines without major utility loss.<\/span>\n            <\/li>\n<\/ul>\n<div class=\"mp-callout\">\n            <span class=\"mp-callout-label\">MemPrivacy\u2019s answer<\/span>\n<p>Use semantically-typed placeholders \u2014 <strong>not blank masks<\/strong> \u2014 so the cloud can still reason about the type and role of information without seeing the actual value.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- SLIDE 3: Privacy Taxonomy --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">03 \/ 07 \u00a0\u2014\u00a0 Privacy Levels<\/span>\n<div class=\"mp-slide-title\">The Four-Level <span>Privacy Taxonomy<\/span> (PL1\u2014PL4)<\/div>\n<div class=\"mp-body\">\n<p>MemPrivacy classifies every detected span into one of four levels. You can configure which levels get masked \u2014 e.g. mask only PL3+PL4, or all of PL2\u2014PL4.<\/p>\n<div class=\"mp-pills\">\n            <span class=\"mp-pill mp-pill-pl1\">PL1 \u00a0Low<\/span><br \/>\n            <span class=\"mp-pill mp-pill-pl2\">PL2 \u00a0Identifiable<\/span><br \/>\n            <span class=\"mp-pill mp-pill-pl3\">PL3 \u00a0Highly Sensitive<\/span><br \/>\n            <span class=\"mp-pill mp-pill-pl4\">PL4 \u00a0Critical<\/span>\n          <\/div>\n<table class=\"mp-table\">\n<tr>\n<th>Level<\/th>\n<th>What it covers<\/th>\n<th>Examples<\/th>\n<\/tr>\n<tr>\n<td><span class=\"mp-pill mp-pill-pl1\">PL1<\/span><\/td>\n<td>Preferences, habits, stylistic choices. Cannot identify a person.<\/td>\n<td>Food preferences, tone choices<\/td>\n<\/tr>\n<tr>\n<td><span class=\"mp-pill mp-pill-pl2\">PL2<\/span><\/td>\n<td>Information that can identify or trace a specific individual.<\/td>\n<td>Full name, email, phone, address, account ID<\/td>\n<\/tr>\n<tr>\n<td><span class=\"mp-pill mp-pill-pl3\">PL3<\/span><\/td>\n<td>Data whose leakage can cause significant harm to safety, health, or finances.<\/td>\n<td>Medical records, bank account, passport number, biometrics, precise location<\/td>\n<\/tr>\n<tr>\n<td><span class=\"mp-pill mp-pill-pl4\">PL4<\/span><\/td>\n<td>Immediately exploitable secrets \u2014 usable for account takeover or financial loss.<\/td>\n<td>Passwords, PINs, OTPs, API keys, private keys, session tokens<\/td>\n<\/tr>\n<\/table><\/div>\n<\/div>\n<p>      <!-- SLIDE 4: How Placeholders Work --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">04 \/ 07 \u00a0\u2014\u00a0 Typed Placeholders<\/span>\n<div class=\"mp-slide-title\">How <span>typed placeholders<\/span> preserve utility<\/div>\n<div class=\"mp-body\">\n<p>When a privacy span is detected, it is replaced with a structured token that carries the <strong>semantic type<\/strong> of the information \u2014 not just a blank mask.<\/p>\n<div class=\"mp-code\"><code>\/\/ Original user input:<br \/>\n\"My blood pressure today was 160\/110.<br \/>\n Reply to user@workmail.com.<br \/>\n Never mention my recovery code RC-7291.\"\n<p>\/\/ After MemPrivacy uplink desensitization:<br \/>\n\"My blood pressure today was &lt;Health_Info_1&gt;.<br \/>\n Reply to user@workmail.com.<br \/>\n Never mention my &lt;Recovery_Code_1&gt;.\"<\/p><\/code><\/div>\n<p>The cloud sees <code>&lt;Health_Info_1&gt;<\/code> and knows it\u2019s health data. It can draft the email correctly. It never sees <strong>160\/110<\/strong> or <strong>RC-7291<\/strong>.<\/p>\n<div class=\"mp-callout\">\n            <span class=\"mp-callout-label\">Session Persistence<\/span>\n<p>The original\u2014to\u2014placeholder mapping is stored in a <strong>local secure database<\/strong> and persists across sessions. The same value always gets the same placeholder, enabling consistent long-term memory.<\/p>\n<\/div>\n<p>Multiple spans of the same type are distinguished by incremental indices: <code>&lt;Email_1&gt;<\/code>, <code>&lt;Email_2&gt;<\/code>, etc.<\/p>\n<\/div>\n<\/div>\n<p>      <!-- SLIDE 5: Setup &amp; Installation --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">05 \/ 07 \u00a0\u2014\u00a0 Getting Started<\/span>\n<div class=\"mp-slide-title\">Installation &amp; <span>model setup<\/span><\/div>\n<div class=\"mp-body\">\n<p>MemPrivacy models are available at three scales for different edge hardware budgets: <strong>0.6B, 1.7B, and 4B<\/strong> parameters (all based on Qwen3). The 4B-RL model is the strongest.<\/p>\n<div class=\"mp-code\"><code># Clone the repository<br \/>\ngit clone https:\/\/github.com\/MemTensor\/MemPrivacy\n<p># Install dependencies<br \/>\ncd MemPrivacy<br \/>\npip install -r requirements.txt<\/p><\/code><\/div>\n<div class=\"mp-code\"><code># Load model from HuggingFace<br \/>\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\n<p>model_id = \"IAAR-Shanghai\/MemPrivacy-4B-RL\"<br \/>\ntokenizer = AutoTokenizer.from_pretrained(model_id)<br \/>\nmodel = AutoModelForCausalLM.from_pretrained(<br \/>\n    model_id, torch_dtype=\"auto\", device_map=\"auto\"<br \/>\n)<\/p><\/code><\/div>\n<div class=\"mp-callout\">\n            <span class=\"mp-callout-label\">Model collection<\/span>\n<p>All six model variants (0.6B\/1.7B\/4B \u00d7 SFT\/RL) are available at:<br \/><strong>huggingface.co\/collections\/IAAR-Shanghai\/memprivacy<\/strong><\/p>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- SLIDE 6: Integration Pattern --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">06 \/ 07 \u00a0\u2014\u00a0 Integration<\/span>\n<div class=\"mp-slide-title\">Integrating with <span>Mem0, LangMem, or Memobase<\/span><\/div>\n<div class=\"mp-body\">\n<p>MemPrivacy sits between your user-facing application and the cloud memory system. The three-stage pipeline maps directly onto your existing architecture.<\/p>\n<ul class=\"mp-steps\">\n<li>\n              <span class=\"mp-step-num\">1<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Uplink:<\/strong> Pass raw user input through the MemPrivacy model. It returns a list of detected spans with <code>(original_text, privacy_level, privacy_type)<\/code>. Replace each span at or above your configured threshold with a typed placeholder. Store mappings locally.<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">2<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Cloud call:<\/strong> Send the desensitized input to your existing memory system (Mem0, LangMem, Memobase) as normal. No changes to the cloud-side configuration are needed.<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">3<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Downlink:<\/strong> Scan the cloud response for placeholders. Query your local mapping database and substitute each placeholder with its original value before displaying to the user.<\/span>\n            <\/li>\n<\/ul>\n<div class=\"mp-callout\">\n            <span class=\"mp-callout-label\">Masking threshold config<\/span>\n<p>Set <code>lambda = \"PL4\"<\/code> to protect only credentials, <code>\"PL3\"<\/code> for PL3+PL4, or <code>\"PL2\"<\/code> for full protection. Utility loss at PL4-only is below 0.89% across all tested memory systems.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<p>      <!-- SLIDE 7: Results &amp; Resources --><\/p>\n<div class=\"mp-slide\">\n        <span class=\"mp-slide-num\">07 \/ 07 \u00a0\u2014\u00a0 Results &amp; Resources<\/span>\n<div class=\"mp-slide-title\">Benchmark results &amp; <span>where to go next<\/span><\/div>\n<div class=\"mp-body\">\n<table class=\"mp-table\">\n<tr>\n<th>Model<\/th>\n<th>F1 (MemPrivacy-Bench)<\/th>\n<th>F1 (PersonaMem-v2)<\/th>\n<th>Latency<\/th>\n<\/tr>\n<tr>\n<td>MemPrivacy-4B-RL<\/td>\n<td>85.97%<\/td>\n<td>94.48%<\/td>\n<td>~2s<\/td>\n<\/tr>\n<tr>\n<td>MemPrivacy-0.6B-RL<\/td>\n<td>84.66%<\/td>\n<td>93.40%<\/td>\n<td>~1.6s<\/td>\n<\/tr>\n<tr>\n<td>Gemini-3.1-Pro<\/td>\n<td>78.41%<\/td>\n<td>86.59%<\/td>\n<td>~33s<\/td>\n<\/tr>\n<tr>\n<td>OpenAI-Privacy-Filter<\/td>\n<td>35.50%<\/td>\n<td>85.27%<\/td>\n<td>0.34s<\/td>\n<\/tr>\n<\/table>\n<p>Utility loss when protecting PL2\u2014PL4 content across LangMem, Mem0, and Memobase is <strong>within 1.6%<\/strong> vs. no-protection baselines. Irreversible masking causes up to <strong>41.87% accuracy drop<\/strong> on the same systems.<\/p>\n<hr class=\"mp-divider\" \/>\n<ul class=\"mp-steps\">\n<li>\n              <span class=\"mp-step-num\">\u2197<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Code:<\/strong> github.com\/MemTensor\/MemPrivacy<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">\u2197<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Models:<\/strong> huggingface.co\/collections\/IAAR-Shanghai\/memprivacy<\/span>\n            <\/li>\n<li>\n              <span class=\"mp-step-num\">\u2197<\/span><br \/>\n              <span class=\"mp-step-text\"><strong>Paper:<\/strong> arxiv.org\/abs\/2605.09530<\/span>\n            <\/li>\n<\/ul><\/div>\n<\/div>\n<\/div>\n<p><!-- end track -->\n  <\/p><\/div>\n<p><!-- end slider --><\/p>\n<div class=\"mp-footer\">\n<div class=\"mp-dots\"><\/div>\n<p>    <span class=\"mp-slide-count\">1 \/ 7<\/span><\/p>\n<div class=\"mp-nav\">\n      <button class=\"mp-btn\" disabled>\u2190 Prev<\/button><br \/>\n      <button class=\"mp-btn\">Next \u2192<\/button>\n    <\/div>\n<\/div>\n<\/div>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out\u00a0the\u00a0<strong><a href=\"https:\/\/arxiv.org\/pdf\/2605.09530v2\" target=\"_blank\" rel=\"noreferrer noopener\">Paper<\/a> <\/strong>and<strong> <a href=\"https:\/\/huggingface.co\/collections\/IAAR-Shanghai\/memprivacy\" target=\"_blank\" rel=\"noreferrer noopener\">Model Weights<\/a>.\u00a0<\/strong>Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">150k+ ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>. Wait! are you on telegram?\u00a0<strong><a href=\"https:\/\/t.me\/machinelearningresearchnews\" target=\"_blank\" rel=\"noreferrer noopener\">now you can join us on telegram as well.<\/a><\/strong><\/p>\n<p>Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.?\u00a0<strong><a href=\"https:\/\/forms.gle\/MTNLpmJtsFA3VRVd9\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Connect with us<\/mark><\/a><\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/05\/18\/meet-memprivacy-an-edge-cloud-framework-that-uses-local-reversible-pseudonymization-to-protect-user-data-without-breaking-memory-utility\/\">Meet MemPrivacy: An Edge-Cloud Framework that Uses Local Reversible Pseudonymization to Protect User Data Without Breaking Memory Utility<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>As LLM-powered agents move fro&hellip;<\/p>\n","protected":false},"author":1,"featured_media":931,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-930","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/930","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=930"}],"version-history":[{"count":0,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/posts\/930\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=\/wp\/v2\/media\/931"}],"wp:attachment":[{"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=930"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=930"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/connectword.dpdns.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=930"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}