Everything is Context: Agentic File System Abstraction for Context Engineering
- Source: https://arxiv.org/pdf/2512.05470
- Authors: Xiwei Xu, Xuewu Gu, Robert Mao, Yechao Li, Quan Bai, Liming Zhu
- Date: 2025-12-05
- Clipped: 2026-03-09 (SGT)
Gist
The paper argues that GenAI systems should treat context management as a first-class software architecture problem. It proposes an Agentic File System (AFS) abstraction (“everything is context/file”) to unify memory, tools, external APIs, and human inputs under one governed, auditable interface.
Key ideas
- Problem shift: from model tuning to context engineering (capture, structure, govern, and refresh context).
- Core abstraction: file-system-style namespace/mounting for heterogeneous context resources.
- SE principles applied: abstraction, modularity, encapsulation, separation of concerns, traceability, composability.
- Persistent context lifecycle:
- History (immutable source of truth),
- Memory (structured/indexed long-lived representations),
- Scratchpad (temporary task workspace).
- Design constraints identified: token-window limits, model statelessness, and output non-determinism.
- Pipeline design:
- Context Constructor (select/prioritize/compress),
- Context Updater (inject/refresh context during runtime),
- Context Evaluator (validate outputs, update memory, escalate to human review).
- Implementation: in open-source AIGNE framework with AFS modules (
list/read/write/search) and metadata-backed traceability. - Exemplars: memory-enabled conversational agent and MCP-based GitHub assistant mounted as filesystem resources.
Practical implications
- Encourages treating context operations like infra/data engineering (versioning, lineage, governance) rather than ad-hoc prompt stitching.
- Makes human-in-the-loop correction and audit trails native to the architecture.
- Useful for enterprise scenarios that require reproducibility, compliance, and multi-agent coordination.
Caveats
- Short paper focused on architecture/implementation blueprint; limited quantitative benchmarking.
- Real impact depends on quality of retrieval/indexing policies and operational governance in production.