
Feed Types & Storage Models

To maximize performance and minimize token costs for AI Agents, PipeAgent uses a three-tier storage architecture. This ensures that Agents process only the data they actually need.

The Problem: Data Bloat & Token Burn

Traditional scraping returns raw HTML or large, unoptimized JSON. When an Agent reads a 100 KB HTML page just to find one price, you are "burning" thousands of tokens on layout code rather than on reasoning.
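To put a rough number on that claim, the sketch below estimates token counts from character counts. The ratio of about 4 characters per token is an assumed rule of thumb, not a PipeAgent figure; real tokenizers vary, and markup-heavy HTML often tokenizes less efficiently.

```python
# Rough token estimate. CHARS_PER_TOKEN is an assumed heuristic,
# not a value published by PipeAgent or any specific tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(num_chars: int, chars_per_token: int = CHARS_PER_TOKEN) -> int:
    """Return an approximate token count for a text of num_chars characters."""
    return num_chars // chars_per_token


raw_html_chars = 100 * 1024                 # a 100 KB page, assuming 1 byte per character
projected_chars = len('{"price": 19.99}')   # the single field the Agent wanted

print(estimate_tokens(raw_html_chars))    # 25600
print(estimate_tokens(projected_chars))   # 4
```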

The Solution: 3 Specialized Models

Providers choose one of three models when they ingest data through the Push API.

1. Singleton (Atomic Mode)

Best for: Single JSON objects, configuration files, or real-time status updates (e.g., "Current Weather").

How it works: The entire payload is stored as a single blob. Each push atomically replaces the previous state.

Agent Payload: Small (1:1 with source).

2. Collection (Batch Mode)

Best for: Lists that update as a group (e.g., "Top 50 Cryptos", "Current Stock Prices").

How it works: Providers push a batch of records. Consumers get a stable, paginated view of the latest version.

Agent Payload: Partial (Only requested items/fields).

3. Stream (Append Mode)

Best for: Time-series data or growing lists (e.g., "Trade History", "News Feed").

How it works: Every push appends new records to the feed. Consumers navigate using time-based cursors.

Agent Payload: Optimized (Incremental updates).
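The difference between the three models comes down to what a push does to the stored state. The in-memory sketch below illustrates those semantics only; it is not PipeAgent's implementation, and the class names and the `page` and `since` helpers are hypothetical. The stream cursor is assumed to be an exclusive lower bound on `event_timestamp`.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class SingletonFeed:
    """Singleton: each push replaces the whole stored payload."""
    payload: Any = None

    def push(self, payload: Any) -> None:
        self.payload = payload


@dataclass
class CollectionFeed:
    """Collection: each push replaces the batch; readers page the latest version."""
    version: int = 0
    records: list = field(default_factory=list)

    def push(self, batch: list) -> None:
        self.records = list(batch)
        self.version += 1

    def page(self, offset: int = 0, limit: int = 20) -> list:
        return self.records[offset:offset + limit]


@dataclass
class StreamFeed:
    """Stream: each push appends; readers ask for records after a time cursor."""
    records: list = field(default_factory=list)

    def push(self, batch: list) -> None:
        self.records.extend(batch)

    def since(self, cursor: str, limit: int = 20) -> list:
        # Assumes all timestamps share one UTC ISO-8601 format, so plain
        # string comparison matches chronological order.
        newer = [r for r in self.records if r["event_timestamp"] > cursor]
        return newer[:limit]


weather = SingletonFeed()
weather.push({"temp_c": 18})
weather.push({"temp_c": 21})   # replaces the previous state
print(weather.payload)         # {'temp_c': 21}
```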

---

Query Guardrails & Performance

To ensure millisecond response times and multi-tenant isolation, PipeAgent enforces strict performance guardrails on all data queries.

1. SQL-Like Filtering (Strict `eq`)

You can filter data using standard query parameters, but only the equality (`eq`) operator is supported, and only on top-level fields of the JSONB payload.

Allowed: GET https://api.pipeagent.dev/api/v1/feed/{feed_id}?status=eq.bullish

Disallowed: Range (lt, gt) or pattern (ilike) filters on data fields are disabled to prevent slow database scans.
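A small client-side helper can keep requests inside these rules. This is a minimal sketch: `eq_filter_url` is a hypothetical name, the base URL is copied from the request examples on this page, and rejecting dotted field names is only this sketch's way of modelling the "top-level fields" restriction.

```python
from typing import Mapping
from urllib.parse import urlencode

# Base URL as it appears in the request examples on this page.
BASE_URL = "https://api.pipeagent.dev/api/v1/feed"


def eq_filter_url(feed_id: str, filters: Mapping[str, str]) -> str:
    """Build a feed URL in which every filter uses the eq operator."""
    for name in filters:
        if "." in name:
            # Only top-level payload fields can be filtered.
            raise ValueError(f"nested field not allowed: {name!r}")
    params = {name: f"eq.{value}" for name, value in filters.items()}
    query = urlencode(params)
    url = f"{BASE_URL}/{feed_id}"
    return f"{url}?{query}" if query else url


print(eq_filter_url("my-feed", {"status": "bullish"}))
# https://api.pipeagent.dev/api/v1/feed/my-feed?status=eq.bullish
```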

2. JSONPath Projection

Use JSONPath to extract only the specific fields your Agent needs.

Supported: Simple path extractions (e.g., $.metadata.price, $.items[0].price).

Disallowed: Filter expressions (e.g., $[?(@.price < 100)]) are disabled at the API level to prevent high CPU overhead. Only projection (path extraction) is allowed.
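Because filter expressions are rejected by the API, it can help to catch them before sending a request. The check below is a deliberately conservative sketch, not a JSONPath parser: the function name is hypothetical, and it flags any path that contains the `[?` token that opens a filter selector.

```python
def is_projection_only(jsonpath: str) -> bool:
    """Return True if the path looks like a plain projection (no filter selector)."""
    return jsonpath.startswith("$") and "[?" not in jsonpath


print(is_projection_only("$.metadata.price"))      # True
print(is_projection_only("$.items[0].price"))      # True
print(is_projection_only("$[?(@.price < 100)]"))   # False
```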

3. Enforced Pagination

To protect the context window of AI Agents, we strictly limit the amount of data returned in a single request.

Default Limit: 20 records.

Maximum Limit: 50 records. Any request for more than 50 records will be capped at 50.

Applies to: list reads, cursor reads, and /details (ids mode) responses.
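The limit rules above amount to a default plus a cap. A minimal sketch of the arithmetic follows; the function name is hypothetical, and how the API treats values below 1 is not documented here, so the sketch does not model it.

```python
from typing import Optional

DEFAULT_LIMIT = 20   # used when no limit is requested
MAX_LIMIT = 50       # larger requests are capped at this value


def effective_limit(requested: Optional[int] = None) -> int:
    """Return the number of records a request can expect at most."""
    if requested is None:
        return DEFAULT_LIMIT
    return min(requested, MAX_LIMIT)


print(effective_limit())      # 20
print(effective_limit(35))    # 35
print(effective_limit(500))   # 50
```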

---

Consumer Usage

Production base URL: https://pipeagent.dev. Paths below use /api/v1/feed/{feed_id}.

1. Basic Fetch

Returns the Singleton object or the latest page of a Collection/Stream.

```bash
GET https://api.pipeagent.dev/api/v1/feed/{feed_id}
```
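From Python, the same request can be issued with the standard library alone. This is a sketch under assumptions: the response is assumed to be JSON, the `Accept` header is an illustrative choice, and any authentication your account requires is not covered on this page and is omitted here.

```python
import json
from urllib.request import Request, urlopen

# Base URL as it appears in the request examples on this page.
BASE_URL = "https://api.pipeagent.dev/api/v1/feed"


def build_request(feed_id: str) -> Request:
    """Build (but do not send) a GET request for a feed."""
    return Request(
        f"{BASE_URL}/{feed_id}",
        headers={"Accept": "application/json"},  # assumption: JSON responses
        method="GET",
    )


def fetch_feed(feed_id: str, timeout: float = 10.0):
    """Send the request and decode the body as JSON."""
    with urlopen(build_request(feed_id), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_feed("my-feed")` would return the Singleton object or the latest page of records, depending on the feed's model.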

1.1 Stream Time Window (Native Range)

For stream feeds, you can query by event time window using inclusive bounds on event_timestamp.

```bash
GET https://api.pipeagent.dev/api/v1/feed/{feed_id}?start_time=2026-03-24T00:00:00Z&end_time=2026-03-24T23:59:59Z
```

start_time: inclusive lower bound (event_timestamp >= start_time)

end_time: inclusive upper bound (event_timestamp <= end_time)

Both parameters must be valid ISO-8601 datetime strings.

You can combine them with cursor and limit.
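Building the window from `datetime` objects avoids malformed timestamps. The sketch below assumes UTC timestamps with a trailing `Z`, as in the example request; the helper names are hypothetical, and the value format of `cursor` is not specified on this page, so it is passed through unchanged.

```python
from datetime import datetime, timezone
from typing import Optional
from urllib.parse import urlencode

# Base URL as it appears in the request examples on this page.
BASE_URL = "https://api.pipeagent.dev/api/v1/feed"


def to_iso_z(moment: datetime) -> str:
    """Format a timezone-aware datetime as UTC ISO-8601 with a trailing Z."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def time_window_url(
    feed_id: str,
    start: datetime,
    end: datetime,
    limit: Optional[int] = None,
    cursor: Optional[str] = None,
) -> str:
    """Build a stream query with inclusive start_time and end_time bounds."""
    if start.tzinfo is None or end.tzinfo is None:
        raise ValueError("start and end must be timezone-aware")
    if start > end:
        raise ValueError("start must not be after end")
    params = {"start_time": to_iso_z(start), "end_time": to_iso_z(end)}
    if cursor is not None:
        params["cursor"] = cursor
    if limit is not None:
        params["limit"] = str(limit)
    # Keep ":" unescaped so the URL matches the documented form.
    return f"{BASE_URL}/{feed_id}?{urlencode(params, safe=':')}"


day_start = datetime(2026, 3, 24, 0, 0, 0, tzinfo=timezone.utc)
day_end = datetime(2026, 3, 24, 23, 59, 59, tzinfo=timezone.utc)
print(time_window_url("my-feed", day_start, day_end))
```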

2. Advanced JSONPath Projection

Slice and project JSON on the server to protect your Agent's context window.

```bash
# Array Slicing (Get top 5)
GET https://api.pipeagent.dev/api/v1/feed/{feed_id}?jsonpath=$[0:5]

# Deep Field Extraction
GET https://api.pipeagent.dev/api/v1/feed/{feed_id}?jsonpath=$[*].metadata.tags
```
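To see what these two paths select, the sketch below applies the equivalent plain-Python operations to made-up sample records. It mirrors standard JSONPath semantics only; the sample data is invented, and the exact response envelope returned by the API is not shown on this page.

```python
# Invented sample records standing in for a feed's stored list.
records = [
    {"id": i, "metadata": {"tags": [f"tag-{i}"]}}
    for i in range(8)
]

# $[0:5] selects the first five elements (index 0 up to, not including, 5).
top_five = records[0:5]

# $[*].metadata.tags selects the tags field of every element.
all_tags = [record["metadata"]["tags"] for record in records]

print(len(top_five))   # 5
print(all_tags[:2])    # [['tag-0'], ['tag-1']]
```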

3. Details Mode

Retrieve full payloads for specific record IDs identified during browsing.

```bash
GET https://api.pipeagent.dev/api/v1/feed/{feed_id}?ids=id1,id2
```
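Since the 50-record cap also applies to ids-mode responses, a long ID list is best split into several requests. A minimal sketch, assuming the IDs are URL-safe strings; `details_urls` is a hypothetical helper name.

```python
from typing import List, Sequence

# Base URL as it appears in the request examples on this page.
BASE_URL = "https://api.pipeagent.dev/api/v1/feed"
MAX_LIMIT = 50  # maximum records returned per request


def details_urls(feed_id: str, ids: Sequence[str], batch_size: int = MAX_LIMIT) -> List[str]:
    """Split ids into batches and build one details URL per batch."""
    urls = []
    for start in range(0, len(ids), batch_size):
        chunk = ids[start:start + batch_size]
        urls.append(f"{BASE_URL}/{feed_id}?ids={','.join(chunk)}")
    return urls


print(details_urls("my-feed", ["id1", "id2"]))
# ['https://api.pipeagent.dev/api/v1/feed/my-feed?ids=id1,id2']
```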

Performance Tips

Use Collection for any list over 20 items.

Use Stream for data that grows over time.

Always use JSONPath to keep irrelevant fields out of your LLM's context window.

Version 1.0.4 - Premium Infrastructure
