Architecture

The iPulse architecture starts with classified data before AI

iPulse forecasting is built on a subject taxonomy, sector and record classification, historic data management, and traceable AI configuration. The model response is the last visible layer, not the foundation.

Updated July 3, 2026

Visual Map

From subject taxonomy to auditable forecast

This is the high-level flow: iPulse first classifies the subject and record universe, then builds a versioned historical data layer, then assembles AI configurations, and only then produces forecasts that can be inspected later through lineage and Config Details.

Enterprise architecture

How classification, data, AI controls, runtime, and product surfaces connect

01 Subject Taxonomy

Subject

Asset or broader object of analysis.

Sector

Financial Markets, Real Estate, Macroeconomy, Politics, Environment, Health, Sports.

Scope

Category, tier, region, industry, and subject ID.

02 ETL Layer

Data Sources

Partners, exchanges, vendor feeds, public records, and internal pipelines.

Processing

Raw records are prepared before they become platform data.

IngestionQATransformationPreprocessingCleaning

03 Data Platform

Core Information Buckets

Separated stores keep each information class governable, cacheable, and auditable.

Historic DataAnalytics DataFeature DataPrediction DataSimulation Data

Control Records

The control plane explains how information is defined, monitored, related, and changed.

Schema RegistriesReference DataMonitoring & Logs

04 AI Control + Runtime

AI Agent Profile

Persona, archetype, specialization heads, model/backend, and style.

Task Config

Mode, horizon, input format, output schema, and assembly rules.

Prompt Assembly

System layer plus context sandwich.

Generation

Model output, parsing, and validation.

05 Product + Trust

Prediction Batch

Timestamped swarm forecast package.

Asset Page

Reports, charts, ratings, scenarios, and deep analysis.

Config Details

Gated configuration and lineage inspection.

Time Travel

Previous batches remain inspectable.

Overview

A data-first architecture, not a chatbot wrapper

The foundation of iPulse is correct classification of the thing being analyzed, the records attached to it, the ETL process that prepares those records, and the platform data bucket that stores them. AI only becomes useful after the platform knows what the subject is, which data bucket it belongs to, which records are available, and which configurations are allowed to act on it.

Data foundation to forecast output

The iPulse prediction assembly line

AI appears late in the process. The first job is to classify the subject, data, records, and lineage correctly.

01

Subject ontology

Asset or broader subject, sector, category, tier, country, region, industry, and eligibility scope.

02

ETL layer

Ingestion, quality assurance, transformation, preprocessing, cleaning, normalization, and validation.

03

Platform buckets

Historic, analytics, feature, prediction, and simulation buckets, with governance and observability tracked as separate control records.

04

AI configuration

AI Agent profile with model assignment, task config with mode, prompt assembly, input format, output schema, and horizon.

05

Forecast package

Ratings, thesis fields, scenarios, events, risk notes, forecast dates, and structured timeseries outputs.

06

Transparency layer

Config Details, prediction time travel, lineage inspection, performance review, advisor comparison, and future configuration retirement.

Finance zoom

OHLCVA market recordsFundamentalsCorporate and macro eventsNewsfeed and sentimentKnowledge and alternative dataGlobal contextChangelogs

Traceable output

Every forecast should be tied back to the subject, data version, AI Agent profile, model/backend assignment, task config, mode, prompt components, output schema, generation batch, and horizon.

This matters because a public equity, a currency pair, a macro indicator, a weather event, and a sports team are not the same forecasting object. They can all be subjects in a broader intelligence system, but each one needs different identifiers, data records, eligibility rules, and output formats.

Layer 1

Subject classification

The platform identifies what is being forecast: an asset today, or a broader subject such as a country, region, event, organization, or future non-financial object.

Layer 2

ETL and bucket classification

ETL handles ingestion, QA, transformation, preprocessing, and cleaning. The data platform then separates core information buckets such as historic, analytics, feature, prediction, and simulation data, while control records cover schemas, registers, governance, and observability.

Layer 3

AI configuration

Only after subject and data layers are stable does iPulse select AI Agent profiles, model assignments, task configs, modes, prompt assembly components, and output schemas.

Public asset pages mostly expose financial assets today, but the architecture is intentionally broader than listed stocks. The current public product is a focused first use case of a more general forecasting system.
iPulse is heavily schema-registry driven. Across the Data Platform and Application layers, the system currently maintains 60+ active schemas and tables so records, task configs, output formats, lineage fields, and product surfaces can be validated against explicit contracts instead of informal assumptions.

Asset And Subject

An asset is the financial version of a broader subject

In user-facing finance pages, iPulse usually says asset because the product is centered on equities, crypto, commodities, indices, funds, foreign exchange, and related market instruments. Internally, the broader word is subject. A subject is the general object of analysis; an asset is one important financial kind of subject.

That distinction lets iPulse expand without rebuilding the core ontology. A company stock can be a subject, but so can a country, city, industry, weather system, political event, economic indicator, sports team, or other forecastable entity once the corresponding records and permissions are supported.

Ontology map

How iPulse decides what a forecast is about

Subject

Asset today, broader subject tomorrow: company, country, city, event, weather system, team, or other forecastable object.

Classification

Pulse sector, subject category, detailed category, tier, industry, region, country, and record type.

Eligibility

Scoping fields decide which data, agents, prompt components, models, and output schemas can apply.

Pulse sector

The top-level domain. The public taxonomy is designed around Financial Markets, Real Estate, Macroeconomy, Politics, Environment, Health, and Sports.

Subject category

The type of subject inside the domain. In finance, this includes equity, fixed income, commodity, crypto, cash, forex, fund, derivative, real asset, real estate, and index.

Subject tier

A prioritization layer that separates showcase, major, standard, extended, wide-range, far-reach, and long-tail subjects for coverage depth and pipeline scheduling.

Scoping fields

Eligibility filters such as sector, record category, subject category, industry, region, country, market cap, valuation, growth, and subject ID determine which components can apply.

Platform Scope

Finance is the priority, but the ontology is wider

The active public focus is finance and macroeconomy because those domains are commercially useful and already have dense historical records. The underlying sector taxonomy is broader: Financial Markets, Real Estate, Macroeconomy, Politics, Environment, Health, and Sports are all part of the long-range subject map.

Macroeconomy is the system term for economic and state-of-the-world context. It can include indicators such as inflation, employment, income, demographics, social conditions, geopolitical events, tariffs, unrest, or policy shocks. Politics can also be treated as its own subject sector when the forecast target is a policy, election, government relationship, or political risk question rather than general macro context.

Environment covers location and phenomena such as weather, climate, pollution, and physical conditions. Future extensions can add deeper support for real estate, health, sports, and other domains where the same discipline applies: define the subject, classify records, preserve lineage, then forecast.

Finance And Economics

Financial subjects are split into distinct data buckets

Inside the current financial priority area, iPulse does not treat all data as one blob. Financial forecasts depend on multiple categories that behave differently, update on different schedules, and carry different reliability profiles.

A helpful way to read this is: the subject is what we forecast, while the record type is the kind of evidence we attach to that subject. Market records tell us how price behaved. Fundamentals tell us how the business or instrument is performing. News, sentiment, and events explain what changed in the world around it. Governance and observability records explain whether the data can be trusted and later audited.

Finance data architecture

Financial subjects are surrounded by evidence record families

Forecast Subject

Subject / Asset

The company, instrument, index, currency pair, commodity, fund, real asset, or other financial object being analyzed.

Historic Bucket

Accepted and versioned records available to AI tasks for that subject at a specific generation time.

Plain-English Rule

The asset is the thing. The record type is the evidence.

Apple stock, Bitcoin, crude oil, and a currency pair are subjects. Prices, fundamentals, news, sentiment, events, reference tables, schemas, changelogs, and logs are record families attached to those subjects. iPulse classifies both before an AI Agent receives context.

Fact + time-series records

Structured evidence about what happened, when, and at what measured value.

MarketFundamentalIndicatorEventTransactionAlternative

Text + signal records

Unstructured or semi-structured material, plus signals derived from public language and specialist knowledge.

NewsfeedSentimentKnowledgeScientificCode

Governance + support records

The control data that keeps assets, datasets, models, schemas, and relationships understandable.

ReferenceSpecificationsSchemasCatalogMetadataXREF

Operations + audit records

The monitoring and accountability trail that explains changes, processing health, and platform activity.

ChangelogsLogsBillingMultipleOther

Primary category

Where the data sits in the platform lifecycle.

HistoricLiveFeaturePredictionSimulationGovernanceObservability

Structure + modality

How the record is shaped for storage and model use.

StructuredSemi-structuredUnstructuredTextTabularJSONGraphGeospatial

Lineage

Where a dataset came from and whether it is raw, copied, derived, or generated.

Primary supplierSource of truthPackaged copyAnalytics derivativeFeature derivativePrediction generated

Scope

Which slice of the dataset is valid for a task or review.

Full datasetLatest recordIncrementalFilteredTrainingValidationHoldout
This taxonomy mirrors the shared-base architecture: SectorRecordsCategory classifies the evidence family; DataPrimaryCategory separates core lifecycle categories and control-plane categories; DataModality and structure fields explain the data shape; DatasetLineage and DatasetScope preserve provenance and task validity.

Record Taxonomy

Record categories turn raw market information into teachable evidence

For a non-technical user, record categories are like labeled folders. Price history goes in one folder, company financials in another, news and social signals in another, and audit records in another. That makes it easier to explain what an AI Agent was allowed to see and what kind of evidence shaped the answer.

For finance and AI practitioners, the same taxonomy is an engineering control. A market time series, a scraped article, a provider sentiment score, an earnings transcript, a schema definition, and a changelog cannot be validated, cached, sampled, summarized, or injected into prompts the same way. They need different freshness checks, transformation logic, lineage labels, and output contracts.

Market and transaction records

OHLCVA, quotes, trades, volume, adjusted prices, returns, market microstructure fields, and raw transaction-style logs. These answer: what did the market actually do?

Fundamental and indicator records

Financial statements, earnings, margins, cash flow, balance sheet items, shares outstanding, macro indicators, rates, inflation, employment, GDP, and state-of-the-world measures.

Events, newsfeed, and sentiment

Corporate actions, dividends, splits, earnings releases, tariffs, geopolitical shocks, news, social posts, analysis, podcasts, and derived sentiment signals.

Knowledge, scientific, and alternative data

Knowledge graphs, taxonomies, equations, technical research, satellite or shipping data, and other specialist evidence that may become useful for broader forecasting domains.

Reference, schema, and catalog records

Static identifiers, calendars, specifications, schema definitions, catalog entries, metadata, and cross-reference tables. These explain what a record means and how it connects to other records.

Changelogs, logs, and observability

Monitoring data, processing logs, data-change records, and later audit evidence. These make the platform inspectable when a user asks which data version an AI forecast used.

This is one of the fastest E-E-A-T improvements for a YMYL product: turn internal taxonomy into public education. The more clearly iPulse teaches users the difference between asset type, evidence type, data lineage, model configuration, and forecast output, the less the product looks like a black-box prediction page and the more it reads like a transparent market-intelligence curriculum.

Historic Layer

Daily ingestion builds a versioned historic record

iPulse pulls data from data partners, exchanges, and internal processing pipelines on scheduled cycles. The docs intentionally do not name individual suppliers, because provider mix can change and some relationships are operational rather than public brand claims.

Incoming data is screened for freshness, completeness, schema fit, unusual gaps, duplicate records, and basic health signals inside the ETL layer. Ingestion, QA, transformation, preprocessing, cleaning, normalization, and validation happen before records are promoted into durable data-platform buckets.

The data architecture separates core information buckets such as Historic Data, Analytics Data, Feature Data, Prediction Data, and Simulation Data. Control records then describe the operating layer around those buckets: schema registries, registers, reference data, monitoring, logs, changelogs, and later audit evidence. It also tracks lineage, such as primary supplier, secondary supplier, source of truth, exact copy, packaged copy, analytics derivative, feature derivative, and prediction-generated datasets.

Data changes are not treated casually. Changelog-style records allow future inspection of what changed, when it changed, and which data version was available to a forecast batch. That is essential for later performance review and for explaining why a past AI forecast saw the world the way it did.

Operating Lessons

The data architecture changed many times as iPulse grew

Over the last three years, iPulse has redesigned major parts of its architecture from the ground up multiple times. As the product grew from simpler market-data and forecast surfaces into a broader AI intelligence platform, the architecture had to keep changing to support the ambition: more asset categories, more data families, deeper AI Agent configuration, better lineage, stronger serving paths, and clearer user-facing transparency.

Those redesigns were not cosmetic. They came from live data and product complexity. Market data arrives with corporate actions, revisions, stale adjusted values, provider gaps, duplicate records, excessive precision, and occasional impossible values. The platform therefore treats data preparation and auditability as part of the methodology, not as background plumbing.

Raw records and adjustments are separated

Historic prices are easier to audit when raw market values and corporate-action adjustment logic remain distinct. That lets adjusted views be recalculated when later splits or dividends change the interpretation of old prices.

QA uses context-aware thresholds

A fixed percentage spike rule is too crude across equities, crypto, forex, and commodities. iPulse screens records with volatility-aware and schema-aware checks so unusual values are investigated in context.

Prediction records inherit the same lesson

Forecast outputs are designed around step-over-step percentage changes and anchor metadata, not only absolute future prices. This makes forecasts more resilient when the subject changes through splits, dividends, or other structural events.

Changelogs are part of the record

When governed records are fixed, backfilled, or repackaged, the goal is to preserve the reason and timing of the change so future forecast reviews can understand which information state was used.

Prediction Layer

AI starts after the subject and history are prepared

Once the subject and historic context are ready, the prediction layer assembles the AI side: an AI Agent profile, a task configuration, prompt assembly, input data, output schema, horizon, and validation rules. The model belongs to the AI Agent profile, while the execution mode belongs to the task configuration.

AI Agents are the public label for what product surfaces may also call advisors. Their personas and archetypes are deliberately selected for analytical diversity, then paired with model/backend choices, specialization heads, vocabulary, and asset-class fit. This is how iPulse separates value ownership, macro cycles, political power mapping, forensic skepticism, technical and quantitative thinking, technology futurism, scientific reasoning, and other specialized lenses.

In implementation terms, the AI Agent is not only the famous-name persona. It is the governed combination of persona, archetype, model/backend assignment, mode, asset-class reasoning head, subject-fit constraints, task rules, and output schema. Those combinations already create 100+ AI Agents, and the inventory can expand as new frontier LLM models and additional specialized heads are released, tested, and approved.

The architecture also separates psychological and communication dimensions. Cognitive style, mood, tone, lexical register, thinking horizon, asset-class specialization, subject-fit constraints, and model selection can all influence how an AI Agent reasons and explains a forecast.

Prediction runtime

AI controls converge into one forecast package

Configured Inputs

01 AI Agent

Persona, archetype, specialization, and model

The agent is the analytical identity: character lens, cognitive profile, specialist heads, language style, and model/backend.

PersonaArchetypeSpecialization headsVocabularyModel/backend

02 Task Config

Mode and assignment rules

The task config decides how that agent is deployed for this run: mode, horizon, subject scope, prompt recipe, and output contract.

ModeHorizonSubject scopeInput formatOutput schema

03 Input Data

Prepared context

Versioned prices, fundamentals, events, global context, and task-specific data are attached after classification.

Market dataFundamentalsEventsGlobal contextLineage

04 Prompt Assembly

Layered context sandwich

Agent identity, task rules, subject context, global context, data excerpts, and output instructions are assembled in a sandwich structure.

System layerAgent layerTask layerData layerOutput rules

05 Generation

Configured execution

The configured agent/model runs the task, then the response is parsed, checked, normalized, and prepared for display.

Reasoning passParsingValidationNormalization

06 Forecast

Structured output package

The final package contains ratings, thesis fields, risk notes, scenarios, timeseries, and traceable configuration references.

RatingsThesisRisksScenariosTimeseries

AI Agent

The named analytical framework: persona, archetype, cognitive style, specialization heads, communication profile, subject fit, and model/backend assignment.

Task config

The reusable forecast assignment that contains mode, horizon, subject scope, input format, prompt assembly recipe, output instructions, and schema.

Mode

The execution capability inside the task config, such as Thinker for reasoning over supplied context or Researcher for targeted web-enabled investigation.

Model

The AI or statistical backend assigned within the AI Agent profile. It is visible as configuration lineage so model changes can be compared cleanly over time.

Lineage

Every forecast should be inspectable later

A forecast is only useful if iPulse can later explain the configuration behind it. Lineage links the subject, sector, record categories, data version, AI Agent profile, model/backend assignment, task config, mode, prompt assembly components, input format, output schema, generation batch, and forecast horizon.

This is why the Config Details modal exists inside the product. Eligible signed-in users can inspect the configuration used for a specific prediction without exposing proprietary full prompt text or private operational records.

The transparency goal is not to reveal every private implementation detail. It is to make the forecast auditable enough that users can understand what kind of system produced it, what information category it used, and how future versions can be compared against past versions.

Accountability

Prediction Time Travel is our accountability moat

iPulse is designed so past forecasts do not disappear into a black box after the next batch runs. On individual asset pages, eligible users can use the prediction batch selector to open prior forecast batches produced by the AI Agent swarm for that asset.

When a previous batch is selected, the experience is not reduced to a static archive line. The user can inspect the historical advisor reports, visual forecast outputs, deep analysis, ratings, thesis fields, scenarios, risk notes, and batch-level context as they existed for that run.

This creates a practical accountability layer: users can compare what iPulse said at a specific point in time against what happened later, while also inspecting the configuration lineage behind that forecast where their subscription and asset access allow it.

Audit architecture

Prediction Time Travel keeps old forecasts alive

01

Batch archive

Each swarm run is stored as a timestamped prediction batch.

02

Asset page selector

Users select current or past batches from the asset page.

03

Full historical view

Reports, visuals, deep analysis, ratings, scenarios, and risks remain inspectable.

04

Config Details

Eligible users inspect lineage and configuration for that exact run.

The goal is durable auditability. A new forecast does not erase the old one; it becomes another timestamped prediction batch that can be revisited, compared, and learned from.