Hearth

Overview

Hearth is a personal home-management platform built on one observation: every house carries thousands of small facts, and almost none of them are written down anywhere the homeowner can find when it counts. What year was the furnace installed? When was the roof last replaced? What is the radon zone, and where is the main water shutoff? Hearth gathers those answers in one place, takes in new information through photo and document uploads, and works out what the home needs and when.

Behind the dashboard, Hearth runs a multi-pipeline AI architecture on a tightly RLS-scoped Supabase backend. One vision pipeline reads appliance nameplates, service receipts, and identifying documents and pulls out structured data. A separate reasoning-model pipeline decodes manufacture dates from serial numbers when no install date is on file. A streaming “Research this model” call produces grounded summaries of an appliance's service life, maintenance needs, and known issues. Finally, a maintenance-synthesis workflow combines those summaries with the home's environmental context to emit a real, scheduled plan in which every task carries its own reasoning.

The habitat surface is where Hearth turns public records into something a homeowner can act on. Working from the home's coordinates, it queries EPA radon zones, FEMA flood maps, EPA Superfund proximity, and EPA drinking-water compliance and lead/copper sample data, then writes the results up in plain language. Each finding shows its severity, the actions it recommends, and an activity log of exactly how it was computed. Hearth even gives its methodology its own page in the app, on the principle that a homeowner should be able to check the work.

Hearth is in active development and currently deployed for real-world use. A public beta and a premium tier are on the near-term roadmap.

Dashboard

The home view: an AI-generated architectural sketch of the property, the home's facts pulled from public records, an emergency video reference panel, and a live habitat findings preview. Everything on this page is rendered from real data, updated in real time via Supabase Realtime as background workflows complete.

Onboarding discovery

The first-run experience narrates what Hearth is doing as it does it: looking up public records, then checking radon, Superfund proximity, flood zones, and water quality. Showing that work as it happens builds more trust than a spinner ever could.

Smart Uploader

A photo of an appliance nameplate becomes a structured inventory record. A Claude Haiku vision pipeline reads the manufacturer, model, and serial off the label, pulls out the key facts, and routes the result through a review step before saving. The same uploader also handles multi-page service receipts.

Inventory detail

Each appliance gets its own page with extracted facts, an on-demand “Research this model” panel that streams a grounded summary of service life and maintenance needs, and a manufactured-date tile auto-filled by a reasoning model decoding the serial number.

Habitat: Superfund proximity

Hearth queries EPA Envirofacts SEMS data by coordinates, joins it to a tiered proximity model, attaches EPA's Community Involvement Coordinator contacts, and summarizes the result in a single paragraph calibrated to neither overstate nor understate the risk. Each contaminant in the list carries a plain-language description drawn from a curated reference table.

Habitat: Water quality awareness

EPA's CWS service-area layer resolves the home's coordinates to a Public Water System ID. From there, SDWIS violation history and lead/copper sample data are pulled, parsed, and severity-classified. The page then shows the compliance state, the contaminants detected against their federal action levels, and the actions it recommends for this specific water source.

Habitat: Remediation matrix

The water module's real output is this remediation matrix, personalized from the home's actual 2024 Consumer Confidence Report. Every contaminant detected in the supply (lead, PFAS, trihalomethanes, haloacetic acids, VOCs, arsenic, and nitrate) is mapped against six filter technologies, with cell shading showing how well each one removes it and the detected rows highlighted in amber. Hearth reads that grid into a concrete recommendation: an NSF/ANSI 53 + P473 carbon block covers six of the nine detected contaminants, and an NSF/ANSI 58 reverse-osmosis stage handles the fluoride, arsenic, and nitrate a carbon block can't touch. Each tier comes with real install and annual-cost figures, and the certifications are named so every suggestion points to a product the homeowner can actually buy rather than a generic “consider a filter.”

Maintenance panel

A real maintenance plan, assembled by a reasoning-model workflow from the appliance's research summary, the home's habitat findings, and any attached service receipts. Tasks are tiered by urgency and scheduled with cadences (interval, seasonal, one-time, per-use), and every one of them carries its own reasoning.

Task detail with cross-module reasoning

Every task sits one tap away from a “why this task” explanation. Here, a dishwasher-cleaning cadence was tightened from the usual one-to-two months down to monthly because the water-quality module found hard water in the user's county, a finding that flowed into the maintenance pipeline as a habitat modifier. The card spells out the adjustment, where it came from, and the locality in plain language, so the schedule never feels arbitrary.

Selected Design Decisions

The user's water source declaration trumps EPA's map. EPA's national CWS service-area layer has documented coverage gaps, and established city addresses can sit in the holes between polygons. So when a user says during onboarding that they're on city water, Hearth runs a nearest-polygon fallback against a 500-meter buffer before it ever falls through to “we couldn't pinpoint your utility.” Trusting what the user told us over a map with known holes gives a more honest answer than quietly treating a polygon miss as a private well.

Serial-number decoding runs on a separate reasoning model. The streaming Research-this-model pipeline uses a fast non-reasoning model to keep latency down. Decoding a manufacture date from a serial number is a different kind of problem, because it demands determinism. In testing, non-reasoning models would invent a plausible-looking decoding rule on each call and then confidently apply it to that same serial. Moving the decode into a parallel call on a reasoning model, under a strict protocol of name the rule, apply it, verify internal consistency, and return null if any step fails, eliminated the hallucinated dates while leaving the rest of the pipeline just as fast.

Soft-fail at every level. Every external data source can fail, whether it's Mapbox, EPA Envirofacts, FEMA NFHL, EPA's drinking-water APIs, or the AI Gateway. Hearth is built so that when one does, the user-visible result degrades to “we don't know yet” instead of a confidently wrong answer, and unrelated features keep working. The activity log on each habitat finding records which sources answered and which didn't, so the user can see exactly what was checked.

JSONB columns earn promotion to real columns only when a query pattern demands it. The inventory and document tables both carry a `metadata` JSONB column for subtype-specific fields that don't need cross-row queries yet, things like vehicle VINs and plate states, pet microchip numbers, or a receipt's vendor and line items. When a query pattern finally needs one of those fields as a real column, say, aggregating values for an insurance valuation, that shape migrates out of JSONB into a column of its own. Until then, JSONB keeps the schema small and the path to a new feature short.

Workflow errors are sorted into terminal vs. retryable. Hearth's durable jobs run on Vercel Workflows, where a naive policy retries every failure as if it were transient. Hearth instead marks its orchestrator-internal errors as non-retryable: constraint violations, missing entities, a failed write. Those surface a deterministic bug in about two seconds rather than grinding through roughly thirty seconds of pointless retries, while genuinely transient errors like a flaky EPA call or a gateway hiccup still retry. That one distinction turns the workflow layer into a fast feedback loop during development instead of a latency tax.

Data shared across homes is ingested once and cached for everyone. Some facts aren't per-house. A water utility publishes one annual quality report, and it's identical for every home on that system. Rather than ingest and summarize it once per user, Hearth keys the report by utility and year and serves a single cached row to every house the utility covers. The same pattern fits any data shared across entities, and it saves redundant ingestion, duplicate LLM cost, and needless load on slow government APIs.

Findings go stale-but-labeled instead of eagerly regenerating. When upstream data changes, Hearth ingests it, works out which homes are affected, and writes a notification, but it stops short of regenerating any findings. The existing finding stays put, clearly labeled as stale, until the user decides to reanalyze. There's a cost asymmetry at play: regenerating findings nobody is looking at is wasted money, and the notification on its own already tells the user the system is watching, without any silent background churn.

Hearth

Tech Stack

Overview

Screenshot gallery

Dashboard

Onboarding discovery

Smart Uploader

Inventory detail

Habitat: Superfund proximity

Habitat: Water quality awareness

Habitat: Remediation matrix

Maintenance panel

Task detail with cross-module reasoning

Selected Design Decisions

AI Integrations

Vision extraction pipeline

Research this model

Serial-number decode

Maintenance synthesis workflow

Habitat data integration

Activity logs and methodology transparency

Have a project in mind?