The SmallPond Platform
SmallPond provides a configurable DNA database architecture that automatically generates investigative matches in real time.
Instead of relying on static database structures, SmallPond separates how profiles are stored from how they are compared, allowing laboratories and agencies to design systems aligned with their operational, legal, and investigative requirements.
Core Concepts
SmallPond is built around a small number of core concepts:
DNA Profile
A digital representation of genetic markers used for comparison.
Profile Pond
A logical container that defines where profiles are stored.
Matching Net
A configurable rule defining which ponds are compared and how.
Match Result
The outcome of comparisons, producing investigative leads.
1. System Configuration
Before DNA profiles are ingested, SmallPond is configured by authorized administrators.
This configuration defines how profiles are organized and how matching occurs, ensuring alignment with operational workflows, legal requirements, and agency policies.
Configuration includes:
- Defining Ponds (data organization)
- Defining Nets (matching logic)
These definitions are typically established based on:
- Investigative workflows
- Laboratory procedures
- Applicable legislation
Profile Ponds
Ponds define where DNA profiles are stored.
- Known Offenders
- Arrestees
- Forensic Evidence
- Missing Persons
- Unidentified Remains
- Elimination Databases
Matching Nets
Nets define how profiles are compared.
- Which ponds are compared
- Matching stringency and thresholds
- Cross-pond and intra-pond comparisons
Matching Nets
Matching behavior in SmallPond is defined by configurable rules called Matching Nets.
A Net specifies:
- Which ponds should be compared
- The matching stringency applied
Examples include:
| Net | Ponds Compared | Stringency |
|---|---|---|
| Investigative Lead Net | Evidence ↔ Known Offenders | High |
| Evidence Correlation Net | Evidence ↔ Evidence | Medium |
| Missing Persons Net | Missing Persons ↔ Unidentified Remains | Medium |
Nets can connect two or more ponds, enabling agencies to define matching strategies aligned with investigative objectives and legal requirements.
Why This Matters
Traditional DNA database systems tightly couple data storage and matching logic, limiting flexibility and slowing adaptation to new workflows or legal requirements.
SmallPond’s separation of Ponds and Nets allows agencies to:
- Adapt matching strategies without restructuring data
- Support multiple investigative workflows simultaneously
- Align database behavior with jurisdictional requirements
- Scale from small deployments to national systems
2. DNA Profiles Enter the System
After the system has been configured, DNA profiles can be ingested from multiple operational workflows.
Profiles may originate from:
- Rapid DNA instruments used in booking stations or field deployments
- Traditional laboratory workflows using high-volume forensic instruments
- External partner databases or collaborative networks
Regardless of source, profiles are normalized and securely ingested into the SmallPond platform.
SmallPond is instrument-agnostic, allowing agencies to integrate Rapid DNA and traditional laboratory pipelines within the same investigative environment.
3. Profiles Are Placed into Ponds
As profiles are ingested into SmallPond, they are placed into a designated Pond.
The destination pond is determined by the user or workflow responsible for importing the profiles.
This may occur through manual entry, batch import, or automated ingestion processes configured by the laboratory.
| Profile Source | Typical Pond |
|---|---|
| Rapid DNA Booking Station | Arrestees |
| Laboratory Evidence Workflow | Forensic Evidence |
| Missing Persons Program | Missing Persons |
This ensures profiles are organized according to operational procedures and governance policies.
4. Automatic Matching
Once a profile is stored in a pond, SmallPond automatically applies all Matching Nets that involve that pond.
This means:
- Matching occurs immediately upon profile ingestion
- No manual search initiation is required
- Continuous evaluation of new profiles against existing data
This enables rapid identification of investigative leads.
5. Investigative Results
Each Net may generate match results depending on the configured comparison rules and stringency.
Results may include:
- High-confidence matches
- Candidate matches
- Investigative leads for further analysis
These results are presented to investigators and analysts to support:
- Criminal investigations
- Missing persons identification
- Cross-agency collaboration
The SmallPond Model
Conceptually, the system operates as follows:
System Configuration (Ponds + Nets)
↓
Profiles Enter System
↓
Profiles Assigned to Ponds
↓
Nets Automatically Applied
↓
Investigative Matches Generated
Scalability
SmallPond deployments range from small local DNA programs to national forensic databases, all powered by the same high-performance matching engine.
| Deployment Type | Typical Scale | Example Use |
|---|---|---|
| Local Programs | 5,000 – 50,000 profiles | Rapid DNA programs, local law enforcement databases |
| Regional Programs | 100,000 – 1,000,000+ profiles | State or small national forensic DNA databases |
| National Databases | Millions of profiles | National identification systems and large investigative datasets |
Regardless of database size, SmallPond’s proprietary matching algorithms allow investigators to search very large datasets and identify potential matches in milliseconds.
SmallPond separates data organization from matching logic, enabling real-time investigative matching at any scale.