# Harvest.re - Malware Data Collection Platform

Harvest.re is a self-hosted malware data collection platform that processes over 500,000 malware files daily and maintains a total corpus of more than 100 million samples. It deploys in 5 minutes on-prem or in your cloud, giving security professionals a private, flexible environment in which to collect, store, and analyze malware samples, metadata, and threat intelligence automatically.

## What is Harvest.re?

Harvest.re is the fastest way to stand up a reliable malware data collection pipeline. Services such as VirusTotal or ReversingLabs give you access to malware data. Harvest.re instead gives you the full private data collection environment itself: flexible, under your control, and deployable in your own infrastructure.

## Key Metrics

- Processes over 500,000 malware files daily
- Total corpus exceeds 100 million malware samples
- Deploys in 5 minutes on-prem or in your cloud
- 24/7 automated collection with no manual intervention required
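
To put the headline figures in perspective, the short calculation below converts them into an average intake rate. It uses only the two numbers stated above and treats them as exact, although both are described as lower bounds ("over", "exceeds"), so the results are rough estimates rather than measured values.

```python
# Back-of-the-envelope arithmetic on the published figures.
SECONDS_PER_DAY = 24 * 60 * 60   # 86,400

files_per_day = 500_000          # "over 500,000 malware files daily"
corpus_size = 100_000_000        # "exceeds 100 million malware samples"

# Average sustained intake rate implied by the daily figure.
files_per_second = files_per_day / SECONDS_PER_DAY

# Days of intake at that rate that would add up to the stated corpus size.
days_of_intake = corpus_size / files_per_day

print(f"{files_per_second:.2f} files/second")   # 5.79 files/second
print(f"{days_of_intake:.0f} days of intake")   # 200 days of intake
```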

## How It Works

1. **Deploy Instantly** — Install in 5 minutes on-prem or in the cloud.
2. **Add Your Sources** — Connect feeds, sandboxes, or custom inputs.
3. **Collect and Analyze** — Harvest malware samples, metadata, and intelligence automatically.
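
The "add sources, then collect" flow can be illustrated with a minimal, self-contained sketch. This is not Harvest.re's API: the `Collector` and `Sample` classes, the `add_source` and `collect` methods, and the source names are all hypothetical, chosen only to show the general idea of pulling files from several inputs, deduplicating them by SHA-256, and recording basic metadata.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable, Dict, Iterable, List


@dataclass
class Sample:
    """Basic metadata recorded for each collected file (hypothetical schema)."""
    sha256: str
    size: int
    source: str
    collected_at: str


@dataclass
class Collector:
    """Hypothetical collector: maps source names to callables that yield raw bytes."""
    sources: Dict[str, Callable[[], Iterable[bytes]]] = field(default_factory=dict)
    store: Dict[str, Sample] = field(default_factory=dict)

    def add_source(self, name: str, fetch: Callable[[], Iterable[bytes]]) -> None:
        self.sources[name] = fetch

    def collect(self) -> List[Sample]:
        """Pull from every source once and return only samples not seen before."""
        new_samples = []
        for name, fetch in self.sources.items():
            for blob in fetch():
                digest = hashlib.sha256(blob).hexdigest()
                if digest in self.store:
                    continue  # already collected, possibly from another source
                sample = Sample(
                    sha256=digest,
                    size=len(blob),
                    source=name,
                    collected_at=datetime.now(timezone.utc).isoformat(),
                )
                self.store[digest] = sample
                new_samples.append(sample)
        return new_samples


# Usage with two in-memory stand-ins for a private feed and a sandbox export.
collector = Collector()
collector.add_source("private_feed", lambda: [b"sample-a", b"sample-b"])
collector.add_source("sandbox", lambda: [b"sample-b", b"sample-c"])

new_samples = collector.collect()
print(len(new_samples))  # 3: "sample-b" is stored only once
```

Keying the store on the content hash is what lets overlapping sources be combined without storing the same file twice. A real deployment would also need persistence, scheduling, and error handling, which this sketch leaves out.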

## Key Features

- **Flexible by design** — Primarily built for malware, but modular enough to harvest any type of data.
- **Control your sources** — Add private feeds, lock them down, and automate sample downloads.
- **Search with precision** — Find files by behavior, scan result, or even the IP addresses they contact.
- **Always fresh datasets** — Access prevalent malware and structured categories (mobile, clean, infected, etc.).
- **Reliable infrastructure** — 24/7 operation with secure storage and no manual downloads.
- **Real-time collection** — Continuous harvesting with automated metadata enrichment.
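
As an illustration of searching by behavior, scan result, or contacted IP, here is a small in-memory sketch. It assumes a hypothetical record layout (`behaviors`, `scan_results`, `contacted_ips`) and a hypothetical `search` helper. Harvest.re's actual query interface and field names are not described here and may differ. The IP addresses come from ranges reserved for documentation.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Optional, Set


@dataclass
class Record:
    """Hypothetical per-sample metadata used only for this example."""
    name: str
    behaviors: Set[str] = field(default_factory=set)
    scan_results: Dict[str, str] = field(default_factory=dict)  # engine -> verdict
    contacted_ips: Set[str] = field(default_factory=set)


def search(
    records: Iterable[Record],
    behavior: Optional[str] = None,
    verdict: Optional[str] = None,
    contacted_ip: Optional[str] = None,
) -> List[Record]:
    """Return records matching every criterion that was supplied."""
    hits = []
    for record in records:
        if behavior is not None and behavior not in record.behaviors:
            continue
        if verdict is not None and verdict not in record.scan_results.values():
            continue
        if contacted_ip is not None and contacted_ip not in record.contacted_ips:
            continue
        hits.append(record)
    return hits


records = [
    Record("sample-1", {"persistence", "network"}, {"engine_a": "malicious"}, {"203.0.113.7"}),
    Record("sample-2", {"network"}, {"engine_a": "clean"}, {"198.51.100.20"}),
    Record("sample-3", {"persistence"}, {"engine_a": "malicious"}, set()),
]

by_behavior = [r.name for r in search(records, behavior="network")]
by_ip = [r.name for r in search(records, contacted_ip="203.0.113.7")]
combined = [r.name for r in search(records, behavior="persistence", verdict="malicious")]

print(by_behavior)  # ['sample-1', 'sample-2']
print(by_ip)        # ['sample-1']
print(combined)     # ['sample-1', 'sample-3']
```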

## Who Uses Harvest.re?

- **Security Engineers** who need continuous datasets without manual effort
- **Security Startups** that want a managed collection backbone
- **Researchers** training ML models or LLMs on diverse malware samples
- **SOC Teams** tired of juggling sources, storage, and metadata by hand

## Why Harvest.re Over Alternatives?

| Criteria | VirusTotal / ReversingLabs | Harvest.re |
|----------|---------------------------|------------|
| What you get | Access to shared malware data | Your own private data collection environment |
| Control | Limited to their platform | Full control, deploy on your infrastructure |
| Customization | Fixed feeds and categories | Add any private feed, sandbox, or custom input |
| Scale and access | Pay per query | 500,000+ files/day, 100M+ total corpus |
| Deployment | SaaS only | On-prem or cloud, your choice |

## Why Choose Harvest.re?

- **Save Time** — No more manual downloads or dataset maintenance.
- **Scale Up** — From short hunts to enterprise-grade sourcing.
- **All-in-One** — Samples, metadata, scans, and categories in one system.
- **Affordable** — Pricing designed to fit everyone from solo analysts to full SOC teams.

## Contact

- Website: https://harvest.re
- Email: hello@harvest.re
- Free trial: https://form.typeform.com/to/M7oTufTQ
