Creating content that doesn't read like AI slop requires more than a single prompt. Paperclip is a multi-agent pipeline where specialized agents handle research, writing, review, image generation, and publishing — each with its own model, temperature, and quality criteria.
Topic Selection → Research → Writing → Review → Image Gen → Publishing
↓ ↓ ↓ ↓ ↓ ↓
[Curator] [Researcher] [Writer] [Reviewer] [Artist] [Publisher]
Temp: 0.9 Temp: 0.3 Temp: 0.7 Temp: 0.2 Temp: 0.8 Temp: 0.1
Model: GPT-4 Model: Sonnet Model: Opus Model: Haiku Model: DALL-E Model: Haiku
Selects topics based on a content calendar, trending topics, and a registry of previously covered subjects (to avoid repetition). High temperature for creativity, but constrained by:
Low temperature, high precision. Searches the web, reads documentation, finds concrete examples and data points. Outputs structured research notes with citations:
{
"topic": "FPGA-based ML inference",
"key_facts": [...],
"code_examples": [...],
"sources": [...],
"statistics": [...],
"related_topics": [...]
}
Takes research notes and produces the article. Medium temperature — creative enough to be engaging, constrained enough to be accurate. The writer follows a style guide:
Low temperature, adversarial role. Checks for:
If the review fails, the article goes back to the writer with specific feedback. Maximum 2 revision cycles before human review is triggered.
Lowest temperature — pure execution. Converts markdown to HTML, generates slugs, updates sitemap, commits to git, pushes to hosting. No creativity needed, just reliability.
It's tempting to add more agents. Resist. Every agent adds latency, cost, and failure modes. The sweet spot is 4-6 agents for content creation. More than that and you're building a bureaucracy, not a pipeline.