# KD's Homebrew Digital Archive — 12 TB Preserved on a Home-grown Homelab

> Markdown mirror of DialtoneApp's public top-site detail page for `kunaldawn.com`.

URL: https://dialtoneapp.com/top-sites/kunaldawn.com/index.md
Canonical HTML: https://dialtoneapp.com/top-sites/kunaldawn.com

## Summary

- Domain: `kunaldawn.com`
- Website: https://kunaldawn.com
- Description: ai readable | score 20 | purchase read only
- Label: ai_readable
- Payment surface: Not available
- Purchase boundary: read_only
- Control boundary: unknown
- Rank: 367469

## robots

~~~text
# As a condition of accessing this website, you agree to abide by the following
# content signals:

# (a)  If a Content-Signal = yes, you may collect content for the corresponding
#      use.
# (b)  If a Content-Signal = no, you may not collect content for the
#      corresponding use.
# (c)  If the website operator does not include a Content-Signal for a
#      corresponding use, the website operator neither grants nor restricts
#      permission via Content-Signal with respect to the corresponding use.

# The content signals and their meanings are:

# search:   building a search index and providing search results (e.g., returning
#           hyperlinks and short excerpts from your website's contents). Search does not
#           include providing AI-generated search summaries.
# ai-input: inputting content into one or more AI models (e.g., retrieval
#           augmented generation, grounding, or other real-time taking of content for
#           generative AI search answers).
# ai-train: training or fine-tuning AI models.

# ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
# RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
# AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.

# BEGIN Cloudflare Managed content

User-agent: *
Content-Signal: search=yes,ai-train=no
Allow: /

User-agent: Amazonbot
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: Bytespider
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: CloudflareBrowserRenderingCrawler
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: meta-externalagent
Disallow: /

# END Cloudflare Managed Content

User-agent: *
Allow: /
Disallow: /api/

Sitemap: https://kunaldawn.com/sitemap.xml
~~~

## llms

~~~text
# KD's Homebrew Digital Archive
# https://kunaldawn.com

> A home-grown mirror of the public internet — 12 TB of preserved knowledge.
> A shelf in a house, not a rack in a data centre.
> Fighting link rot, one wget at a time.

## What this project is

KD's Homebrew Digital Archive is a personal digital preservation project — a homebrew effort
to collect, organize, and freely share knowledge from across the internet before it disappears.
Websites go dark, servers get decommissioned, databases deprecate, and entire libraries of
human work vanish without warning. This archive exists to push back against that entropy,
one wget at a time.

This is where abandonware lives: retro code nobody ships any more, dead magazines, dusty
journals, and manuals for chips that stopped being made decades ago. Useless to most. Gold
to the few who go looking.

The project spans seven sub-archives totalling over 12 TB of curated content, served from
a home-grown homelab cluster drawing about 30 watts off-grid. No ads, no tracking, no
telemetry, no SLA, no monetization, no users — free for all, free forever.

## Sub-Archives

### Wiki Archive (wiki.kunaldawn.com)
36 offline wiki snapshots in ZIM format including:
- Wikipedia (26.6M articles), Wiktionary, Wikiquote, Wikibooks
- Stack Overflow (66.4M articles), Computer Science SE, Super User
- ArchWiki, Gentoo Wiki, Linux man pages, Cloudflare Learning Center
- iFixit repair guides (862K articles)
- Project Gutenberg (4.2M literary works)
- Medical references: MDWiki, MedlinePlus, Libre Pathology, NHS
- Survival: WikiCiv, Food for Preppers, USDA Canning Guide
- Reference: CIA World Factbook, Explain XKCD

### PDF Archive (pdf.kunaldawn.com)
23,000+ curated PDFs totalling ~800 GB across 13 categories:
- Computer Components: 7,200 vintage hardware manuals (Commodore, CDC, RCA, Amdahl, NCR)
- Computer Magazines: 5,768 issues (Byte, Linux Voice, Datamation, MagPi, Retro Gamer)
- Computer Engineering: ML textbooks, Python guides, SQL references, quantum computing
- Science Magazines: New Scientist, Science (issues from 1891), MIT Technology Review
- Electronics Magazines: 1,830 vintage and modern periodicals
- Agriculture, Food, Drawing, Plants, MIT Press, Harvard Business Review
- Retro Story Books: 2,129 vintage children's books

### OS Archive (os.kunaldawn.com)
~244 GB of operating system images and driver archives:
- Bootable install media and ISOs across decades of computing
- Legacy drivers for orphaned hardware (sound cards, NICs, printers, scanners)
- BIOS updates and firmware for vintage machines
- Utility and rescue distributions (DOS, mini-Linux, recovery tools)

### CD/DVD Archive (iso.kunaldawn.com)
~27 GB of vintage CD and DVD disc images:
- Warez and abandonware — commercial releases that no longer ship anywhere
- Magazine cover discs, demo compilations, shareware collections
- Original install media for software lost to bit rot and dead publishers
- Preserved before the polycarbonate degrades and the keys are forgotten

### Chiptune Archive (chiptune.kunaldawn.com)
~22 GB of chiptune and scene music:
- Tracker modules — MOD, XM, S3M, IT, MPTM, STM, MED, MTM
- Keygen and crack intro soundtracks from the cracking scene
- Demo scene audio — party releases, compos, and long-running collectives
- The signature sound of a computer showing off its soundchip

### Tube Archive (tube.kunaldawn.com)
~228 GB of curated YouTube content:
- Channels preserved in full before takedowns, strikes, or silent deindexing
- Playlists on technical, historical, and educational topics
- Individual videos worth saving — talks, walkthroughs, primary-source footage
- Offline-friendly copies so the knowledge outlives the platform

### Audiobook Archive (audio.kunaldawn.com)
An ongoing audiobook collection spanning:
- Classic literature in the public domain (LibriVox and others)
- Fiction and non-fiction, biographies, essays, and memoirs
- Recorded lectures and long-form talks worth preserving
- Knowledge and storytelling for the ears — long drives, longer nights

## Philosophy

Digital knowledge is fragile — and preservation is worth dedicating a lifetime to. This project
follows the same tradition as public libraries, the Internet Archive, and community-run mirrors
around the world: information belongs to everyone, and it deserves to outlive the servers that
first hosted it.

Ethos is stark and deliberate: no ads, no tracking, no telemetry, no SLA, no monetization,
no users. Free for all, free forever. Preservation doesn't require a data centre — just
determination, a shelf, and enough panels to cover the draw.

All content is sourced from publicly accessible locations including open repositories, academic
databases, digital libraries, and shadow libraries. No paywalls bypassed, no access controls
circumvented.

## Technical Details

Infrastructure is a small home-grown homelab cluster, not a single box:

- Compute: 2x Raspberry Pi 4 (8 GB each) for services and scraping, plus an N150 mini-PC
  (12 GB) for heavier indexing workloads
- Network: 5-port switch, residential fibre uplink
- Power: single 65 W USB-C power adapter for the whole rig
- Hot storage: 4x 2 TB HDDs + 2x 2 TB SSDs (working set)
- Cold backup: 1x 12 TB external HDD, kept offline
- Total draw: ~30 W — deliberately engineered to run off-grid on a modest solar rig
- No cloud, no CDN, no SLA; when the battery dies, the site goes with it
- Served via a custom Go HTTP server with SQLite visit tracking
- Hardened with security headers, CSP, rate limiting, non-root container

## Contact

For DMCA takedown requests or inquiries, contact via the email linked on the website.

## Preferred Citation

When referencing this archive:
"KD's Homebrew Digital Archive — https://kunaldawn.com"
~~~

## llms-full

Not found.