Richie 1d1bafbd30 feat(haproxy-logs): ingest HAProxy request logs into Richie DB

Add a pipeline to load HAProxy `option httplog` lines into the Richie
database so bot/crawler traffic can be analyzed.

- model: HaproxyRequest mirroring the httplog format, with a unique
  line_hash dedup key and indexes on common filter columns
- migration: create the haproxy_request table (unique line_hash + indexes)
- haproxy_logs package:
  - parser: httplog line -> columns, strips the journald prefix and
    hashes the normalized line
  - ingest: batched, idempotent insert that skips rows whose line_hash
    already exists, so re-ingesting the same logs is a no-op
  - cli: ingest-only `haproxy-logs` command reading stdin or a file
- tests: parsing of a real GPTBot line and idempotent re-ingestion
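
The dedup behaviour described above (strip the journald prefix, hash the normalized line, skip rows whose line_hash already exists) can be sketched roughly as follows. This is a minimal illustration and not the code in this commit: it uses the standard-library `sqlite3` module instead of the Richie database layer, stores only the raw line rather than the parsed columns, and the prefix regex, the SHA-256 choice, the `raw_line` column, and the function names are all assumptions made for the example.

```python
import hashlib
import re
import sqlite3
from typing import Iterable, Iterator

# Assumed shape of the journald/syslog prefix, e.g.
# "Jun 23 21:13:20 host haproxy[1234]: ". The real parser may differ.
_PREFIX = re.compile(r"^.*?\bhaproxy\[\d+\]:\s+")


def normalize(raw: str) -> str:
    """Trim whitespace and drop the (assumed) journald prefix, if present."""
    return _PREFIX.sub("", raw.strip(), count=1)


def line_hash(raw: str) -> str:
    """Hash the normalized line; SHA-256 is an assumption for this sketch."""
    return hashlib.sha256(normalize(raw).encode("utf-8")).hexdigest()


def batched(items: Iterable[str], size: int) -> Iterator[list[str]]:
    """Yield lists of at most `size` items."""
    batch: list[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch


def ingest(conn: sqlite3.Connection, lines: Iterable[str], batch_size: int = 500) -> int:
    """Insert lines whose hash is not stored yet; return the number inserted."""
    # Hypothetical, simplified table: only the dedup key and the raw line.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS haproxy_request ("
        "line_hash TEXT NOT NULL UNIQUE, raw_line TEXT NOT NULL)"
    )
    inserted = 0
    non_empty = (line for line in lines if line.strip())
    for batch in batched(non_empty, batch_size):
        # A dict also collapses duplicates that occur within the same batch.
        rows = {line_hash(line): normalize(line) for line in batch}
        placeholders = ",".join("?" * len(rows))
        existing = {
            h
            for (h,) in conn.execute(
                f"SELECT line_hash FROM haproxy_request WHERE line_hash IN ({placeholders})",
                list(rows),
            )
        }
        new_rows = [(h, text) for h, text in rows.items() if h not in existing]
        conn.executemany(
            "INSERT INTO haproxy_request (line_hash, raw_line) VALUES (?, ?)",
            new_rows,
        )
        inserted += len(new_rows)
    conn.commit()
    return inserted
```

Because the hash is taken after normalization, the same request hashes identically with or without the prefix, and feeding the same input to `ingest` a second time inserts nothing.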
2026-06-23 21:13:20 -04:00
Readme MIT 12 MiB
Languages
Python 74.2%
Nix 18.3%
HTML 6.6%
Shell 0.4%
CSS 0.3%
Other 0.1%