Commit Graph

113 Commits

Author SHA1 Message Date
26105b7daa updated BenchmarkConfig to have from_toml 2026-04-10 21:55:18 -04:00
0d81f2d17b setup FinetuneConfig 2026-04-10 21:40:17 -04:00
1409e9c63e deleted train.sh 2026-04-10 20:58:26 -04:00
259e952afc added containers dir 2026-04-10 20:48:24 -04:00
4a10a80ba0 conveted to summarization_prompts 2026-04-10 18:57:21 -04:00
03208a1ab2 moved renamed container.py to vllm_container.py 2026-04-10 13:16:18 -04:00
721526022b created working finetuing pipeline 2026-04-10 12:56:57 -04:00
88dae310b6 added summarization_prompts.py to sore the prompts 2026-04-10 12:40:36 -04:00
24f0e8693a added tools dir for on off scripts i used 2026-04-10 12:37:14 -04:00
ced78fe516 added batch_bill_summarizer.py
batch bill  summarizer sends a batch api call to gpt
2026-04-10 12:36:39 -04:00
251da6c14a added bill_token_compression.py
tested on sample size of 100 bills matching the distribution of our data
Compression saves ~11.5% on prompt tokens; completion/reasoning are roughly equal across the two sets.
prompt	completion	reasoning	total
compressed	349,460	157,110	112,128	506,570
uncompressed	394,948	154,710	110,080	549,658
delta	−45,488	+2,400	+2,048	−43,088
2026-04-09 18:41:13 -04:00
d17c883476 created main prompt bench 2026-04-08 09:08:25 -04:00
9c8013d69d creating prompt_bench downloader 2026-04-07 19:15:43 -04:00
261ff139f7 removed ds table from richie DB 2026-03-29 15:54:54 -04:00
ba8ff35109 updated ingest_congress to use congress-legislators for legislator info 2026-03-29 15:54:54 -04:00
e368402eea adding LegislatorSocialMedia 2026-03-29 15:54:54 -04:00
89f6627bed converted session.execute(select to session.scalars(select 2026-03-29 15:54:54 -04:00
c5babf8bad ran treefmt 2026-03-29 15:54:54 -04:00
dae38ffd9b added ingest_congress.py 2026-03-29 15:54:54 -04:00
ca62cc36a7 adding congress data to new DS DB 2026-03-29 15:54:54 -04:00
e40ab757ca making more generic exception handling 2026-03-29 15:54:54 -04:00
345ba94a59 ran ingest_posts 2026-03-29 15:54:54 -04:00
f2084206b6 adding tables for 2023 2026-03-29 15:54:54 -04:00
50e764146a added ingest_posts.py 2026-03-29 15:54:54 -04:00
ea97b5eb19 adding 2026 partitions 2026-03-29 15:54:54 -04:00
1ef2512daa adding post table 2026-03-29 15:54:54 -04:00
d8e166a340 adding data_science_dev 2026-03-29 15:54:54 -04:00
c266ba79f4 updated snapshot_config.toml 2026-03-29 14:12:06 -04:00
8f8177f36e adding zstd compression to fastapi 2026-03-24 12:53:44 -04:00
b4d9562591 fixed treefmt 2026-03-22 19:07:23 -04:00
66f972ac2b removing react 2026-03-22 19:07:23 -04:00
1b5a036061 httpx conversion 2026-03-18 19:38:14 -04:00
42330ec186 changed to sa.Enum 2026-03-18 19:29:59 -04:00
3f4373d1f6 fixed tests and treeftm 2026-03-18 19:29:59 -04:00
976c3f9d3e move signal bot to its own DB 2026-03-18 19:29:59 -04:00
2661127426 updated _format_location to use van van_last_known_longitude and van_last_known_latitude 2026-03-18 19:29:59 -04:00
1b3e6725ea added sync_roles 2026-03-18 19:29:59 -04:00
7d2fbaea43 added bot class and rbac style auth with dynamic help msg base on roles 2026-03-18 19:29:59 -04:00
a19b1c7e60 Add Signal location command backed by Home Assistant 2026-03-18 19:29:59 -04:00
c83bbe2c24 added more data to van weatere and moved retry logic to tenacity 2026-03-15 12:06:01 -04:00
4e3273d5ec fixed tree fmt and removed chat with images 2026-03-14 11:49:44 -04:00
b5ee7c2dc2 added logging 2026-03-14 11:49:44 -04:00
958b06ecf0 added auth cashe 2026-03-14 11:49:44 -04:00
71ad8ab29e removed comand prefix 2026-03-14 11:49:44 -04:00
f1e394565d migrated to tanasty and added dead letter queue 2026-03-14 11:49:44 -04:00
3a3267ee9a fixed ruff warning 2026-03-14 11:49:44 -04:00
0497a50a43 removed repo_line_counter.py 2026-03-14 11:49:44 -04:00
6365dd8067 updated the van inventory to use the api 2026-03-14 11:49:44 -04:00
a6fbbd245f fixed safety number logic 2026-03-14 11:49:44 -04:00
7ad321e5e2 moved device registry to postgresql 2026-03-14 11:49:44 -04:00