saturn·

data trove doge workforce cuts

saturn notebook · generated 2026-06-22 Report Notebook

Overview

Source: /home/coolhand/html/datavis/data_trove/data/policy/doge_cuts_by_agency.json

Saturn profiled 1 rows across 6 columns. The stats below are deterministic and machine-readable; the prose is a language-model interpretation of those stats (opt-in, added after the fact, never sees raw rows).

[2]:
!pip install saturn-dissect
import subprocess
subprocess.run([
    "saturn", "analyze", "/home/coolhand/html/datavis/data_trove/data/policy/doge_cuts_by_agency.json",
    "--findings", "data-trove-doge-workforce-cuts.json",
    "--llm", "anthropic:default",
])

Summary confidence: low

This dataset is a single-record metadata wrapper describing a DOGE (Department of Government Efficiency) federal workforce cuts dataset, organized by agency and paired with spending paradox data. The file contains only 1 row and 6 columns, with most columns ('_fields', '_key_numbers', '_sources', 'data') skipped during profiling — meaning the substantive data is likely nested inside those unparsed structures. The only readable signals are the description label and a stub flag set to 'True', suggesting this is a placeholder or index record pointing to richer nested content. Analysts should prioritize unpacking the 'data' and '_key_numbers' columns to access the actual agency-level workforce and spending figures.

citing: row_count · column_count · _description.top_value · _stub.top_value · _fields.alerts · _key_numbers.alerts · data.alerts

Out[4]:

saturn.schema() · 6 columns

column kind n null% unique alerts
_stub categorical 1 0.0% 1 long_tail imbalance
_description categorical 1 0.0% 1 long_tail imbalance
_sources unknown 1 0.0% skipped
_fields unknown 1 0.0% skipped
_key_numbers unknown 1 0.0% skipped
data unknown 1 0.0% skipped
Fig 1.
_description · Confirms this is a single-topic metadata record — expect all rows to share the same DOGE workforce cuts description.
Show data table
Top values for _description (1 unique shown, of 1 total).
valuecountshare
DOGE federal workforce cuts by agency with spending paradox data1100.0%
Fig 2.
_stub · The stub flag is universally 'True', signalling this record is a shell and the real data lives in nested fields.
Show data table
Top values for _stub (1 unique shown, of 1 total).
valuecountshare
True1100.0%
Fig 3.
Per-column null rate across the corpus. Columns are ordered by input position.
Show data table
Per-column null rate across the corpus.
columnkindnull %
_stubcategorical0.0%
_descriptioncategorical0.0%
_sourcesunknown0.0%
_fieldsunknown0.0%
_key_numbersunknown0.0%
dataunknown0.0%

_stub categorical other

This column, '_stub', is a degenerate single-value flag containing only the string 'True' across all 1 observed row. It carries zero information (entropy 0.0, top_rate 1.0, cardinality 1) and appears to be a placeholder or stub artifact rather than a meaningful feature. The alerts for long_tail and imbalance are technically triggered but are artifacts of the column's complete uniformity.

Treatment: Drop immediately; zero-variance column with a single row and single value provides no modelling signal.

anthropic:default · confidence high
Out[9]:

saturn.columns["_stub"].stats

statvalue
n1
nulls0 (0.0%)
unique1
top_value True
top_rate 1
cardinality 1
entropy 0
entropy_ratio 0
alert: long_tail1 singleton categories
alert: imbalancetop value is 100.0% of rows
Fig 4.
Top values for _stub.
Show data table
Top values for _stub (1 unique shown, of 1 total).
valuecountshare
True1100.0%

_description categorical metadata

This column contains a single free-text description string that appears to be a dataset-level title or metadata label, not a row-level feature. With n=1, there is only one record in this slice, and the sole value—'DOGE federal workforce cuts by agency with spending paradox data'—occupies 100% of rows (top_rate=1.0). The column carries zero information entropy (entropy=-0.0) and cannot contribute to any model or aggregation.

Treatment: Drop before modelling; this is a constant dataset-level descriptor with no analytical variance.

anthropic:default · confidence high
Out[12]:

saturn.columns["_description"].stats

statvalue
n1
nulls0 (0.0%)
unique1
top_value DOGE federal workforce cuts by agency with spending paradox data
top_rate 1
cardinality 1
entropy 0
entropy_ratio 0
alert: long_tail1 singleton categories
alert: imbalancetop value is 100.0% of rows
Fig 5.
Top values for _description.
Show data table
Top values for _description (1 unique shown, of 1 total).
valuecountshare
DOGE federal workforce cuts by agency with spending paradox data1100.0%

_sources unknown metadata

This column '_sources' contains only a single row with no computed statistics, making it impossible to characterize its content or distribution. The 'skipped' alert indicates the profiler bypassed analysis, likely due to an unsupported or complex data type (e.g., array, struct, or JSON). With n=1 and no uniqueness or value data available, no meaningful analytical signal can be extracted.

Treatment: Inspect raw values manually to determine type; if it contains provenance or lineage tags, consider parsing and exploding before use.

anthropic:default · confidence low
Out[15]:

saturn.columns["_sources"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

_fields unknown metadata

_fields is an unknown-type column with only 1 row and no computed statistics, likely a metadata or structural artifact from the ingestion pipeline (e.g., a nested/struct field that saturn could not parse). The profiler skipped analysis entirely, so no distributional information is available.

Treatment: Inspect raw source to determine if this is a nested struct or pipeline artifact; drop or unpack before modelling.

anthropic:default · confidence low
Out[17]:

saturn.columns["_fields"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

_key_numbers unknown other

This column contains only a single row with no computed statistics, and the profiler flagged it as 'skipped' with kind 'unknown', indicating saturn could not determine its type or distribution. With n=1 and no unique-count or any stats available, nothing meaningful can be inferred about its content or role. The column name '_key_numbers' suggests it may be an internal index or composite key field, but this is entirely speculative given the evidence.

Treatment: Investigate manually — only 1 row was profiled and the column type is unknown; verify whether this is a key, a metadata artifact, or a parsing error before use.

anthropic:default · confidence low
Out[19]:

saturn.columns["_key_numbers"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

data unknown other

This column contains only 1 row and was skipped by the profiler, yielding no distributional statistics. The kind is marked 'unknown', making it impossible to determine the column's semantic role or content. No nulls are present, but with n=1 the dataset itself may be a stub or test fixture.

Treatment: Investigate data source — single-row dataset with no profiling output; re-profile after confirming full data load.

anthropic:default · confidence low
Out[21]:

saturn.columns["data"].stats

statvalue
n1
nulls0 (0.0%)
unique
alert: skippedno profiler for kind=unknown

How to cite

click to copy

BibTeX
@misc{saturn-data-trove-doge-workforce-cuts-2026,
  author       = {Steuber, Luke},
  title        = {Saturn reading: data trove doge workforce cuts},
  year         ={2026},
  howpublished = {\url{https://dr.eamer.dev/saturn/view/data-trove-doge-workforce-cuts}},
  note         = {Profiled with saturn-dissect v0.2.0, prompt saturn-insight-v2, model anthropic:default},
}
APA
Steuber, L. (2026). Saturn reading: data trove doge workforce cuts. Source: /home/coolhand/html/datavis/data_trove/data/policy/doge_cuts_by_agency.json. Profiled with saturn-dissect v0.2.0 (saturn-insight-v2, anthropic:default). Retrieved from https://dr.eamer.dev/saturn/view/data-trove-doge-workforce-cuts