Get the latest tech news
DuckDB over Pandas/Polars
Since my previous post on DuckDB (DuckDB as the New jq), I’ve been continuing to use and enjoy DuckDB.
Recently, I wanted to analyze and visualize some financial CSVs, including joining a few files together. For example, here is how I parsed a Transactions.csv and summed entries by Category for rows in 2024 (simplified example, code formatted with Black): Once I switched to DuckDB, I could write much more familiar (to me) SQL, while still using python for the rest of the code:
Or read this on Hacker News