10 Proven Ways I Instantly Spot Bad AI-Generated Code (link)
The 4 Lines Every CLAUDE.md Needs (link)
Best SQL Hacks I Wouldn’t Have Believed If I Hadn’t Used Them Myself (link)
Java 26 Is Here — Here’s What Actually Matters (link)
Anthropic Is Giving Away 13 Free Courses That Others Charge Thousands For (link)
I Tested 5 Python ORMs. One Replaced SQLAlchemy Completely. (link)
sqlit (link)
rainfrog - terminal ui for postgres (link)
RIP Commercial OCR. An Open-Source Model Just Topped Every Benchmark (link)
Why Agentic AI Is the #1 Skill To Learn (link)
Everyone Said AI Would Take Engineering Jobs. They Were Right. (link)
The YC CEO Ships 10,000 Lines a Day. Here’s His Exact Setup (link)
9 Things Every Python Script Should Have (link)
Zero ETL Is the Reality Check Every Data Engineer Needs in 2026 (link)
Agentic AI with DuckDB and smolagents (link)
Our JetBrains Devs Switched to VSCode. They Hate VSCode. (link)
Every Python Concept Explained (link)
Exploratory Data Analysis Checklist: What to Look for Every Time (link)
oLLM: The Revolutionary Python Library Running Powerful Language Models on Ordinary Computers (link)
The RAG Layer Nobody Talks About (link)
“OpenRAG” From Documents to Agentic Search in Minutes (from IBM research open source) (link)
End-to-End Data Engineering Project (by Free Tools) (link)
How to Build a High-Performance, Free ELT Pipeline Locally using DuckDB (link)
10 SQL Time Zone Mistakes Most Teams Discover Only After Numbers Are Escalated by Executives (link)
10 Critical Data Quality Queries Every Data Engineer Should Implement (Before Trust Is Lost) (link)
Google Finally Solved The Fragile Text-to-SQL Systems (link)
F3: The Future-Proof File Format That Finally Gets It Right (link)
Dynamic Programming: A Handbook (link)
Automate schema mappings with LLMs (link)
How to Scrape a Website to Markdown: 2026 Guide (link)
Django vs FastAPI: I Built the Same App Twice (link)
How to Build a Local ELT Pipeline with DuckDB and DBT (link)
NumPy + Pandas: The Only Guide You Need (link)
AI Code Assistants for Data Engineering: I Tested 6 Tools for SQL and Python (link)
Agentic EDA with AI Foundry: Automating Exploratory Analysis (link)
The 2026 Data Engineering Roadmap: Building Data Systems for the Agentic AI Era (link)
10 Business-Centric Data Metrics Analysts Ignore (Until They’re Replaced) (link)
10 dbt Macros That Saved Us 100+ Hours (With Copy-Paste Code) (link)
The Power of PyTorch and vLLM Together (link)
10 Data Models Every Data Engineer Must Know (link)
The First Nmap Scan That Makes You Realize How the Internet Actually Works (link)
How to Write SQL Queries That Use Window Frames to Transform Your Data Analysis (link)
Building a Simple SQL Parser in Python: From Basics to Hands-On (link)
Unlock RAG-Anything’s Power with Ollama on Your Machine (with Docling as Bonus) (link)
SQL Performance Mastery: 10 High-Impact Aggregation Strategies for Sub-Second Queries (link)
Why Semi-Joins in SQL Are More Powerful Than You Think (And How to Master Them) (link)
Handling Imbalanced Data: The Complete Guide Every Data Scientist Must Know (link)
Fivetran vs. Airbyte in 2026 | Complete ELT Guide (link)
AI Skills Are Exploding in 2026 (link)
Why Gradient Boosting Often Beats Deep Learning on Tabular Data (And How to Tune It) (link)
Why Exponentials and Logarithms Dominate Statistics and Information Theory: The Fundamental Properties (link)
The Journey to Causality: From Dashboards to Causal Inference (link)
How to Choose the Right Search Algorithm for your VectorDB? (link)
Top 8 Free Python Excel Libraries for Developers in 2025 (link)
Data Analyst vs. AI Agent: Who Wins the Job in 2026? (link)
How to Write SQL Queries That Detect When a LEFT JOIN Is Silently Dropping Rows (link)
The Semantic Layer Revolution: How dbt and Databricks Built the Universal Language of Business Data (link)
The Truth About Data Modeling: What You Learn Only After Real Projects (link)
I Wasted 6 Months Learning NotebookLM the Hard Way — So You Don’t Have To (link)
Phind vs Google: My Go-To Tool for Explaining Complex Code (link)
Python + MCP Is the New Automation Superpower (Here’s the Proof) (link)
Why COLLATION Rules in SQL Are More Powerful Than You Think (And How to Master Them) (link)
We Spent 2 Years Building a Data Mesh. It Was a $4M Disaster. (link)
Mastering Hyperparameter Tuning (link)
What is Microsoft MarkItDown and Why It Matters? (link)
Every Python Built In Function Explained (link)
How Web Search Inside AI Chatbots Works (link)
Testing Google’s Antigravity for Data Engineering: My End-to-End Experience (link)
Insert-Only Design in Modern Data Warehousing: Lessons from Data Vault 2.0, SCD2, and Databricks Performance (link)
10 Hard Window Join Problems in SQL and How to Solve Them (link)
The Serverless Illusion: Why Everyone Is Quietly Moving Back to VMs (link)
How I Built an AI That Talks to Your Database: A Journey into RAG (link)
What Makes dbt So Popular in Modern Data Teams? (link)
Uber Eats Data Warehouse Architecture: A Complete Guide to Trip Data Modeling and Analytics Storage (link)
UV in Python: The Fastest Package & Project Manager (Complete Guide + Example Project) (link)
HTMX Murdered React: Why Nobody Needs JavaScript Frameworks Anymore (link)
Why Anti-Joins in SQL Are More Powerful Than You Think (link)
Why Window Exclusion Clauses in SQL Are More Powerful Than You Think (link)
Python visualization tools to level up from Matplotlib (link)
How to Write Python (and Others) Code Your Future Self Will Thank You For! (link)
Understanding Palantir’s Ontology: Semantic, Kinetic, and Dynamic Layers Explained (link)
Meet TOON — The Fresh Data Format That Could Replace JSON in the AI Era (link)
Storing products, prices and orders in PostgreSQL (link)
The Convergence Nobody Saw Coming: When APIs, AI, and Cloud Storage Accidentally Built the Future (link)
Your Database Is Slow Because You’re Using UUIDs (link)
Solving Many-to-Many & Drill-Across with the Unified Star Schema (link)
Why Scalar Subqueries in SQL Are More Powerful Than You Think (link)
SQL Query Optimization: Modern Techniques and Best Practices (link)
Every Industry Faces the Same Data Problems. My Take on Palantir’s Solution at AIPCON 8 (link)
Designing the Open Metadata Modeling Platform (link)
Self-Describing SQL: Embedding Metadata as YAML Front-Matter in Generated Objects (link)
Lessons from Data Vault: Principles Without the Dogma (link)
Islands and Gaps with Recursive CTE (link)
Graph Analytics for All of Your Data - Oracle (link)
Getting Started with Oracle AI Database AI Vector Search (link)
How and Why Netflix Built a Real-Time Distributed Graph: Part 1 — Ingesting and Processing Data Streams at Internet Scale (link)
From Kimball to Metadata: How Dimensional Thinking Still Shapes Modern Data Architecture (link)
Dual SCD2: The Foundation for True History in Data Warehousing (link)
Time-Based SQL Questions: Gaps, Overlaps, and Intervals (link)
Resampling Imbalanced Datasets for Binary Classification (link)
Robust methods to generate synthetic table data (link)
A Marketer’s Guide to Calculus (link)
Cosine Distance vs Dot Product vs Euclidean in vector similarity search (link)
6 Data Modeling Mistakes That Kill Scalability (and How to Fix Them) (link)
Why Recursive CTEs Are More Powerful Than You Think (link)
Palantir’s Ontology, Kimball’s Star Schema, and Model-Driven Data Engineering: A Comparative View (link)
SQL Complexity Explained: What Your Queries Are Really Doing Behind the Scenes (link)
Synthetic Data: What It Is and How to Use It (link)
5 Essential MCP Servers That Give Claude & Cursor Real Superpowers (2025) (link)
The Only 15 SQL Questions I Ask in Every Junior Data Scientist Interview (link)
From Raw Data to Reliable Systems: The Power of Data Modeling in Data Engineering (link)
From Chaos to Clarity: Advanced Data Models Every Data Engineer Must Master. (link)
How to Compare Two or More Distributions (link)
Is Your Training Data Representative? A Guide to Checking with PSI in Python (link)
What Fivetran’s acquisition of dbt Labs would mean for the Data Industry (link)
Agentic AI: Building Long-Term Memory (link)
I built an end-to-end interpretable Machine Learning research pipeline (link)
The Anatomy of a Modern LLM (link)
Waiting for Postgres 18: Accelerating Disk Reads with Asynchronous I/O (link)
Building 17 Agentic AI Patterns and Their Role in Large-Scale AI Systems (link)
MCMC & the art of Sampling without Sampling (link)
Building a Real-Time Profit & Loss Engine with RisingWave and Streaming SQL (link)
What Is The Best Diagramming Software in 2025 (link)
excalidraw (link)
Type Casting in Python (link)
SIMD: The real superpower behind super fast databases (link)
How to Start Learning Machine Learning: A Practical Guide (link)
Autoencoders for Defect Detection in Images (link)
What is an autoencoder? (link)
Integrating LLMs and AI Agents into Data Engineering Workflows (link)
Memory Management in Python (link)
Get Excited About Postgres 18 (link)
Optuna: The Hyperparameter Optimization Framework That Saved My Machine Learning Sanity (link)
7 Wonders of Data Science (link)
SQL Window Functions Explained Like a Story (link)
Why Generative AI Is Forcing Us to Rethink Data Modeling (link)
Stop Using Requests — Try This Modern HTTP Library Instead (link)
LangExtract (Google, Open Source): Turn Unstructured Text into Structured, Auditable Data (link)
Zero Degrees of Separation (link)
HTAP: Still the Dream, a Decade Later (link)
Demystifying Apache Spark (link)
The Complete Timeline of How AI Went From Miracle to Bubble in 3 Months (link)
mlflow (link)
Radically Simple Data Lineage (link)
DocumentDB (link)
AWS joins the DocumentDB (link)
A decade of database innovation: The Amazon Aurora story (link)
Claude Sonnet 4 vs Kimi K2 vs Gemini 2.5 Pro: Which AI actually ships production code? (link)
5 SQL Questions That Stump Even Senior Analysts (link)
SQLModel (link)
Effortless EDA with Sweetviz & YData-Profiling (link)
How Agentic RAG is Transforming Information Retrieval (link)
Why LLMs Can’t Solve Time Series (link)
Why AI Frameworks (LangChain, CrewAI, PydanticAI and Others) Fail in Production (link)
Introducing LangExtract: A Gemini powered information extraction library (link)
LangExtract (link)
Fun and weirdness with SSDs (link)
Vector Search Isn’t the Answer to Everything. (link)
A Hands-on Guide to DSPy (link)
How to Give Your RTX GPU Nearly Infinite Memory for LLM Inference (link)
What You Should Know About B-Trees on Disk (link)
PyCaret (link)
STOP Guessing Who Will Leave — How I Would Predict Customer Churn Before It Happens (link)
Building the Entire RAG Ecosystem and Optimizing Every Component (link)
What is a t-test and When to Use It in Pandas? (link)
The Complete Beginner’s Guide to Python Modules (link)
Kimball Star Schema vs Palantir’s Ontology (link)
Google’s Data Science Agent: Data Scientists are doomed (link)
The Loop is Back: Why HRM is the Most Exciting AI Architecture in Years (link)
Database Connections in FastAPI: Best Practices for Efficient and Scalable APIs (link)
The Dark Side of @Transactional in Spring Boot—Exposed (link)
Customer-Facing Analytics Without Denormalizing Everything (link)
Building Agentic Adaptive RAG with LangGraph for Production (link)
Microsoft Study: 40 Jobs AI Will Replace vs 40 Safe Careers (link)
How CERN Powers Ground-Breaking Physics with TimescaleDB (link)
RAG Without Embeddings? Here’s how OpenAI is doing this… (link)
Object Detection with Python and HuggingFace Transformers (link)
Implementing 12 AI Agent Evaluation Techniques Using LangSmith (link)
25 ChatGPT Prompts to Replace Hours of Work as a Data Analyst (link)
Optimizing LangChain AI Agents with Contextual Engineering (link)
Why the Server Should Handle the Web Again (link)
From Chaos to Clarity: Building Modern Python Projects with UV (link)
The AI Stack No One Talks About: Data Acquisition as Infrastructure (link)
Agentic-Doc: Extract Structured Data from Complex Documents with Ease (link)
Experimenting with SQL:2023 Property-Graph Queries in Postgres 18 (link)
A Database Schema for Engineering Project Management (link)
Featherweight - Lightning Fast Analytics with DuckDB and Postgres (link)
Why Your Data Lake Needs BLM, Not LLM (link)
Chat with your documents tool — RAG (vector DBs + cosine sim.) & Claude API implementation (link)
Implementing 9 Techniques to Optimize AI Agent Memory (link)
The Hidden Costs of LangChain, CrewAI, PydanticAI and Others: Why Popular AI Frameworks Are Failing Production Teams (link)
SQL Isn’t a Query Language. It’s a Thinking Framework (link)
How to Build a Production-Ready RAG App with Gemma and Bright Data in Under an Hour (link)
I spent $500 testing Replit/Lovable/Bolt/v0 & Cursor so you don’t have to (link)
Billions of Edges Per Second with Postgres (link)
pdot: Interactive Directed Graphs of Your Database (link)
Exploring Databases Visually (link)
pdot: Exploring Databases Visually, Part II (link)
pgai (link)
Foursquare Introduces SQLRooms (link)
Smaller Models, Smarter SQL: Arctic-Text2SQL-R1 Tops BIRD and Wins Broadly (link)
How to Build Near Real Time Data Pipelines with Incremental Loading (link)
5 Things You Didn’t Know About LocalStorage (link)
Building Real-Time Dashboards with FastAPI and HTMX (link)
HTMX Made Me Like the Web Again (link)
I Used MCP for 3 months: Everything You Need to Know + 24 Best Servers. New Anthropic DTX Extensions (link)
UV Cheatsheet for Python Developers (link)
Palantir Foundry Is 5–10 Years Ahead of Every Other Data Platform (link)
Performance vs Practicality: A Comparison of vLLM and Ollama (link)
Agentic RAG: Company Knowledge Slack Agents (link)
The math and logic behind ChatGPT. This paper is all you need. (link)
Build an AI Agent That Turns SQL Databases into Dashboards — No Queries Needed (link)
OpenAI: Scaling PostgreSQL to the Next Level (link)
Data Engineering: Now with 30% More Bullshit (link)
Solving RL Algorithms by Hand: A Step-by-Step Math Example (link)
DuckDB vs Databricks SQL Warehouse: Can We Save on Compute? (link)
Dynamic Data Source Routing in Spring Boot: Master Multi-Tenancy & Read-Write Separation (link)
Documents to Markdown Beyond Conventional Limits with gpt4.1 | ExtractThinker (link)
GPT-4.1 just PERMANENTLY transformed how the world will interact with data (link)
SQLMesh Incremental Modeling with DuckDB: A Hands-On Tutorial (link)
Postgres Language Server: Initial Release (link)
VS Code’s New AI Did My Job for 30 Minutes. The Results? Honestly Shocking! (link)
Building a modern Data Warehouse from scratch (link)
A case where SQL joins struggle but MongoDB documents shine (link)
Fine-Tuning a Text-to-SQL LLM for Reasoning Using GRPO (link)
From Zero to Production: Building a Rock-Solid K3s Cluster with PostgreSQL (link)
Beyond Materialized Views: Using DuckDB for In-Process Columnar Caching (link)
I tested out all of the best language models for frontend development. One model stood out amongst the rest. (link)
Transforming VS Code into a Powerful SQL IDE (link)
smallpond (link)
Life Altering Postgresql Patterns (link)
API-First AI Agentic Patterns: Building Smarter Systems Without the Framework Overhead (link)
The Most Comprehensive Explanation of Session, Cookie, Token, and JWT (link)
Should Stakeholders be Writing SQL for Self-Service? (link)
PgBouncer: Don’t Let Connection Chaos Ruin Your Day (link)
Xata Agent, your AI expert in PostgreSQL (link)
Can Artificial Intelligence Create Better Tables Than You? (link)
Turbocharging Denodo AI SDK: How Semantic Caching Makes Text-to-SQL 9X Faster (link)
How I built a SaaS product with 100% AI generated code (link)
Benchmarking PostgreSQL Batch Ingest (link)
Google’s Gemma-3 Fine-Tuning Made Simple: Create Custom AI Models with Python and Unsloth (link)
This Happens Inside Python…When We Call a Function (link)
SQLAlchemy 2.0: The Most Powerful ORM for Python Yet (link)
After DeepSeek-R1, a goldmine of research is waiting to be discovered. (link)
Stop Writing Manual Validators! Use Pydantic for Data Validation (link)
PostgreSQL: 1 trillion rows in Citus (link)
Is Kimball Still Relevant in the Modern Data Warehouse Era? (link)
STOP Using Python Dictionaries Like This! (link)
Multi-Tenant Architecture using SpringBoot and PostgreSQL (link)
PyGWalker (link)
OpenAI’s new prompting guide: how to get the best results from reasoning models (link)
Using Custom Agents in Cursor, Windsurf, Copilot and Others to Supercharge Your Workflow (link)
Data Warehouse Basics: How to Handle Changing Data with SCDs (link)
Postgres is all you need for vectors (link)
Building a Perfect Million Parameter LLM Like ChatGPT in Python (link)
I “vibe-coded” over 160,000 lines of code. It IS real. (link)
Hard-Earned Lessons from a Year of Building AI Agents (link)
License to Kill: Coding with Cursor AI Agents (link)
How To Train Your PyTorch Models (Much) Faster (link)
Prompt Decorators: A Simple Way to Improve AI Responses (link)
Postgres query plan visualization tools (link)
Vector Search at 10,000 QPS in PostgreSQL with VectorChord (link)
Optimizing PostgreSQL Performance: Essential Queries for Monitoring and Maintenance (link)
One Line of SQL, All the LiteLLM Embeddings (link)
How to Map Column Values in a Pandas DataFrame? (link)
Introducing Wren AI’s New AI-Powered Spreadsheets & Pre- (link)
Top 6 Core App Dashboard Building Tools (link)
How I Learned to Love __init__.py: A Simple Guide (link)
Postgres as a Graph Database: (Ab)using pgRouting (link)
EdgeDB is now Gel and Postgres is the Future (link)
EdgeDB 1.0 (link)
Use PASSING with JSON_TABLE() To Make Calculations (link)
How PostgreSQL’s Aggregate FILTER Will Spoil You (link)
Handling Billions of Rows in PostgreSQL (link)
Creating an AI Agent That Uses a Computer Like People Do (link)
Delta Lake 4.0: Next-Level Big Data Management (link)
From Traditional BI to GenBI: Embracing a Smarter, More Human Approach (link)
Building a Reliable Text-to-SQL Pipeline: A Step-by-Step Guide pt.1 (link)
Building a Reliable Text-to-SQL Pipeline: A Step-by-Step Guide pt.2 (link)
Batch processing at Scale: How PostgreSQL outperformed BigQuery for us (link)
Try Deep Research and our new experimental model in Gemini, your AI assistant (link)
Introducing Perplexity Deep Research (link)
ClickBench — a Benchmark For Analytical DBMS (link)
Economics of LLMs: Evaluations vs Pricing (link)
Open Source Data Engineering Landscape 2025 (link)
Modern CI-CD Pipelines of REST API Python Project with UV (link)
Real-Time Chat Application with FastAPI and WebSockets (link)
10 Advanced Python Concepts You Should Know To Be a Senior Developer (link)
ETL, ELT… or Something Better? (link)
MkDocs (link)
Building DeepSeek R1 from Scratch Using Python (link)
Postgres in the time of monster hardware (link)
Representing graphs in Postgresql (link)
Expanding Pgai Vectorizer: SQLAlchemy and LiteLLM Make Vector Search Simple (link)
Redis with FastAPI for Lightning-Fast Applications (link)
Top 8 PostgreSQL Extensions (link)
No need to write the boilerplate code again. Use Python dataclasses instead (link)
Building the Modern PostgreSQL GUI With PopSQL (link)
The twelve-factor app (link)
What I’ve Discovered While Using uv (link)
11+ React Ecosystem: Libraries That Shape Modern Development in 2025 (link)
20 Advanced Statistical Approaches Every Data Scientist Should Know (link)
How Uber Handles TRILLIONS of Transactions — The Secret (link)
Data Formulator: Create Rich Visualizations with AI (link)
Improving Agentic SQL Generation (link)
Several ways to improve the accuracy of AI-generated SQL (link)
IceCream (link)
Building a SQL Bot with LangChain, Azure OpenAI, and Microsoft Fabric (link)
11 Free Must-Know Websites for Every Developer (link)
OpenAI just quietly released another agentic framework. It’s really fucking cool (link)
Google just ANNIHILATED DeepSeek and OpenAI with their new Flash 2.0 model (link)
Enhancing Text-to-SQL Agents with Step-by-Step Reasoning (link)
Top 16+ Modern React Libraries To Supercharge Your Next Big Project (link)
Let’s reproduce GPT-2 (1.6B): one 8XH100 node, 24 hours, $672, in llm.c (link)
Drawing DeepSeek R1 Architecture and Training Process from Scratch (link)
Advanced SQL for Data Professionals (link)
FineWeb: decanting the web for the finest text data at scale (link)
Step-by-Step: Running DeepSeek locally in VSCode for a Powerful, Private AI Copilot (link)
DeepSeek R1 in 24GB GPU : Dynamic Quantization by Unsloth AI for a 671B-Parameter Model (link)
Fine-Tuning DeepSeek-R1 on Consumer Hardware: A Step-by-Step Guide (link)
DeepSeek Fine-Tuning Made Simple: Create Custom AI Models with Python (link)
Handling Slowly Changing Dimensions (SCD) in Modern Data Pipelines: A Complete Guide with SQL Examples (link)
Top 15 Must-Have JavaScript UI Libraries to Elevate Your Web Development (link)
I ranked every AI Coder: Bolt vs. Cursor vs. Replit vs Lovable (link)
Ollama vs vLLM: which framework is better for inference? (Part II) (link)
Automating Data Summarization in PostgreSQL With Claude (link)
RunSQL (link)
ChartDB (link)
Rethinking the frontend with HTMX (link)
Why Docker Might Not Be Your Best Choice (link)
Why Choose Caddy Server instead of Nginx? (link)
documentdb (link)
Top 15 Trending GitHub Repositories For AI Development (link)
Stream Processing Systems in 2025: RisingWave, Flink, Spark Streaming, and What’s Ahead (link)
Ollama vs vLLM: which framework is better for inference? (link)
Is Cursor better than VS Code? (link)
Stop Being Racist! Just Use DeepSeek Dammit! (link)
AlloyDB vs PostgreSQL: Unleash Performance, Slash Costs, Simplify Data Stack (link)
2025 Data Engineering & AI Trends (link)
Don’t Fear Async: A Friendly Guide to Python’s Most Powerful Tool (link)
Top 11 AI-Powered Developer Tools Transforming Workflows in 2025 (link)
A Simple Guide to DeepSeek R1: Architecture, Training, Local Deployment, and Hardware Requirements (link)
Building an End-to-End Data Lake ELT Pipeline using Modern Data Stack (link)
25 Game-Changing Websites Every Developer Must Bookmark (link)
I am among the first people to gain access to OpenAI’s “Operator” Agent. Here are my thoughts. (link)
DeepSeek-R1: A Cutting-Edge Logical Reasoning Model for Local AI Development with Ollama (link)
I spent 6 hours learning AWS Glue. Here is what I found (link)
LlamaCoder (link)
OCR with Phi-3-Vision: Revolutionizing Document Processing (link)
How We Built a Content Recommendation System With Pgai and Pgvectorscale (link)
SQL Applications using LlamaIndex (link)
Step-by-Step Guide to Creating Your Own Large Language Model (link)
LLM Finetuning Strategies (link)
A Visual Exploration of Semantic Text Chunking (link)
The Best NVIDIA GPUs for LLM Inference: A Comprehensive Guide (link)
Jupyter Agent: Revolutionizing Data Analysis with LLMs (link)
Combining FastAPI, PostgreSQL, and Leaflet — GIS Tutorial (link)
Unlock Custom Quantization for Hugging Face Models Locally with Ollama (link)
Python Mako (link)
Python Memory Management: Best Practices for Performance (link)
From query to plot: Exploring GeoParquet Overture Maps with Ibis, DuckDB, and Lonboard (link)
PostGIS meets DuckDB: Crunchy Bridge for Analytics goes Spatial (link)
FlockMTL is a DuckDB extension that integrates language model (LLM) (link)
DuckERD CLI (link)
Enhance your AI/ML applications with flexible Bring Your Own Model options (link)
Building a High-Performance Data Pipeline Using DuckDB (link)
Why PyMuPDF4LLM is the Best Tool for Extracting Data from PDFs (link)
This is How I Use Swagger to Design REST APIs Before Starting the Development (link)
The Ultimate Guide to Relational Databases for Backend Developers (link)
Dynamic SQL Query Formation with Jinja Template (link)
How I Made A Python Script 130 Times Faster with PostgreSQL Query Optimization (link)
Examples of specialized DB Usage (link)
Turn Your CSVs Into Graphs Using LLMs (link)
Optimizing Your Database for Analytics (link)
Implementing Hybrid Search with Postgres and pgvector (link)
How SELECT FOR UPDATE Works (link)
Storage Disaggregated Databases and Shared Transaction Log Architecture In Comparison (link)
Implementing Filtered Semantic Search Using Pgvector and JavaScript (link)
4 SQLAlchemy Features For Data Engineering (link)
Extracting Meaning from Tables in Financial Statements With LLMs and Chatbots (link)
How to Containerize Your Local LLM (link)
LLM Monitoring and Observability: Tools, Tips and Best Practices (link)
Quantization of LLMs with llama.cpp (link)
A Practical Guide to using Pydantic (link)
Running Ollama on Google Colab (Free Tier): A Step-by-Step Guide (link)
DuckDB & PyArrow: Lightweight and Speed Data Analysis (link)
The new “window.ai” API will blow your mind. (link)