Pipelines, plates of dal-bhaat, and a stubborn running habit.

I'm Rajeev — a data engineering team lead in Kathmandu. I spend most of my weekdays building data pipelines, and most of my weekends running, watching F1, or getting dissapointed by my fav football team.

About

I lead a small data team at 108 Capital. We ship pipelines that index multi-TB blockchain history, scrape the strange corners of social media, and turn audio into searchable text.

A short feed of what's currently consuming my brain — newest at the top.

// now.log

Four things currently running in the background.

last edit · 2026-05-28
2026 · May BUILDING

A small F1 data warehouse + a laptime comparator on top.

Pulling from jolpica and openf1 into one store, then a downstream service that compares two drivers in any session — lap by lap, sector by sector. Currently at v0.3, ingestion's stable, the comparator's getting there.

jolpicaopenf1warehouselaptime-compare
2026 · ongoing TRAINING

Kathmandu Marathon, November — my first.

Five-hour goal. The point this time is to finish, not to chase a number. Most runs are after dark, ~18:30 start, when the city cools down.

42.2 kmgoal 5:00:00night runs
every sunday WATCHING

Formula 1 — 2026 season.

New regs, weirder than expected. Lights-out is sacred; everything else routes around it.

2026 · May READING

Paper Towns, John Green.

A novel between systems books — turns out a plot is restful.

Work

Three roles, one through-line — make the pipes hold.

Apr 2025 — Now

Data Engineering Team Lead

108 Capital · Remote

Lead a team of engineers shipping production data infrastructure. Owned architecture, deployment, and monitoring across five products in our first year together — from on-chain indexers to NLP-on-warehouse sentiment pipelines.

Architecture Mentorship Reviews
Oct 2023 — Mar 2025

Data Engineer

108 Capital · Remote

Built the on-chain indexer from scratch as sole owner — a ClickHouse warehouse with dbt + Prefect ETL keeping EVM data under one-minute latency at multi-TB scale. Then built the Twitter pipeline and started the Spark migration.

ClickHouse dbt Prefect Airflow
Jan 2023 — Oct 2023

Software Engineer — Frontend & Real-Time

Arman Group · Remote

Built a multiplayer mobile Ludo game in Flutter and owned the real-time communication layer over WebRTC. Different stack, same taste for low-latency systems.

Flutter WebRTC Dart
Things I built

Four at work, two for fun, all production.

Four production systems shipped at 108 Capital, plus two personal open-source projects. Each one has a stack story — happy to tell it over coffee.

Blockchain

EVM On-Chain Indexer

Multi-chain blockchain data platform covering Ethereum Mainnet, Base, BSC, Arbitrum, and Optimism. Sub-one-minute end-to-end latency using Kafka stream processing and ClickHouse UDFs for real-time ABI decoding.

ClickHouseKafkadbt PrefectPythonUDFs
20TB+
on-chain data across five EVM chains, held under one-minute end-to-end latency.
Web Scrping

Socials Intelligence

Social pipeline covering account management and ETL orchestration with Airflow + ClickHouse. GPU-enabled NLP UDFs run on a distributed cluster for in-warehouse sentiment analysis and entity extraction.

AirflowClickHouseGPU UDFs NLPSentiment
45min
worst-case ingestion delay. Sentiment and entity extraction happen inside the warehouse.
Media

AV Engine

Media transcription pipeline that tracks new YouTube channels and podcast feeds, downloads content, and transcribes with AI. Orchestrated with Plomberry; Celery handles the async background work so the queue never blocks the front end.

PlomberryCeleryWhisper Python
audio,
searchable.
podcasts and YouTube channels turn into queryable text, async, every day.
Reverse Engineering

NEPSE API

Reverse-engineered the Nepal Stock Exchange's undocumented internal API to extract live market data — prices, order books, and trade history — that NEPSE never officially exposed. Built a clean Python client on top so anyone can pull structured stock data without screen-scraping.

PythonReverse EngineeringREST
github ↗
zero docs,
clean API.
NEPSE had no public API. Reverse-engineered the private one and wrapped it.
Open Source

Monocoque

F1 data warehouse that ingests from Jolpica and OpenF1, normalises laps, telemetry, results, and standings, then loads everything into ClickHouse. One SQL query away from any race stat — built as a foundation so others can ship dashboards, models, or analyses without worrying about the plumbing.

ClickHousePythonJolpica OpenF1dbt
github ↗
F1,
queryable.
lap times, telemetry, and standings from two APIs — one ClickHouse warehouse, ready to build on.
Stack

Tools I reach for, in roughly the order I reach for them.

Warehouses
ClickHouse · Trino · Snowflake
Orchestration
Prefect · Airflow · Celery · Plomberry
Transformation
dbt · SQL · Python
Streaming
Apache Kafka · ClickHouse Mat. Views
Databases
SurrealDB · PostgreSQL
Languages
Python · Go · Dart · SQL
Infra
Docker · Linux · GPU UDFs · Distributed clusters
Real-time / other
WebRTC · Browserless · Headless scraping
Off-hours

Most of who I am happens off the keyboard.

A non-exhaustive list of obsessions that crowd out engineering on the weekend. None of them are productive. All of them are the point.

Sport · Discipline

Running, badly but daily

Started after my wrist gave up on too much typing. Now somewhere between "stubborn jogger" and "slow marathoner". Targeting sub-4 in October.

Sunday ritual

Formula 1, every weekend

Race day starts before lights out. I keep an embarrassingly detailed spreadsheet of pit-stop deltas — old habits.

The other Sunday ritual

MotoGP

The bikes lean past 60° and pretend gravity is optional.

Tribal allegiance

Football, mostly tactics

I watch midfielders like other people watch the ball. La Liga and the Premier League weekends are sacred.

Slow burn

Reading non-fiction

Systems books, distributed-systems papers, the occasional novel when my brain needs a hard reset.

Background hum

Long walks, longer playlists

Kathmandu's hills are the best debugging environment I've found. One hour out, one bug fewer.

Pulse

Signals from the my legs, and someone else's car.

Two feeds that update on their own. The runs come from my Strava; the standings come from the current F1 season. No editorialising.

Strava · last 4 weeks

The legs

Connecting…
km, last 4 weeks
activities
avg pace / km
m of climb
    Formula 1 · 2026 season

    The grid

    Loading…
    After rounds · Next:
      Constructors, top three
        Notes

        A small notebook, mostly for me.

        Engineering postmortems, stray opinions, the occasional running journal. New entries at the top.

        All notes
        Contact

        If a pipeline of yours is leaking, I might be useful. Otherwise, just say hi.

        Reach me

        I'm on Instagram and email. Most replies within a day.

        Send a message
        Kathmandu, Nepal
        27.7172° N · 85.3240° E