AI & ML interests

We're creating a causal atlas of global well-being---combining satellite imagery, AI agent swarms, traditional machine learning, and causal methods to track and test human development at scale

Recent Activity

cjerzak 
posted an update 18 days ago
view post
Post
681
Ethos: In our team at UT Austin, we train students to become full-stack researchers—and increasingly, designers of the systems that do research. Our students learn to carry projects end-to-end: from idea generation and theory to data creation, analysis, and iterative refinement across diverse subfields. Using modern AI (including agentic workflows) and scalable computation, students build reproducible pipelines that can ingest and update planetary-scale data—like satellite imagery and other high-dimensional sources. But the goal isn’t tool use for its own sake: students learn to set the objectives, constraints, and evaluation standards that guide these systems through large spaces of hypotheses, while grounding results in causal inference and careful measurement. The outcome is scholarship that can rigorously test policy counterfactuals and translate evidence into durable, responsible improvements in societal well-being.

We welcome students at every stage to engage with projects—from motivated high-schoolers to undergraduates, graduate students, and those from highly non-traditional backgrounds.

Join us! https://connorjerzak.com/students/

  • 2 replies
·
cjerzak 
posted an update 3 months ago
view post
Post
2878
>>> We're writing a new book, <Planetary Causal Inference>, on how to model counterfactuals at planetary scale by combining satellite imagery + other global data with local studies and RCTs. Forthcoming in 2026+.
>>> Book info: https://planetarycausalinference.org/book-launch
>>> All datasets used in the book will be openly available on our lab’s Hugging Face hub:
theaidevlab

cjerzak 
posted an update 5 months ago
view post
Post
277
Excited to share LinkOrgs Online, a web app hosted here on HF for embedding and linking organization names using character-level Transformer encoder models trained on half a billion scraped LinkedIn records.

Upload CSVs or paste text, generate 1024D embeddings (v1–v4), PCA viz, and download results!

Try it: cjerzak/LinkOrgs_Online

Details/paper: https://connorjerzak.com/linkorgs-summary/

Training/inference code: https://github.com/cjerzak/LinkOrgs-software
cjerzak 
posted an update 6 months ago
cjerzak 
posted an update 7 months ago
cjerzak 
posted an update 7 months ago
view post
Post
2358
Dropped a dataset on here for linking org data: half a billion records scraped from LinkedIn networks. Positive/negative matches, bipartite graphs, Markov clusters – all the goods to train models that actually work on fuzzy company names.

NegMatches, PosMatches, holdouts for eval.

Check it out: cjerzak/LinkOrgs