Nazih Kalo's Resume

Nazih Kalo

Data Scientist / Engineer focused on building high-quality data products and ML systems.

Brooklyn, NY

About

From the wild world of crypto to the dynamic domain of data science, my passion for exploring the cutting edge has led me down an exciting path of discovery. I've worked across multiple layers of the data stack; as a data scientist, data engineer, product analyst and a bit of frontend development. I enjoy exploring the power of AI, macro & behavioral economics, and incentive models to uncover insights that can help the companies I work with thrive.

Work Experience

Phantom

2023 - Present

Senior Data Scientist

  • Built all data pipelines, including indexing & decoding on/off-chain data from multiple chains using Airflow/Spark/dbt
  • Developed nft & wallet recommendation engines, leveraging wallet trading/minting history to power follow/content suggestions
  • Maintained all internal/external dashboards (incl. dune, internal), retention/growth insights, & analytics for partners
  • Remote
  • Data Science
  • ML
  • Python
  • dbt

CyberConnect

2022 - 2023

Head of Data

  • Built all data pipelines, including indexing & decoding on/off-chain data from multiple chains using Airflow/Spark/dbt
  • Developed nft & wallet recommendation engines, leveraging wallet trading/minting history to power follow/content suggestions
  • Maintained all internal/external dashboards (incl. dune, internal), retention/growth insights, & analytics
  • Remote
  • Data Engineering
  • ML
  • Python
  • Spark

Scale AI

2020 - 2022

Product Manager → Data Engineer

  • Built & maintained data pipelines for the company's largest data extraction/scraping project, scraping 12M+ products from ~5000 ecommerce sites
  • Developed internal Payout Optimizer to dynamically adjust payout functions to hit target rates; reduced pay variance by ~50% and led to $90k savings/month
  • Deployed self-hosted data cataloging tool (Amundsen), improving data discovery across the company
  • Reduced LiDAR labeling time 34% through optimizing ML pre-labels and developing a new labeling pipeline
  • San Francisco
  • Data Engineering
  • Product
  • Python

Hive AI

2020 - 2020

Product Analyst

  • Product lead for company's new ML based text-moderation product
  • Collaborated with the ML team to develop a human-assisted model auditing system to identify model deficiencies
  • San Francisco
  • ML
  • Product

Apple

2018 - 2018

Operations Intern

  • Built data pipelines integrating internal & vendor data to reduce spend forecasts latency from 168 to 24hrs
  • Managed data for $50M budget for iPhone XR dev builds and identified $1M fraudulent invoices through analysis
  • Cupertino
  • Data Analysis
  • Operations

Education

University of Chicago

2019 - 2020
MSc Data Science

University of California, Berkeley

2014 - 2017
B.A Economics

Skills

  • Python
  • SQL
  • dbt
  • Spark
  • Airflow/Dagster
  • AWS
  • GCP
  • Machine Learning
  • NLP
  • Data Engineering
  • GraphQL
  • React/TypeScript