vincent tian

data scientist · san francisco bay area

about me

  • data scientist @ meta - business ai & api products
  • prev @ tiktok (social) and @ quantium (government & product analytics)
  • ml, deep learning & optimisation @ mit, maths + stats @ unimelb
  • grew up in perth, australia 🇦🇺

recent

aug 2025 · tiktok launched ai avatar stickers - part of the social avatar work i contributed to at tiktok.
jul 2025 · whatsapp launched the business calling api - a product i worked on at meta.
jul 2025 · meta announced new business messaging features at conversations 2025, including whatsapp calling and messenger api updates.
oct 2024 · won 1st place (starkware award) at cube summit 2024 for our project 'where's the alpha? scraping telegram with llms'.

work experience

mar 2025 – present
Meta · data scientist, business AI

causal inference, experimentation, and ml modelling across whatsapp api calling, whatsapp api voice messaging, and messenger calling.

oct 2024 – mar 2025
TikTok · data scientist, social

machine learning and experimentation on the social team, driving the launch and growth of social avatar, and defining social connection scenarios to improve organic relationship formation between users.

feb 2021 – jun 2023
Quantium · data scientist, product analytics

end-to-end analytics pipelines, forecasting models, and dashboards for B2B SaaS and government clients in australia.

projects

computer vision

get real: real vs fake image detection

built a fake product image detector using transfer learning (VGG-19, ResNet50, EfficientNet) on a custom 6,000-image dataset. shipped as a Chrome Extension.

nlp & llms

fine-tuning gpt to write like shakespeare

fine-tuned GPT-2 and GPT DaVinci on the Shakescleare dataset to reproduce Shakespearean prose. benchmarked against a Style Transformer using BLEU, ROUGE, and cosine similarity.

where's the alpha? scraping telegram with llms

built an end-to-end analytics tool that scrapes Telegram data, applies NER and TF-IDF to extract features, clusters messages via embeddings, and generates market insights using GPT-4.

optimisation

optimising a global microchip supply chain

formulated a mixed-integer optimisation model for a global microchip producer to minimise warehouse and transportation costs across 1,000 orders.

machine learning

swipe for travel planning

a fun side project with a friend - a travel planning app where you swipe through activities tinder-style to build a trip. i worked on data scraping, activity recommendations, and ranking. shipped to 100 real users.

predicting fetal health from heartbeat signals

built a multi-class classifier on cardiotocogram (CTG) data - fetal heart rate and uterine contraction signals - to classify fetal health states and support early clinical intervention.

targeting the right buyers for automotive market entry

predicted customer segments (A/B/C/D) for 2,627 new-market prospects, using imputation to handle missing data, to enable targeted sales outreach.

education

mit · masters - business analytics
2024
university of melbourne · bsc - mathematics & statistics
2020
university college london · exchange program - statistics
2019

vincent.tian72@gmail.com · san francisco bay area