Data Scientist & Applied Scientist

Published in IBM, 2023

  • Implemented CPU multimodal retrieval (Grounding DINO, SBERT-Whisper) boosting speed 31% and recall to 83%.
  • Orchestrated ancient map geocoding (68.6% acc) fusing GenAI correction and DBSCAN under custom validation.
  • Bootstrapped a few-shot SBERT classifier achieving 73.4% accuracy via 10-shot LLM labeling; deployed via ONNX.
  • Saved $1,000+/mo on 5M+ samples via a hybrid LLM/SBERT anomaly detection pipeline for sentiment analysis.
  • Boosted summarization throughput 5x and cut waste 21.7% via parallelized LLM inference on A100 HPC.