
Benjamin Shvartsman
Resourceful CS/AI student at Cornell with hands-on experience in software development, machine learning, and data engineering.
Passionate about building AI applications and solving real-world problems through close collaboration with users and stakeholders.
Thrive in fast-paced, ambiguous environments; driven by curiosity, a love for tough challenges and a desire to learn new technologies to sharpen my technical and analytical skills.
Experience

Machine Learning Engineer
Ecentria · Internship
Sep 2023 - Sep 2024 · 1 yr
- Clustering of users and products on retail website OpticsPlanet.com, utilizing the user and product embeddings from their matrix factorization-based Recommender System
- Recommendation engine to drive cross-sell and up-sell strategies across both consumer and government sectors
- Real-time debugging platform, integrating API clients to diagnose and resolve product similarity issues in production
- Conducted data preprocessing, cleaning, and analysis to support model training and validation
Writings
thoughts on technology, engineering, and problem-solving
Data Labeling and Compute: Rethinking RL for the next AI Jump
This report looks at the interplay between compute scaling, data labeling, and RL effectiveness, sparked by xAI's Grok 4 release showing limited gains despite 10x compute boost. It contrasts pretraining's exponential reasoning gains with RL's challenges. I aim to explore how RL with human feedback could optimize data use.
Case Studies & Projects
real-world problem → technical solution → business value presentation
Precision Health with Palantir Foundry and ElevenLabs
A precision health platform combining an operational clinician dashboard, a real-time clinical decision support system, and a patient-facing application with a voice AI agent for seamless, real-world data collection, all deeply integrated into foundry's ontology and aip platform
AI-Powered Government Supplier Procurement
AI-Powered Government Supplier Procurement For Solving Mission Variability & Equipment Complexity
Hospital Post-Op Coordination Dashboard
A real-time hospital dashboard system that optimizes post-operative patient care through AI-driven queue prioritization, automated equipment tracking, and seamless nurse-doctor coordination. Built with Palantir Foundry ontology integration and OpenAI-powered smart workflows.
Medical Conversation Optimizer Engine
Guard‑railed chat contextualised to their health data with perplexity-like prompt suggestions for guided continuation.

Medical Agentic RAG
A medical chatbot powered by CrewAI that combines real-time RAG with interactive document analysis. The system uses specialized agents to retrieve information from trusted medical sources, Reddit experiences, and uploaded medical documents, while providing NotebookLM-style document summaries and topic-specific discussions.

Parking Payment Automation
Developed a real-time automated parking payment system to reduce costly fines for university sports facilities with estimated throughput of $3,500/week in transaction processing via SMS-triggered workflows, integrating automated scripting and multiprocessing for immediate payment execution.
Automation of Bulk USPTO Trademark Risk Assessment and Domain Availability
A comprehensive trademark conflict detection and domain availability checking system that combines USPTO trademark data with AI-powered risk assessment and domain availability verification.
Whoop Cornell Wrestling Team Aggregated Insights
Created data pipeline for visualizing and analyzing the captains' Whoop longitudinal wearable data to uncover insights into Cornell Wreslting team's recovery, performance, and sentiment.
Certifications & Coursework
continuous learning and skill development

Cornell University Coursework
Natural Language Processing, Computer Systems and Organization, OOP & Data Structures, Linear Algebra, Mathematical Foundations of Computing, Probability Models and Inference, Introduction to Machine Learning

Reliable Google Cloud Infrastructure: Design and Process by Google Cloud Skills Boost
Course on designing and implementing reliable, scalable cloud infrastructure on Google Cloud Platform using best practices for system architecture, fault tolerance, and operational processes.

Google Cloud Computing Fundamentals: Cloud Computing Fundamentals by Google Cloud Skills Boost
Course covering essential cloud computing concepts, Google Cloud Platform services, and foundational skills for deploying and managing applications in the cloud.

DSPy: Build and Optimize Agentic Apps by Deeplearning.AI
Comprehensive course on building, debugging, and optimizing modular GenAI agentic applications using DSPy's module-based programming model with MLflow integration and automated prompt tuning.

Safe and Reliable AI via Guardrails by Deeplearning.AI
Course on implementing AI guardrails to mitigate LLM failure modes like hallucinations and data leaks by adding input/output validation controls to applications like RAG-powered chatbots.

Practical Multi AI Agents and Advanced Use Cases with crewAI by Deeplearning.AI
Course on building collaborative multi-agent systems with external tool integration, performance evaluation, and human feedback optimization for automating complex business tasks like project planning and content creation.

Multi AI Agent Systems with crewAI by Deeplearning.AI
Course on designing collaborative AI agent teams using crewAI to automate complex multi-step business processes by assigning specialized roles and breaking down tasks across multiple customized agents.

Building Agentic RAG with LlamaIndex by Deeplearning.AI
Course on building and designing various AI agents including document reasoning agents, router agents for Q&A and summarization, and research agents with multi-document handling capabilities and debugging controls.