Overview
At Unsiloed AI, we are building AI Agents for Unstructured Financial Data. Our proprietary multi-modal vision language models enable us to automate complex workflows, empowering businesses to extract, process, and analyze data efficiently. We are backed by a leading startup accelerator based out of Silicon Valley.
Founded by IIT Kharagpur graduates, our team brings extensive research experience from institutions like MIT and Mercedes-Benz Research, combined with work experience across wealth management and high-frequency trading firms.
Role:
We are looking for a passionate and driven Machine Learning Engineer to join our team. This is a research-heavy role, ideal for candidates who thrive on reading and implementing advanced research papers, devising innovative techniques, and working on cutting-edge AI challenges. You will play a pivotal role in building, training, fine-tuning, and deploying multi-modal models tailored for extracting and processing complex unstructured financial data, such as PDFs, tables, charts, and transcripts.
Key Responsibilities
- Work on cutting-edge OCR and VLM research with real-world applications.
- Set up model training, evaluation, and deployment pipelines on cloud infrastructure (AWS/GCP/Azure).
- Implement CI/CD workflows for ML models using Docker, Kubernetes, and GitHub Actions.
- Optimize inference on GPUs/TPUs and deploy models using TensorRT, ONNX, or Triton Inference Server.
Requirements
- 1 to 3 years of experience in deep research in Document AI, Computer Vision, or Vision-Language Models (VLMs)
- Strong knowledge of deep learning frameworks (PyTorch, TensorFlow, JAX).
- Experience in training large-scale models on distributed compute clusters (e.g., Ray, FSDP, DeepSpeed).
- Experience with MLOps tools (Kubernetes, Docker, Airflow, MLflow, Weights & Biases, SageMaker, etc.).
- Strong Python and software engineering skills, with experience in writing scalable, production-grade ML code.
- Top-tier research background (NeurIPS, CVPR, ICML, ACL, ICLR) with strong publications in Computer Vision, Document AI, or Vision Language Models.
What We Offer:
- Hybrid Work
- Mentorship directly from founders with backgrounds at IIT, MIT, HFTs, and wealth management firms.
- Be part of a fast-moving team solving hard AI problems in document intelligence.
- A potential path to relocate to SF is future based on performance