Overview
Company: Rosh Technologies LLC.
About Us:
Rosh Technologies is a leading technology solutions provider based in New Jersey. We provide software solutions to the financial services, real estate, insurance, and pharmaceutical industries.
Job Title: Generative AI Developer / Engineer
Location: Remote / Hybrid
Type: Contract
Experience Level: 2+ years in AI/ML, 1+ year in Generative AI applications
Job Description:
We are seeking an experienced Generative AI Developer with proven expertise in designing, developing, and deploying AI applications on cloud platforms such as AWS, Google Cloud Platform (GCP), and Microsoft Azure. The ideal candidate has hands-on experience with large language models (LLMs), agent-based AI implementations, and model development and deployment platforms such as Hugging Face, Amazon SageMaker, or similar.
Responsibilities:
- Design and implement end-to-end Generative AI solutions using LLMs and foundation models.
- Develop AI agents capable of reasoning, tool use, and memory, and deploy them on cloud platforms.
- Integrate and fine-tune models using Hugging Face Transformers, Amazon SageMaker, Azure ML, or GCP Vertex AI.
- Build APIs and microservices to serve generative models in production environments.
- Optimize model performance, latency, and cost across different deployment architectures.
- Collaborate with product teams to turn ideas into working AI-powered applications.
- Stay current with developments in LLM research, vector databases, RAG pipelines, and prompt engineering.
Required Skills:
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- Proficiency in Python and experience with machine learning frameworks (e.g., PyTorch, TensorFlow).
- Strong understanding of Generative AI techniques (text generation, image generation, summarization, etc.).
- Experience deploying models using AWS (SageMaker, Bedrock), GCP (Vertex AI, PaLM), or Azure (OpenAI, ML Studio).
- Familiarity with Hugging Face Transformers, LangChain, LlamaIndex, or similar ecosystems.
- Experience building and deploying AI agents or autonomous workflows.
- Exposure to vector databases such as FAISS, Pinecone, Weaviate, or ChromaDB.
- Strong DevOps mindset with experience in containerization (Docker), APIs, and CI/CD for AI workloads.
Preferred Qualifications:
- Hands-on experience with RAG (Retrieval-Augmented Generation) systems.
- Familiarity with multi-modal models (text, vision, audio).
- Experience in MLOps and model monitoring in production.
- Contributions to open-source GenAI projects or hackathons.
Job Types: Full-time, Part-time, Permanent, Freelance
Contract length: 4 months
Pay: ₹48,000.00 - ₹72,000.00 per year
Benefits:
- Flexible schedule
- Paid time off
- Work from home
Location Type:
- Remote
Schedule:
- US shift
Education:
- Bachelor's (Preferred)
Work Location: Remote