Proxify’s mission is to connect outstanding developers worldwide with the opportunities they truly deserve. No matter where you’re located, we’re here to help you accelerate your independent career in the right direction.
Since launching, developers in the Proxify network have collaborated with more than 1,200 satisfied clients to build products and deliver growth-focused features. Over 5,000 skilled developers trust Proxify and its community to support their goals and ambitions.
Proxify is powered by a global network of talented, supportive developers seeking remote full-time roles. Our Glassdoor (4.5/5) and Trustpilot (4.8/5) ratings reflect the confidence developers place in us and our commitment to their long-term success.
The Role:
We are seeking a Senior AI Engineer to lead the development and scaling of our LLM-driven solutions. While your core strength is Python, this role is designed for an expert who understands the full generative AI lifecycle—from prompt design and RAG architectures to fine-tuning and production-ready deployment. You will be responsible for converting raw model capabilities into reliable, scalable backend services.
What we are looking for:
Expert-level Python skills with a strong grasp of asynchronous programming and backend system design.
Demonstrated experience building and deploying production-grade applications powered by Large Language Models.
Strong experience with both SQL and NoSQL databases, including optimizing them for high-dimensional vector search workloads.
Experience deploying AI services in cloud environments such as AWS, GCP, or Azure, along with CI/CD management for AI pipelines.
A first-principles mindset for tackling the unique challenges of non-deterministic AI behavior.
Hands-on experience fine-tuning open-source models using approaches such as LoRA or QLoRA.
Familiarity with agent-based frameworks and autonomous AI systems.
Background in traditional NLP (such as SpaCy or NLTK) or classical machine learning techniques.
Responsibilities:
Design and implement advanced AI workflows using frameworks like LangChain, LlamaIndex, or Haystack.
Build and optimize Retrieval-Augmented Generation pipelines, including vector database management (Pinecone, Weaviate, or Milvus) and sophisticated indexing strategies.
Assess and select appropriate models (OpenAI, Anthropic, or open-source options like Llama 3) based on cost, latency, and performance trade-offs.
Develop high-performance Python backend services using FastAPI or Flask to power AI features for a global user base.
Create strong evaluation frameworks to measure LLM quality, reduce hallucinations, and monitor production behavior using tools such as LangSmith or Arize Phoenix.
Partner closely with product and engineering teams to identify high-impact AI use cases and navigate the technical trade-offs inherent in generative AI systems.
What we offer:
Get paid, not played
No more unreliable clients. Receive on-time monthly payments with flexible withdrawal options.
Predictable project hours
Maintain a healthy work-life balance through consistent 8-hour workdays alongside client teams.
Flex days to recharge
Enjoy up to 24 paid flex days per year for full-time roles secured through Proxify.
Career-accelerating opportunities
Access exclusive long-term remote roles with some of the world’s most exciting companies.
Hand-picked opportunities, just for you
Avoid common recruitment friction with personally matched positions.
One seamless process, multiple opportunities
Complete one onboarding process to unlock multiple roles, without repeated assessments.
Compensation
Receive the same reliable monthly pay for all positions secured through Proxify.
Talent has no borders. Proxify's mission is to connect top developers around the world with the opportunities they deserve. So, it doesn't matter where you are; we are here to help you fast-track your independent career in the right direction.
Employee Type: Full-time
Location: Anywhere in the World
Job Type: All Other Remote
Applicants: 0
Date posted: Feb 17, 2026