Dear Community,
Sixsentix is seeking a Gen(AI) Test Engineer to support next-generation AI testing, validation, and governance for enterprise clients. This is an on-site, full-time role based in Zurich, Switzerland.
About the role
You will design and validate testing frameworks for LLM applications, multi-agent systems, and RAG pipeline, ensuring AI systems are safe, reliable, compliant, and high-quality.
Key Responsibilities
AI System Validation
- Build testing frameworks for LLM apps, agents, and RAG
- Evaluate AI outputs (accuracy, relevance, tone, bias, safety)
- Implement AI governance workflows, risk scoring, and compliance reporting
- Set up continuous monitoring for production AI systems
Testing & Automation
- Validate LLM APIs (Postman, REST Assured, pytest)
- Integrate AI test suites into CI/CD pipelines
- Manage prompt versioning & test cases via Git
- Use MLOps tools (MLflow, Weights & Biases)
Data & Evaluation
- Handle data formats: JSON, CSV, Parquet, JSONL
- Build evaluation datasets & ground-truth benchmarks
- Design automated + human-in-the-loop evaluation workflows
Consulting & Leadership
- Act as SME and help build a high-performing team
- Support client relationships and engagement
- Contribute to Sixsentix consulting community
Your Profile
- Hands-on GenAI development (prompt engineering, API workflows)
- Knowledge of RAG, embeddings, vector DBs, multi-agent systems
- Strong understanding of hallucinations, prompt sensitivity, variability
- Solid background in software testing in enterprise settings
- Skilled in API testing, CI/CD, Git
- Data handling and AI evaluation experience
- Self-starter, comfortable with ambiguity
- Excellent communication & presentation skills
If this role fits your experience, you can apply directly via LinkedIn Easy Apply or the Sixsentix careers page.