At Chatzy AI (chatzy.ai) we automate customer interactions across digital channels for businesses and build agentic AI systems that automate complex enterprise workflows. This article outlines a full-time internship opportunity focused on developing our intelligent document entity extraction stack—an essential part of our agentic AI—covering detailed responsibilities, required skills, and the multi-stage hiring process.
Role overview and core responsibilities
This role centers on building and improving the intelligent document entity extraction stack that streamlines enterprise workflows. The work is split into two complementary parts: primary responsibilities focused on extraction accuracy and real-world validation, and additional backend and deployment contributions that ensure the system performs reliably in production.
- Main responsibilities
- Develop and enhance the current system for extracting structured data from diverse documents, making the stack more accurate and robust.
- Integrate and optimize multiple third-party OCR APIs and LLM-based vision models to improve overall accuracy and performance.
- Implement and test extraction solutions in real-world scenarios such as KYC and document verification to validate effectiveness.
- Fine-tune vision or language models for specialized extraction tasks if required, adapting models to domain-specific document characteristics.
- Participate in client communication to gather requirements and organize demo sessions, explaining technical solutions clearly and simply.
- Additional contributions
- Contribute to the Chatzy AI backend by improving APIs, developing new features, and ensuring performance and scalability.
- Collaborate with AI and product teams to deploy and monitor solutions in production environments, ensuring the extraction stack meets enterprise needs.
Qualifications, hiring process, and growth path
Successful candidates will combine strong technical skills with clear communication and remote self-direction. This role begins as a full-time internship with strong potential for conversion to a full-time position based on performance and fit, including increased compensation and benefits.
- Key requirements
- Strong proficiency in Python, Node.js, and API development.
- Experience with AWS, OCR/Computer Vision APIs, and data cleaning.
- Solid understanding of data structures & algorithms (DSA) and SQL.
- Prior experience or projects involving OCR, NLP, or AI integrations is preferred.
- Excellent problem-solving, debugging, and communication skills, with the ability to translate client requirements into actionable technical tasks.
- Self-motivated and able to work independently in a remote environment (very important).
- Hiring process
- Application Screening – resume and portfolio/GitHub project review.
- Short Assignment Submission – a brief technical assignment to evaluate approach.
- Video Submission (via Loom.com) – a short explainer describing your assignment approach.
- Shortlisting Based on Video – evaluation for further rounds.
- 15-Minute Screening Call – assess communication and cultural fit.
- Two Interview Rounds:
- Technical Round: coding, system design, and problem-solving deep dive.
- Soft Skills & Scenario Round: teamwork, communication, and client interaction scenarios.
- Final HR Round – salary discussion and offer rollout.
In summary, this internship at Chatzy AI focuses on building an intelligent document entity extraction stack by integrating OCR and LLM-based vision models, validating solutions in KYC and document verification scenarios, and contributing to backend and production deployments. If you meet the technical and communication requirements and thrive working remotely, this role offers a clear path from internship to potential full-time conversion.