Applied Scientist

₹ 28,00,000 - 45,00,000
In-Office
14 Jul 2026

AI & ML

Full Time

14 Jul 2026

The selected candidate’s day-to-day responsibilities center on building and improving advanced AI and machine learning capabilities across human language and vision technologies. The work includes developing novel algorithms and modelling techniques, advancing the state-of-the-art in several core areas, and contributing to large-scale systems that serve millions of users. The scope spans speech, text, and structured data, with an emphasis on working with large-scale computing resources and diverse data sources. This role brings together multiple AI disciplines in a way that supports high-performing systems and practical impact.

Core Focus of the Role

The role is defined by a strong focus on human language and vision technologies. Rather than concentrating on a single task, the work covers a broad set of AI and ML responsibilities that connect language understanding, speech processing, and computer vision. The selected candidate is expected to develop novel algorithms and modelling techniques, which means the work is centered on creating new approaches rather than simply applying existing ones. This makes the role especially relevant for work that requires both technical depth and flexibility across different AI areas.

Another important part of the role is the expectation to advance the state-of-the-art in multiple domains. These include ASR, MT, NLU, Text-to-Speech (TTS), Dialogue Management, and Computer Vision. Each of these areas contributes to the broader goal of improving how systems understand, generate, and interact with language and visual information. The responsibilities suggest a setting where progress in one area can support progress in others, especially when building integrated AI/ML systems.

The role also emphasizes scale. The selected candidate will work with large-scale computing resources and diverse data sources, including speech, text, and structured data. This indicates that the work is not limited to small experiments or isolated models, but instead involves systems and data environments that require careful handling and strong technical execution. The overall focus is on contributing to large-scale, high-performing AI/ML systems that have meaningful reach.

Contribute to building large-scale, high-performing AI/ML systems that impact millions of users.

Key areas included in the role

Human language technologies
Vision technologies
Novel algorithms
Modelling techniques
Large-scale computing resources
Diverse data sources

Developing Novel Algorithms and Modelling Techniques

A central responsibility in this role is the development of novel algorithms and modelling techniques. This points to work that requires originality, technical reasoning, and the ability to improve how AI and ML systems are designed. The phrase “novel” highlights that the candidate is expected to contribute new ideas or methods within human language and vision technologies. The work is therefore not only about implementation, but also about advancing the underlying approach used to solve problems.

Because the role spans both language and vision technologies, the algorithms and modelling techniques may need to support different kinds of data and tasks. Speech, text, and structured data are all part of the environment described, which means the candidate must work across varied inputs. This kind of responsibility often requires careful modelling choices so that systems can perform well across different sources of information. The role therefore combines innovation with practical system-building.

The emphasis on modelling techniques also suggests a need to think about how AI systems represent and process information. In the context of ASR, MT, NLU, TTS, Dialogue Management, and Computer Vision, modelling is a key part of making systems more accurate, more capable, and more useful. The selected candidate’s work contributes to this improvement by helping shape the methods used in large-scale AI/ML systems. That makes the role important not only for individual components, but also for the overall quality of the systems being built.

What this responsibility involves

Creating new algorithmic approaches
Improving modelling methods for language and vision tasks
Working across different data types
Supporting stronger AI/ML system performance

Advancing Speech, Language, and Vision Technologies

The role includes advancing the state-of-the-art in several major AI areas. These areas are ASR, MT, NLU, Text-to-Speech (TTS), Dialogue Management, and Computer Vision. Together, they show that the selected candidate will work across both understanding and generation tasks, as well as visual intelligence. The responsibilities are broad, but they are connected by a common goal: improving how AI systems interact with language and visual information.

ASR and TTS reflect the speech side of the work, while MT and NLU reflect language translation and understanding. Dialogue Management adds the interaction layer, where systems must manage conversational flow. Computer Vision extends the scope into visual technologies, showing that the role is not limited to text or speech alone. This combination makes the position especially relevant to integrated AI systems that need to handle multiple forms of input and output.

Advancing the state-of-the-art means contributing to improvements that push beyond current methods. In practical terms, this can involve refining models, improving system behavior, or strengthening how different technologies work together. The role does not isolate one domain from another; instead, it places them within a shared environment of large-scale AI/ML development. That makes the work both specialized and interconnected.

Technologies named in the responsibilities

ASR
MT
NLU
Text-to-Speech (TTS)
Dialogue Management
Computer Vision

Working with Large-Scale Computing Resources and Diverse Data

The selected candidate will work with large-scale computing resources, which indicates that the role involves substantial technical infrastructure. This is important because the responsibilities are tied to building large-scale, high-performing AI/ML systems. Large-scale computing resources support the development, testing, and improvement of systems that must operate effectively across demanding workloads and complex tasks. The role therefore requires comfort with environments where scale is a defining feature.

Equally important is the use of diverse data sources. The content specifically identifies speech, text, and structured data, showing that the work involves multiple forms of information. Each data source contributes something different to the AI/ML process, and the role requires the candidate to work with all of them. This variety reinforces the broad nature of the responsibilities and the need to handle different kinds of inputs within one technical setting.

The combination of large-scale computing and diverse data sources suggests a role that sits at the intersection of infrastructure and model development. The candidate is not only expected to create algorithms and modelling techniques, but also to do so in an environment where data variety and computational scale matter. That makes the work especially relevant to systems that need to be both robust and high-performing. It also supports the broader goal of impact at scale.

Data sources mentioned in the role

Speech
Text
Structured data

Why scale matters here

Supports large-scale AI/ML systems
Helps manage diverse data sources
Aligns with high-performing system goals

Contributing to High-Performing AI/ML Systems

One of the most important outcomes of the role is contributing to large-scale, high-performing AI/ML systems. This phrase captures both the technical ambition and the practical purpose of the work. The selected candidate’s responsibilities are not limited to research or isolated development; they are tied to systems that are intended to perform well at scale. That makes reliability, effectiveness, and system quality central themes in the role.

The content also states that these systems impact millions of users. This shows that the work has broad reach and that the systems being built are intended for meaningful use. The role therefore connects technical development with user impact, even though the content does not provide additional details about the users or the systems themselves. The key point is that the work contributes to technology with wide-scale influence.

Because the role spans human language and vision technologies, the systems being built likely need to bring together multiple capabilities. The candidate’s work in algorithms, modelling, and state-of-the-art advancement supports this broader objective. The responsibilities point to an environment where technical improvements can translate into stronger system performance and wider impact. This makes the role both technically demanding and strategically important.

System-level priorities reflected in the role

Large-scale AI/ML systems
High-performing system behavior
Impact on millions of users
Integration of language and vision technologies

Frequently Asked Questions

What is the main focus of the selected candidate’s day-to-day responsibilities?

The main focus is developing novel algorithms and modelling techniques in human language and vision technologies. The role also includes advancing the state-of-the-art in ASR, MT, NLU, Text-to-Speech (TTS), Dialogue Management, and Computer Vision. These responsibilities are centered on building and improving AI/ML systems.

Which AI areas are included in the responsibilities?

The responsibilities include ASR, MT, NLU, Text-to-Speech (TTS), Dialogue Management, and Computer Vision. These areas cover speech, language understanding, language generation, interaction, and visual technologies. Together, they show the broad technical scope of the role.

What kinds of data sources are mentioned?

The content specifically mentions speech, text, and structured data. The selected candidate will work with these diverse data sources while contributing to large-scale AI/ML systems. This indicates that the role involves handling multiple forms of information.

What kind of computing environment is part of the role?

The role involves working with large-scale computing resources. This supports the development of large-scale, high-performing AI/ML systems. The computing environment is therefore an important part of the work, alongside algorithm and model development.

What is the expected impact of the systems being built?

The content states that the selected candidate will contribute to building large-scale, high-performing AI/ML systems that impact millions of users. This shows that the work is intended to have broad reach. The role connects technical development with meaningful user impact.

Does the role focus only on language technologies?

No. The role covers both human language and vision technologies. It includes language-related areas such as ASR, MT, NLU, TTS, and Dialogue Management, as well as Computer Vision. This makes the scope broader than language alone.

Conclusion

The selected candidate’s day-to-day responsibilities bring together innovation, scale, and broad technical scope. The work includes developing novel algorithms and modelling techniques, advancing the state-of-the-art across speech, language, and vision technologies, and working with large-scale computing resources and diverse data sources. It also contributes to large-scale, high-performing AI/ML systems that impact millions of users. Taken together, these responsibilities describe a role centered on meaningful technical progress across multiple AI domains. The emphasis is on building systems that are both advanced and impactful.

Share this post –