
Review the role details and submit your application.





Job Description
Cookpad develops the world’s most user-friendly products for people who cook.
The use of AI and robotics broadly falls into two approaches: replacing human actions, or extending human capability. Our focus is on using these technologies to bring out people’s strengths.
Cooking affects our health, cognition, society, culture, and the environment. Our challenge is to make it possible—through our products—for anyone, anywhere, to cook at an unprecedentedly high standard.
The essence of this position
moment
helps people learn to cook in an innovative way with a personal coaching service. This service is completely based in AI using multimodal (text, vision, audio). The challenges at moment are not about incremental or partial improvements. What is required is the ability to identify the gap between the ideal learning experience and what AI can realistically deliver today, and then to narrow that gap by isolating and fully solving the single point where the greatest impact can be made right now.
Examples of Key Challenges
Video Analysis Domain: An elite coach can instantly discern from video the heat level, early signs of failure, and the appropriate corrective actions in cooking. The goal is to reproduce and extend this judgment using AI.
This is not a job focused on improving accuracy metrics, but on implementing the judgment of top-tier experts themselves.
Solving fundamental challenges in understanding cooking videos
Designing multimodal (video, audio, text) decision-making models
Designing coaching logic that takes conversational memory and learning state into account
Designing and implementing task / research agents
Submit your application for this role at Cookpad.