Job Description
Language Engineer, Artificial General Intelligence - Data Services

The Amazon Artificial General Intelligence (AGI) Data Services organization is responsible for developing diverse datasets to train and evaluate the Amazon AI models. We are looking for Language Engineers to join our science and engineering team to support the development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and human-in-the-loop data collections.

You will play a critical role in driving innovation and advancing the state of the art in evaluating and training AI models. You will work closely with cross-functional teams, including product managers, engineers, and data scientists, to ensure that our AI systems are best in class.

Key job responsibilities
- Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables
- Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches
- Analyze and extract insights from large amounts of data
- Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language
- Use modeling tools to bootstrap or test new AI functionalities
- Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models

About the team
Amazon strives to be the world's most customer-centric company, where customers can research and purchase anything they might want online or offline. We set big goals and are looking for people who can help us reach and exceed them. The AGI organization provides AI capabilities for a variety of Amazon products and searches. We provide secure, flexible, cost-effective, and high-quality data development services to our customers that enable them to build advanced ML models.

Basic Qualifications
- Experience owning and executing language data collection projects, including guidelines, labelset, and annotation workflow development
- Master's or higher degree in a relevant field (Computational Linguistics or equivalent field with computational analysis)
- 2+ years of experience in computational linguistics, language data processing, or AI data creation
- Experience with language data annotation systems and other forms of data markup
- Proficiency with scripting languages, such as Python
- Experience working with speech, text, and multimodal data in multiple languages
- Excellent communication skills, strong organizational skills, and close attention to detail
- Comfortable working in a fast-paced, highly collaborative, dynamic work environment

Preferred Qualifications
- PhD in Computational Linguistics (or equivalent field with computational emphasis)
- Expertise in bootstrapping AI data collections for quickly evolving requirements, including for complex agentic functions
- Practical experience with Machine Learning and technical concepts such as APIs
- Practical knowledge of version control, agile development, database queries, and data analysis processes (SQL, R, Matlab, etc.)
- Ability to think creatively, with strong analytical and problem-solving skills

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.