
Research Scientist Intern, Speech & Audio Technologies (PhD) London, England
Job description
Responsibilities:
- Perform research to advance the science and technology of intelligent machines.
- Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources.
- Contribute research that can be applied to Meta product development.
- Analyze and improve efficiency, scalability, and stability of various deployed systems.
- Collaborate with team members from prototyping to production.
Minimum Qualifications:
- Currently has, or is in the process of obtaining, a PhD degree.
- Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.
- Experience in C/C++ and Python.
- Experience in deep learning frameworks (PyTorch, TensorFlow, …).
- Research and/or work experience in machine learning, deep learning, and/or speech technology.
Preferred Qualifications:
- Intent to return to the degree program after the completion of the internship/co-op.
- Experience manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources.
- Proven track record of achieving results, as demonstrated by grants, fellowships, and patents, as well as first-authored publications at workshops or conferences such as Interspeech, ICASSP or similar.
- A strong interest in theoretical and empirical research and in answering hard questions with research.
- Interpersonal experience: cross-group and cross-culture collaboration.
- Ability to stay current with the literature of a particular domain and to reproduce results if needed.
- Experience training deep neural networks for key speech tasks such as speech recognition, speech synthesis, speech translation, speaker diarization, sentiment analysis, acoustic event recognition, scene understanding, wake word, etc.
- Experience working with other modalities such as vision and text understanding is a plus.
