(AI, LLM, NLP) - Data Linguist - (Malay or European Portuguese Language is must)
Seattle, WA or Burlingame, CA
Fulltime (Permanent)
Job Description:
Experience: Experience in linguistic data processing, visualization, and machine learning environments. Familiarity with developing rules for Large Language Models.
Core Skills:
- Programming & Tools: Python, Regular Expressions, Data Processing, Data Visualization, Command Line, Scripting, Notebooks, Data Manipulation
- AI/ML & Data: LLM (Large Language Models), Machine Learning Models, Data Analysis
- Linguistics & Language: Malay Language, European Portuguese, Linguistic Analysis, Dialects, Varieties, Language Data
- Professional & Soft Skills: Communication, Teamwork, Flexibility, Adaptability, Learning Agility, Workflow Management
Key Responsibilities:
- Develop and release hard-coded rules for Large Language Models (LLM) using regular expressions.
- Adapt existing scripts for data manipulation in peer-developed notebooks and within the command line.
- Work with large amounts of language data within a client environment.
- Manage diverse and quickly changing workflows.
- Apply knowledge of linguistic differences across dialects/varieties.
- Understand the relationship between data and machine learning models.