Appen’s cover photo
Appen

Appen

IT Services and IT Consulting

Kirkland, Washington 1,053,631 followers

Appen is your trusted data partner, powering cutting-edge AI applications for the world's most innovative companies.

About us

Appen has been a leader in AI training data for over 25 years, providing high-quality, diverse datasets that power the world's leading AI models. Our end-to-end platform, deep expertise, and scalable human-in-the-loop services enable AI innovators to build and optimize cutting-edge models. We specialize in creating bespoke, human-generated data to train, fine-tune, and evaluate AI models across multiple domains, including generative AI, large language models (LLMs), computer vision, speech recognition, and more. Our solutions support critical AI functions such as supervised fine-tuning, reinforcement learning with human feedback (RLHF), model evaluation, and bias mitigation. Our advanced AI-assisted data annotation platform, combined with a global crowd of more than 1M contributors in over 200 countries, ensures the delivery of accurate and diverse datasets. Our commitment to quality, scalability, and ethical AI practices makes Appen a trusted partner for enterprises aiming to develop and deploy effective AI solutions. At Appen, we foster a culture of innovation, collaboration, and excellence. We value curiosity, accountability, and a commitment to delivering the highest-quality AI solutions. We support work-life balance with flexible work arrangements and a dynamic, results-driven environment. Employees have access to competitive pay, comprehensive benefits, and opportunities for continuous learning and career growth. Our team works closely with the world’s top technology companies and enterprises, tackling exciting challenges and shaping the future of artificial intelligence.

Website
http://appen.com
Industry
IT Services and IT Consulting
Company size
501-1,000 employees
Headquarters
Kirkland, Washington
Type
Public Company
Founded
1996
Specialties
Search, Annotation, Evaluation, Personalization, Transcription, Spam Detection, Translation and Localization, Data Collection, training data, artificial intelligence , machine learning, data preparation, model evaluation, datasets, computer vision, natural language processing, LLM, and generative ai

Locations

Employees at Appen

Updates

  • View organization page for Appen

    1,053,631 followers

    Our team attended EMNLP 2025 in Suzhou, China, where one theme stood out across many sessions - multilingual NLP that reflects real human communication. This year’s focus on code-switching, dialectal variation, and regional language diversity marks a turning point for how we build and evaluate language models. The research community is making it clear: models need to handle how people actually speak, blending languages, borrowing words, and using regional expressions in everyday conversation. At Appen, this aligns directly with our mission to deliver inclusive, culturally grounded data for AI systems that truly understand their users. 📘 Read our EMNLP recap by Victor Vilarubia, Director of Crowd Success, to explore what’s next for multilingual AI and why closing the dialect gap matters: 👉 https://lnkd.in/e-p8k3SF

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    Webinar: Beyond the Leaderboard: Bridging Research and Real-World AI Performance Benchmark scores don’t always tell the full story of how AI performs in practice. Join Daniel Dahlmeier, Chief Data Scientist at SAP, and Si Chen, VP of Strategy & Marketing at Appen, for a live discussion on evaluating AI systems beyond leaderboard metrics, focusing on real-world reliability, safety, and context. 📅 Nov 11 - 7:00 PM ET / 4:00 PM PT | Nov 12, 2025 - 8:00 AM SGT 💻 Live Webinar (Microsoft Teams) 👉 Register Now: https://lnkd.in/e7TZ3hJP

    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    Appen is excited to join AI Summit Seoul & Expo as an exhibitor! 📍 Booth #A104 | COEX Grand Ballroom + Hall B Exhibition, Seoul, Korea 📅 November 10-11, 2025 Our team will be onsite to share how Appen helps global innovators develop safer and more reliable AI with diverse, high-quality speech and multimodal data. Attending AI Summit Seoul? Stop by our booth to connect and continue the conversation.

    • AI Summit Seoul & Expo - Appen
  • View organization page for Appen

    1,053,631 followers

    What Is Sociophonetics and Why It Matters for AI Sociophonetics examines how social meaning is encoded in speech through accent, intonation, rhythm, and pronunciation. For AI, that understanding is critical. Speech systems often underperform when voices diverge from the accents they were trained on, leading to: • Accent bias and elevated word-error rates • Misrecognition of regional or community speech • Exclusion of underrepresented speakers A sociophonetic lens helps AI teams design for diversity, model prosody and pronunciation as meaningful signals, and evaluate systems across accents, not just across languages. At Appen, we apply these principles in how we collect and annotate speech data, balancing by region, age, and community, and embedding phonetic, prosodic, and quality metadata into every dataset. The result: speech AI that performs more equitably and sounds more natural across dialects. 📘 Read the full blog: https://lnkd.in/gMKZzyKr

    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    Last week, Appen hosted the Responsible AI Fireside Chat at our Hyderabad office 🇮🇳, which brought together leaders from Bosch, CBRE, PwC, and more. On stage, Rajesh Dhuddu (PhD), Partner & Leader, Emerging Tech at PwC, and Flt Lt Bipin Chandra Dutt Pendyala, GM India Operations at Appen, led a powerful discussion on building responsible AI. Topics included why many AI projects fail, how bias is embedded at the foundational level, and the importance of guardrails, PII protection, and human-in-the-loop. They also shared examples of cyber risks like prompt injection and highlighted frameworks shaping the industry, from India’s DPDP Act to NIST standards. Real-world applications were also explored across healthcare, finance, telecom, and compliance - from credit decisioning with built-in safeguards to multilingual model development for Indic languages. Together, these insights reinforced a clear takeaway: building trustworthy AI requires high-quality data, strong governance, and explainability, paired with practical steps that balance innovation with risk. Stay tuned for a detailed debrief.

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
      +1
  • View organization page for Appen

    1,053,631 followers

    This week in Seattle, Appen partnered with AI Circle for a fireside chat on one of the most transformative topics in AI: Retrieval-Augmented Generation (RAG). Our speakers: Vladimir Karpukhin, TPM at Google and co-author of the original RAG paper Si Chen, VP of Strategy & Marketing at Appen Together, they unpacked: - Why the original idea of jointly training retrieval and generation was a turning point and how RAG only surged in adoption once LLMs like ChatGPT opened the door for enterprise use. - The hidden challenges of real-world deployment, from scaling vector databases to adapting retrieval for domain-specific data. - The critical role of high-quality, diverse datasets in reducing hallucinations and powering reliable AI systems. - What’s next: agentic retrieval, multi-modal augmentation, and infrastructure innovations that could reshape how enterprises apply AI. Thank you to AI Circle and to everyone who joined us in Seattle. 💡 Stay connected - our upcoming recap will dive deeper into the ideas shared on stage.

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    Appen x AI Circle Fireside Chat - Seattle We’re excited to bring together leading voices in AI for an evening of insight, innovation, and connection. 🎤 Speakers Vladimir Karpukhin – Co-author of the seminal RAG paper; former researcher at Meta; now at Google Si Chen – VP, Strategy & Marketing, Appen What to Expect A deep dive into the origins, evolution, and future of RAG - plus candid perspectives on where AI is headed Networking with innovators: technologists, founders, and leaders shaping the AI ecosystem Happy hour atmosphere with hors d’oeuvres, beer, and wine This event is designed for AI builders, researchers, and business leaders who want to engage directly with the people shaping the next wave of innovation. Date & Time: Wednesday, September 17, 7-9 PM PT 👉 Reserve your spot now: https://luma.com/gsm4ckop

    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    🔊 Reflections from Interspeech 2025 One of the clearest signals from last week’s discussions: the future of speech technology will depend on data that is both broader in coverage and deeper in expertise. Across conversations, a few themes stood out: ✅ Low-resource languages remain a bottleneck. As models scale and fine-tune, the need for high-quality data in underrepresented languages is more urgent than ever. True global inclusivity in speech AI requires solving this gap. ✅ Medical speech data is in high demand. From diagnostics to patient interaction, researchers are seeking domain-specific datasets - paired with annotators who bring subject-matter expertise. Accuracy in these use cases is not optional; it’s critical. ✅ Specialized domains of speech are under active research. Work around speech disorders and children’s speech shows the field is expanding to areas where precision, sensitivity, and ethical responsibility matter most. At Appen, we see these conversations as a reminder that advancing speech AI isn’t only about larger models - it’s about more representative data, guided by human expertise. We’re excited to keep working with the research community to address these challenges and ensure the next generation of speech technologies is accessible, ethical, and effective. Tammy Ann Haskins, Giorgia Marcucci, George Krasovitsky

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    Appen is sponsoring Interspeech 2025 in Rotterdam. 🇳🇱 The conference theme, Fair and Inclusive Speech Science and Technology, reflects the importance of ensuring speech technologies represent diverse voices, languages, and communities. 📍 Booth: 3rd Floor 📅 August 18-21, 2025 You’ll also find Appen at the Speech Science Festival on August 17, connecting with attendees and discussing advancements in speech technology. Our team - George Krasovitsky, Giorgia Marcucci, and Tammy Ann Haskins - will be available to share how Appen supports global innovators with diverse, high-quality speech and multimodal data to build safer, more reliable AI. 👉 Attending Interspeech? Visit our booth to continue the conversation.

    • No alternative text description for this image
  • View organization page for Appen

    1,053,631 followers

    Association for Computational Linguistics 2025 brought together the global NLP community, and Appen was there as a sponsor to connect, learn, and share ideas. This year saw record participation, with an increasingly global mix of researchers, practitioners, and industry leaders coming together to shape the future of language AI. From our booth, we connected with customers, academics, and innovators to explore cutting-edge research developments and the role of high-quality, human-aligned training data across LLMs, translation and localisation, speech and dialogue systems, and enterprise AI applications. Key themes from ACL 2025 that are shaping the NLP landscape: • Generalization of language models – Development towards models that perform reliably and fairly on unseen data, across domains, and in diverse contexts—reflecting real-world complexity and human-like intelligence • Human feedback for AI alignment – Moving beyond a single “gold standard” to incorporate diverse and sometimes conflicting human values into model alignment frameworks • Multilingual performance and cultural nuance – Addressing persistent performance and alignment gaps for low-resource languages, and building systems that work for all communities, not just the most resourced. • Responsible AI at Scale – Balancing speed, efficiency, and ethical safeguards to ensure LLMs remain trustworthy as they become more powerful. Thank you to everyone who visited our booth to exchange ideas, share challenges, and imagine what’s next for responsible AI. We leave ACL 2025 inspired and energized to continue advancing inclusive, safe, and high-performing AI systems through exceptional human-in-the-loop data. Ryan Kolln, Si Chen, Sergio Bruccoleri, Brian Jenkins, Christian Neff, George Krasovitsky

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
      +1

Similar pages

Browse jobs