Wei Liu

Wei Liu · 2025-02-10T14:24:23.485Z

LLM and Human alignment with simple linear mapping

London, England, United Kingdom
848 followers 500+ connections

View mutual connections with Wei

Welcome back

Email or phone

Password

Forgot password?

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

New to LinkedIn? Join now

Join to view profile

Amazon

The University of Sheffield

Personal Website

About

Research and apply NLP/Machine Learning techniques to solve interesting problems. Enjoy…

Articles by Wei

Don't ignore the bias in your ML system

Jan 12, 2020

Don't ignore the bias in your ML system

Just days before Christmas, I came across a Kaggle-like competition in China - CCF BDCI2019 金融信息负面及主体判定…
On China's cashless payment

Nov 18, 2018

On China's cashless payment

I travel back to China every year to visit my family. Every time I was overwhelmed by how fast things has changed, from…

1 Comment

Activity

As Microsoft marks its incredible 50th Anniversary, I find myself reflecting on my own journey with this amazing company. Having spent a decade at…

As Microsoft marks its incredible 50th Anniversary, I find myself reflecting on my own journey with this amazing company. Having spent a decade at…

Liked by Wei Liu
Happy and excited for all the teams responsible for the launch of https://nova.amazon.com! Congrats! Read more at https://lnkd.in/erDXU9_j

Happy and excited for all the teams responsible for the launch of https://nova.amazon.com! Congrats! Read more at https://lnkd.in/erDXU9_j

Liked by Wei Liu
LLM and Human alignment with simple linear mapping

LLM and Human alignment with simple linear mapping

Shared by Wei Liu

Join now to see all activity

Experience

Amazon

London, England, United Kingdom
-

London, United Kingdom
-

London, United Kingdom
-

London, United Kingdom
-

London, United Kingdom
-

London, United Kingdom
-
-

Shenzhen, Guangdong, China

Education

The University of Sheffield

2002 - 2003

Activities and Societies: CSSA (Chinese Students and Scholars Association)

deploy,maintain,upgrade the community website and discussion board.
1998 - 2002

计算机科学与应用

Licenses & Certifications

Deep Learning Specialization

Coursera

Issued Feb 2018

Credential ID 8X44HB8DTY6R

See credential
Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization

Coursera Course Certificates

Issued Dec 2017

Credential ID CME8JD54E9VT

See credential
Structuring Machine Learning Projects

Coursera Course Certificates

Issued Dec 2017

Credential ID KRL8SBAEJVGQ

See credential
Neural Networks and Deep Learning

Coursera Course Certificates

Issued Nov 2017

Credential ID MV7CDE7W3BG6

See credential
Neural Networks for Machine Learning

Coursera Course Certificates

Issued Jan 2017

Credential ID WXAURKMTWELU

See credential
Statistical Inference

Coursera Course Certificates

Issued Mar 2016

Credential ID NXCAJBK8YPD2

See credential
Duolingo Proficiency Exam in English: Expert

Duolingo

Issued Nov 2014

Credential ID 10.0/10

See credential

Publications

Efficient Minimal Perfect Hash Language Models

LREC 2010 May 20, 2010
The availability of large collections of text have made it possible to build language models that incorporate counts of billions of n-grams. This paper proposes two new methods of efficiently storing large language models that allow O(1) random access and use significantly less space than all known approaches. We introduce two novel data structures that take advantage of the distribution of n-grams in corpora and make use of various numbers of minimal perfect hashes to compactly store language…

The availability of large collections of text have made it possible to build language models that incorporate counts of billions of n-grams. This paper proposes two new methods of efficiently storing large language models that allow O(1) random access and use significantly less space than all known approaches. We introduce two novel data structures that take advantage of the distribution of n-grams in corpora and make use of various numbers of minimal perfect hashes to compactly store language models containing full frequency counts of billions of n-grams using 2.5 Bytes per n-gram and language models of quantized probabilities using 2.26 Bytes per n-gram. These methods allow language processing applications to take advantage of much larger language models than previously was possible using the same hardware and we additionally describe how they can be used in a distributed environment to store even larger models. We show that our approaches are simple to implement and can easily be combined with pruning and quantization to achieve additional reductions in the size of the language model.

Other authors
See publication
Professor or screaming beast? - Detecting Word Misuse in Chinese

LREC 2008 May 19, 2008

The Internet has become a very popular platform for communication around the world. However because most modern computer
keyboards are Latin-based, Asian language speakers (such as Chinese) cannot input characters (Hanzi) directly with these keyboards. As
a result, methods for representing Chinese characters using Latin alphabets were introduced. The most popular method among these is
the Pinyin input system. Pinyin is also called ”Romanised” Chinese in that it phonetically resembles a…

The Internet has become a very popular platform for communication around the world. However because most modern computer
keyboards are Latin-based, Asian language speakers (such as Chinese) cannot input characters (Hanzi) directly with these keyboards. As
a result, methods for representing Chinese characters using Latin alphabets were introduced. The most popular method among these is
the Pinyin input system. Pinyin is also called ”Romanised” Chinese in that it phonetically resembles a Chinese character. Due to the
highly ambiguous mapping from Pinyin to Chinese characters, word misuses can occur using standard computer keyboard, and more
commonly so in internet chat-rooms or instant messengers where the language used is less formal. In this paper we aim to develop a
system that can automatically identify such anomalies, whether they are simple typos intentional substitutions. After identifying them,
the system should suggest the correct word to be used.

See publication
Chinese Text Classification without Automatic Word Segmentation

ALPIT September 15, 2007
Due to the lack of word boundaries in Asian systems of writing, machine processing of these languages often involves segmenting text into word units. This paper tests the assumption that this segmentation is a necessary step for authorship attribution and topic classification tasks in Chinese, and demonstrates that it is not. We show extensive results for both tasks, considering both single words and short phrases as features, and examining the effect of document length on classification…

Due to the lack of word boundaries in Asian systems of writing, machine processing of these languages often involves segmenting text into word units. This paper tests the assumption that this segmentation is a necessary step for authorship attribution and topic classification tasks in Chinese, and demonstrates that it is not. We show extensive results for both tasks, considering both single words and short phrases as features, and examining the effect of document length on classification accuracy. Our experiments show that a naïve character bi-gram model of text performs as well as models generated using a state-of-the-art automatic segmenter.

Other authors
See publication

Languages

English

Native or bilingual proficiency
Mandarin Chinese

Native or bilingual proficiency
Cantonese

Native or bilingual proficiency

Recommendations received

Murat Odabasi

“We worked with Wei for about seven months to develop the next generation Content Routing system for Yahoo! Answers. The ambitious requirements of the project meant that we had challenging tasks to make complex calculations, integrate a bunch of different technologies, and deliver performance. Wei supplied the valuable data, information and insights which allowed us to make our decisions much more wisely than we would have made by ourselves. He was always polite, friendly, available and helpful, and he provided tangible contributions as we needed them throughout the development of the project. It was a true pleasure to work with him.”

1 person has recommended Wei

Join now to view

More activity by Wei

After 14 years, it's almost time for a new chapter... I've had an incredible opportunity to grow, learn and work on some truly ground-breaking…

After 14 years, it's almost time for a new chapter... I've had an incredible opportunity to grow, learn and work on some truly ground-breaking…

Liked by Wei Liu
🎉 We've just opened the doors to our brand-new office, and we can't wait to show you around! Check out our video tour and get a glimpse of our…

🎉 We've just opened the doors to our brand-new office, and we can't wait to show you around! Check out our video tour and get a glimpse of our…

Liked by Wei Liu
BBC News Interview with Samantha Simmonds, following UK Prime Minister #RishiSunak's speech on #AI 🔒 Why Focus on risks now? AI is evolving fast…

BBC News Interview with Samantha Simmonds, following UK Prime Minister #RishiSunak's speech on #AI 🔒 Why Focus on risks now? AI is evolving fast…

Liked by Wei Liu
The weeks spent in the burning desert sun paid off! Excited to share that our publication "Stealthy Terrain-Aware Multi-Agent Active Search" has…

The weeks spent in the burning desert sun paid off! Excited to share that our publication "Stealthy Terrain-Aware Multi-Agent Active Search" has…

Liked by Wei Liu
Made it to five years at Amazon!

Made it to five years at Amazon!

Liked by Wei Liu
Today, we announced our newest Amazon devices and the latest AI advancements powering them. It's an exciting step forward in our mission to make…

Today, we announced our newest Amazon devices and the latest AI advancements powering them. It's an exciting step forward in our mission to make…

Liked by Wei Liu
The future of analytics is (almost) here! 🤗 Big news from Infer! Coworker AI is about to go BETA 🎉 Been a bit quiet with product videos from my…

The future of analytics is (almost) here! 🤗 Big news from Infer! Coworker AI is about to go BETA 🎉 Been a bit quiet with product videos from my…

Liked by Wei Liu
Liked by Wei Liu

Liked by Wei Liu
Last night a team from across ComplyAdvantage took part in the J.P. Morgan Corporate Challenge in Battersea Park 🏃 With an average finish time of…

Last night a team from across ComplyAdvantage took part in the J.P. Morgan Corporate Challenge in Battersea Park 🏃 With an average finish time of…

Liked by Wei Liu
When robots have GUTS, they are brave! Humbled and honoured that our work "GUTS: Generalised Uncertainty-Aware Thompson Sampling for Multi-Agent…

When robots have GUTS, they are brave! Humbled and honoured that our work "GUTS: Generalised Uncertainty-Aware Thompson Sampling for Multi-Agent…

Liked by Wei Liu
NLP is getting more central in the tech industry and AI-based products. Especially with the support of language models powered by deep learning, the…

NLP is getting more central in the tech industry and AI-based products. Especially with the support of language models powered by deep learning, the…

Liked by Wei Liu

View Wei’s full profile

See who you know in common
Get introduced
Contact Wei directly

Join to view full profile

Other similar profiles

Explore more posts

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Wei Liu in United Kingdom

155 others named Wei Liu in United Kingdom are on LinkedIn

See others named Wei Liu

Add new skills with these courses

See all courses

Wei Liu

London, England, United Kingdom 848 followers 500+ connections

About

Articles by Wei

Don't ignore the bias in your ML system

On China's cashless payment

Activity

As Microsoft marks its incredible 50th Anniversary, I find myself reflecting on my own journey with this amazing company. Having spent a decade at…

Liked by Wei Liu

Happy and excited for all the teams responsible for the launch of https://nova.amazon.com! Congrats! Read more at https://lnkd.in/erDXU9_j

Liked by Wei Liu

LLM and Human alignment with simple linear mapping

Shared by Wei Liu

Experience

-

-

-

-

-

-

-

Education

Licenses & Certifications

Publications

LREC 2010 May 20, 2010

LREC 2008 May 19, 2008

ALPIT September 15, 2007

Languages

English

Native or bilingual proficiency

Mandarin Chinese

Native or bilingual proficiency

Cantonese

Native or bilingual proficiency

Recommendations received

Murat Odabasi

More activity by Wei

After 14 years, it's almost time for a new chapter... I've had an incredible opportunity to grow, learn and work on some truly ground-breaking…

Liked by Wei Liu

🎉 We've just opened the doors to our brand-new office, and we can't wait to show you around! Check out our video tour and get a glimpse of our…

Liked by Wei Liu

BBC News Interview with Samantha Simmonds, following UK Prime Minister #RishiSunak's speech on #AI 🔒 Why Focus on risks now? AI is evolving fast…

Liked by Wei Liu

The weeks spent in the burning desert sun paid off! Excited to share that our publication "Stealthy Terrain-Aware Multi-Agent Active Search" has…

Liked by Wei Liu

Made it to five years at Amazon!

Liked by Wei Liu

Today, we announced our newest Amazon devices and the latest AI advancements powering them. It's an exciting step forward in our mission to make…

Liked by Wei Liu

The future of analytics is (almost) here! 🤗 Big news from Infer! Coworker AI is about to go BETA 🎉 Been a bit quiet with product videos from my…

Liked by Wei Liu

Liked by Wei Liu

Last night a team from across ComplyAdvantage took part in the J.P. Morgan Corporate Challenge in Battersea Park 🏃 With an average finish time of…

Liked by Wei Liu

When robots have GUTS, they are brave! Humbled and honoured that our work "GUTS: Generalised Uncertainty-Aware Thompson Sampling for Multi-Agent…

Liked by Wei Liu

NLP is getting more central in the tech industry and AI-based products. Especially with the support of language models powered by deep learning, the…

Liked by Wei Liu

View Wei’s full profile

Other similar profiles

Marios Mourelatos

Rishabh Shukla

João Gomes

Kallirroi Dogani

Eeshan Malhotra

Inneke Mayachita

Bogdan Melnik

Joel Budu

Hamid Omidvar, Ph.D.

Julian Mack

Omololu Makinde, PhD

Ali Malek, PhD

Marc Romeyn

Adam Murphy

Christian Perone

Nikos Epitropakis

Shubham Agrawal

Adnan Shahzada

Kashyap Popat

Kensuke Muraki

London, England, United Kingdom
848 followers 500+ connections