Solar 10.7 b: Scaling large language models with simple yet effective depth up-scaling
We introduce SOLAR 10.7B, a large language model (LLM) with 10.7 billion parameters,
demonstrating superior performance in various natural language processing (NLP) tasks. …
demonstrating superior performance in various natural language processing (NLP) tasks. …
Effective cross-lingual transfer of neural machine translation models without shared vocabularies
Transfer learning or multilingual model is essential for low-resource neural machine translation
(NMT), but the applicability is limited to cognate languages by sharing their vocabularies. …
(NMT), but the applicability is limited to cognate languages by sharing their vocabularies. …
When and why is document-level context useful in neural machine translation?
Document-level context has received lots of attention for compensating neural machine
translation (NMT) of isolated sentences. However, recent advances in document-level NMT …
translation (NMT) of isolated sentences. However, recent advances in document-level NMT …
Pivot-based transfer learning for neural machine translation between non-English languages
We present effective pre-training strategies for neural machine translation (NMT) using
parallel corpora involving a pivot language, ie, source-pivot and pivot-target, leading to a …
parallel corpora involving a pivot language, ie, source-pivot and pivot-target, leading to a …
Bearing fault detection using scalogram and switchable normalization-based CNN (SN-CNN)
Bearings play a vital role in all rotating machinery, and their failure is one of the significant
causes of machine breakdown leading to a profound loss of safety and property. Therefore, …
causes of machine breakdown leading to a profound loss of safety and property. Therefore, …
sDPO: Don't Use Your Data All at Once
As development of large language models (LLM) progresses, aligning them with human
preferences has become increasingly important. We propose stepwise DPO (sDPO), an …
preferences has become increasingly important. We propose stepwise DPO (sDPO), an …
When and why is unsupervised neural machine translation useless?
This paper studies the practicality of the current state-of-the-art unsupervised methods in neural
machine translation (NMT). In ten translation tasks with various data settings, we analyze …
machine translation (NMT). In ten translation tasks with various data settings, we analyze …
Prompt-and trait relation-aware cross-prompt essay trait scoring
Automated essay scoring (AES) aims to score essays written for a given prompt, which defines
the writing topic. Most existing AES systems assume to grade essays of the same prompt …
the writing topic. Most existing AES systems assume to grade essays of the same prompt …
Generalizing back-translation in neural machine translation
Back-translation - data augmentation by translating target monolingual data - is a crucial
component in modern neural machine translation (NMT). In this work, we reformulate back-…
component in modern neural machine translation (NMT). In this work, we reformulate back-…
Emotion malleability beliefs matter in emotion regulation: a comprehensive review and meta-analysis
Individuals’ beliefs about the malleability of emotions have been theorised to play a role in
their psychological distress by influencing emotion regulation processes, such as the use of …
their psychological distress by influencing emotion regulation processes, such as the use of …