User profiles for Xia Song

Xia Song

- Verified email at microsoft.com - Cited by 7819

Xia Song

- Verified email at mail.sdsu.edu - Cited by 1828

Irradiation behavior in high entropy alloys

S Xia, Z Wang, T Yang, Y Zhang - Journal of Iron and Steel Research …, 2015 - Springer
As an increasing demand of advanced nuclear fission reactors and fusion facilities, the key
requirements for the materials used in advanced nuclear systems should encompass …

Language is not all you need: Aligning perception with language models

…, V Chaudhary, S Som, X Song… - Advances in …, 2023 - proceedings.neurips.cc
A big convergence of language, multimodal perception, action, and world modeling is a key
step toward artificial general intelligence. In this work, we introduce KOSMOS-1, a …

Phi-3 technical report: A highly capable language model locally on your phone

…, H Sharma, Y Shen, S Shukla, X Song… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion
tokens, whose overall performance, as measured by both academic benchmarks and internal …

MS MARCO: A human-generated machine reading comprehension dataset

T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary… - 2016 - openreview.net
This paper presents our recent work on the design and development of a new, large scale
dataset, which we name MS MARCO, for MAchine Reading COmprehension. This new …

Using DeepSpeed and Megatron to train Megatron-Turing NLG 530B, a large-scale generative language model

…, R Child, RY Aminabadi, J Bernauer, X Song… - arXiv preprint arXiv …, 2022 - arxiv.org
Pretrained general-purpose language models can achieve state-of-the-art accuracies in
various natural language processing domains by adapting to downstream tasks via zero-shot, …

MS MARCO: A human generated machine reading comprehension dataset

…, B Mitra, T Nguyen, M Rosenberg, X Song… - arXiv preprint arXiv …, 2016 - arxiv.org
We introduce a large scale MAchine Reading COmprehension dataset, which we name MS
MARCO. The dataset comprises of 1,010,916 anonymized questions---sampled from Bing's …

Production, Characterization, and Antioxidant Activity of Fucoxanthin from the Marine Diatom Odontella aurita

S Xia, K Wang, L Wan, A Li, Q Hu, C Zhang - Marine drugs, 2013 - mdpi.com
The production, characterization, and antioxidant capacity of the carotenoid fucoxanthin from
the marine diatom Odontella aurita were investigated. The results showed that low light and …

Water‐use efficiency of forest ecosystems in eastern China and its relations to climatic variables

G Yu, X Song, Q Wang, Y Liu, D Guan, J Yan… - New …, 2008 - Wiley Online Library
Carbon (C) and water cycles of terrestrial ecosystems are two coupled ecological processes
controlled partly by stomatal behavior. Water‐use efficiency (WUE) reflects the coupling …

A length-extrapolatable transformer

…, S Huang, A Benhaim, V Chaudhary, X Song… - arXiv preprint arXiv …, 2022 - arxiv.org
Position modeling plays a critical role in Transformers. In this paper, we focus on length
extrapolation, i.e., training on short texts while evaluating longer sequences. We define attention …

InfoXLM: An information-theoretic framework for cross-lingual language model pre-training

…, F Wei, N Yang, S Singhal, W Wang, X Song… - arXiv preprint arXiv …, 2020 - arxiv.org
In this work, we present an information-theoretic framework that formulates cross-lingual
language model pre-training as maximizing mutual information between multilingual-multi-…