Gebruikersprofielen voor Nan Duan
Nan DuanVice President of JD.Com | ex- StepFun | ex- Microsoft Research Geverifieerd e-mailadres voor microsoft.com Geciteerd door 39787 |
Codebert: A pre-trained model for programming and natural languages
We present CodeBERT, a bimodal pre-trained model for programming language (PL) and
natural language (NL). CodeBERT learns general-purpose representations that support …
natural language (NL). CodeBERT learns general-purpose representations that support …
scGPT: toward building a foundation model for single-cell multi-omics using generative AI
Generative pretrained models have achieved remarkable success in various domains such
as language and computer vision. Specifically, the combination of large-scale diverse …
as language and computer vision. Specifically, the combination of large-scale diverse …
Question generation for question answering
This paper presents how to generate questions from given passages using neural networks,
where large scale QA pairs are automatically crawled and processed from Community-QA …
where large scale QA pairs are automatically crawled and processed from Community-QA …
Visual chatgpt: Talking, drawing and editing with visual foundation models
ChatGPT is attracting a cross-field interest as it provides a language interface with remarkable
conversational competency and reasoning capabilities across many domains. However, …
conversational competency and reasoning capabilities across many domains. However, …
Query rewriting in retrieval-augmented large language models
Large Language Models (LLMs) play powerful, black-box readers in the retrieve-then-read
pipeline, making remarkable progress in knowledge-intensive tasks. This work introduces a …
pipeline, making remarkable progress in knowledge-intensive tasks. This work introduces a …
Unixcoder: Unified cross-modal pre-training for code representation
Pre-trained models for programming languages have recently demonstrated great success
on code intelligence. To support both code-related understanding and generation tasks, …
on code intelligence. To support both code-related understanding and generation tasks, …
Agieval: A human-centric benchmark for evaluating foundation models
Assessing foundation models’ abilities for human-level tasks is crucial for Artificial General
Intelligence (AGI) development. Traditional benchmarks, which rely on artificial datasets, may …
Intelligence (AGI) development. Traditional benchmarks, which rely on artificial datasets, may …
Critic: Large language models can self-correct with tool-interactive critiquing
Recent developments in large language models (LLMs) have been impressive. However,
these models sometimes show inconsistencies and problematic behavior, such as …
these models sometimes show inconsistencies and problematic behavior, such as …
Graphcodebert: Pre-training code representations with data flow
Pre-trained models for programming language have achieved dramatic empirical
improvements on a variety of code-related tasks such as code search, code completion, code …
improvements on a variety of code-related tasks such as code search, code completion, code …
Tora: A tool-integrated reasoning agent for mathematical problem solving
Large language models have made significant progress in various language tasks, yet they
still struggle with complex mathematics. In this paper, we propose ToRA a series of Tool-…
still struggle with complex mathematics. In this paper, we propose ToRA a series of Tool-…