Gebruikersprofielen voor Jackie Kay
Jackie KayCusp AI Geverifieerd e-mailadres voor cusp.ai Geciteerd door 13873 |
Magnetic control of tokamak plasmas through deep reinforcement learning
Nuclear fusion using magnetic confinement, in particular in the tokamak configuration, is a
promising path towards sustainable energy. A core challenge is to shape and maintain a high-…
promising path towards sustainable energy. A core challenge is to shape and maintain a high-…
A generalist agent
…, G Barth-Maron, M Gimenez, Y Sulsky, J Kay… - arXiv preprint arXiv …, 2022 - arxiv.org
Inspired by progress in large-scale language modeling, we apply a similar approach towards
building a single generalist agent beyond the realm of text outputs. The agent, which we …
building a single generalist agent beyond the realm of text outputs. The agent, which we …
Sociotechnical safety evaluation of generative ai systems
Generative AI systems produce a range of risks. To ensure the safety of generative AI
systems, these risks must be evaluated. In this paper, we make two main contributions toward …
systems, these risks must be evaluated. In this paper, we make two main contributions toward …
Epistemic injustice in generative AI
This paper investigates how generative AI can potentially undermine the integrity of collective
knowledge and the processes we rely on to acquire, assess, and trust information, posing …
knowledge and the processes we rely on to acquire, assess, and trust information, posing …
Gaps in the safety evaluation of generative AI
Generative AI systems produce a range of ethical and social risks. Evaluation of these risks
is a critical step on the path to ensuring the safety of these systems. However, evaluation …
is a critical step on the path to ensuring the safety of these systems. However, evaluation …
Robust reinforcement learning for continuous control with model misspecification
We provide a framework for incorporating robustness -- to perturbations in the transition
dynamics which we refer to as model misspecification -- into continuous control Reinforcement …
dynamics which we refer to as model misspecification -- into continuous control Reinforcement …
Fairness for unobserved characteristics: Insights from technological impacts on queer communities
Advances in algorithmic fairness have largely omitted sexual orientation and gender identity.
We explore queer concerns in privacy, censorship, language, online safety, health, and …
We explore queer concerns in privacy, censorship, language, online safety, health, and …
Queer in AI: A case study in community-led participatory AI
Queerness and queer people face an uncertain future in the face of ever more widely
deployed and invasive artificial intelligence (AI). These technologies have caused numerous …
deployed and invasive artificial intelligence (AI). These technologies have caused numerous …
Self-supervised sim-to-real adaptation for visual robotic manipulation
Collecting and automatically obtaining reward signals from real robotic visual data for the
purposes of training reinforcement learning algorithms can be quite challenging and time-…
purposes of training reinforcement learning algorithms can be quite challenging and time-…
Learning gentle object manipulation with curiosity-driven deep reinforcement learning
Robots must know how to be gentle when they need to interact with fragile objects, or when
the robot itself is prone to wear and tear. We propose an approach that enables deep …
the robot itself is prone to wear and tear. We propose an approach that enables deep …