User profiles for Golnaz Ghiasi

Golnaz Ghiasi

Google DeepMind
Verified email at google.com
Cited by 27479

Dropblock: A regularization method for convolutional networks

G Ghiasi, TY Lin, QV Le - Advances in neural information …, 2018 - proceedings.neurips.cc
Deep neural networks often work well when they are over-parameterized and trained with a
massive amount of noise and regularization, such as weight decay and dropout. Although …

Nas-fpn: Learning scalable feature pyramid architecture for object detection

G Ghiasi, TY Lin, QV Le - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
Current state-of-the-art convolutional architectures for object detection are manually designed.
Here we aim to learn a better architecture of feature pyramid network for object detection. …

Laplacian pyramid reconstruction and refinement for semantic segmentation

G Ghiasi, CC Fowlkes - European conference on computer vision, 2016 - Springer
CNN architectures have terrific recognition performance but rely on spatial pooling which
makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper …

Scaling open-vocabulary image segmentation with image-level labels

G Ghiasi, X Gu, Y Cui, TY Lin - European conference on computer vision, 2022 - Springer
We design an open-vocabulary image segmentation model to organize an image into meaningful
regions indicated by arbitrary texts. Recent works (CLIP and ALIGN), despite attaining …

Simple copy-paste is a strong data augmentation method for instance segmentation

G Ghiasi, Y Cui, A Srinivas, R Qian… - Proceedings of the …, 2021 - openaccess.thecvf.com
Building instance segmentation models that are data-efficient and can handle rare object
categories is an important challenge in computer vision. Leveraging data augmentations is a …

Rethinking pre-training and self-training

B Zoph, G Ghiasi, TY Lin, Y Cui, H Liu… - Advances in neural …, 2020 - proceedings.neurips.cc
Pre-training is a dominant paradigm in computer vision. For example, supervised ImageNet
pre-training is commonly used to initialize the backbones of object detection and …

Learning data augmentation strategies for object detection

B Zoph, ED Cubuk, G Ghiasi, TY Lin, J Shlens… - European conference on …, 2020 - Springer
Much research on object detection focuses on building better model architectures and
detection algorithms. Changing the model architecture, however, comes at the cost of adding …

Exploring the structure of a real-time, arbitrary neural artistic stylization network

G Ghiasi, H Lee, M Kudlur, V Dumoulin… - arXiv preprint arXiv …, 2017 - arxiv.org
In this paper, we present a method which combines the flexibility of the neural algorithm of
artistic style with the speed of fast style transfer networks to allow real-time stylization using …

Occlusion coherence: Localizing occluded faces with a hierarchical deformable part model

G Ghiasi, CC Fowlkes - … of the IEEE conference on computer …, 2014 - openaccess.thecvf.com
The presence of occluders significantly impacts performance of systems for object recognition.
However, occlusion is typically treated as an unstructured source of noise and explicit …

Spinenet: Learning scale-permuted backbone for recognition and localization

X Du, TY Lin, P Jin, G Ghiasi, M Tan… - Proceedings of the …, 2020 - openaccess.thecvf.com
Convolutional neural networks typically encode an input image into a series of intermediate
features with decreasing resolutions. While this structure is suited to classification tasks, it …