User profiles for Golnaz Ghiasi
Golnaz GhiasiGoogle DeepMind Verified email at google.com Cited by 27479 |
Dropblock: A regularization method for convolutional networks
Deep neural networks often work well when they are over-parameterized and trained with a
massive amount of noise and regularization, such as weight decay and dropout. Although …
massive amount of noise and regularization, such as weight decay and dropout. Although …
Nas-fpn: Learning scalable feature pyramid architecture for object detection
Current state-of-the-art convolutional architectures for object detection are manually designed.
Here we aim to learn a better architecture of feature pyramid network for object detection. …
Here we aim to learn a better architecture of feature pyramid network for object detection. …
Laplacian pyramid reconstruction and refinement for semantic segmentation
G Ghiasi, CC Fowlkes - European conference on computer vision, 2016 - Springer
CNN architectures have terrific recognition performance but rely on spatial pooling which
makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper …
makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper …
Scaling open-vocabulary image segmentation with image-level labels
We design an open-vocabulary image segmentation model to organize an image into meaningful
regions indicated by arbitrary texts. Recent works (CLIP and ALIGN), despite attaining …
regions indicated by arbitrary texts. Recent works (CLIP and ALIGN), despite attaining …
Simple copy-paste is a strong data augmentation method for instance segmentation
Building instance segmentation models that are data-efficient and can handle rare object
categories is an important challenge in computer vision. Leveraging data augmentations is a …
categories is an important challenge in computer vision. Leveraging data augmentations is a …
Rethinking pre-training and self-training
Pre-training is a dominant paradigm in computer vision. For example, supervised ImageNet
pre-training is commonly used to initialize the backbones of object detection and …
pre-training is commonly used to initialize the backbones of object detection and …
Learning data augmentation strategies for object detection
Much research on object detection focuses on building better model architectures and
detection algorithms. Changing the model architecture, however, comes at the cost of adding …
detection algorithms. Changing the model architecture, however, comes at the cost of adding …
Exploring the structure of a real-time, arbitrary neural artistic stylization network
In this paper, we present a method which combines the flexibility of the neural algorithm of
artistic style with the speed of fast style transfer networks to allow real-time stylization using …
artistic style with the speed of fast style transfer networks to allow real-time stylization using …
Occlusion coherence: Localizing occluded faces with a hierarchical deformable part model
G Ghiasi, CC Fowlkes - … of the IEEE conference on computer …, 2014 - openaccess.thecvf.com
The presence of occluders significantly impacts performance of systems for object recognition.
However, occlusion is typically treated as an unstructured source of noise and explicit …
However, occlusion is typically treated as an unstructured source of noise and explicit …
Spinenet: Learning scale-permuted backbone for recognition and localization
Convolutional neural networks typically encode an input image into a series of intermediate
features with decreasing resolutions. While this structure is suited to classification tasks, it …
features with decreasing resolutions. While this structure is suited to classification tasks, it …