Skip to main content

Scaling vision transformers to 22 billion parameters

Data-centric ML benchmarking: Announcing DataPerf’s 2023 challenges

Leveraging transfer learning for large scale differentially private image classification

PRESTO – A multilingual dataset for parsing realistic task-oriented dialogues

Detecting novel systemic biomarkers in external eye photos

Visual language maps for robot navigation

Vid2Seq: a pretrained visual language model for describing multi-event videos

Responsible AI at Google Research: The Impact Lab

Learning from deep learning: a case study of feature discovery and validation in pathology

PaLM-E: An embodied multimodal language model

The BirdCLEF 2023 Challenge: Pushing the frontiers of biodiversity monitoring

Announcing the ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

Universal Speech Model (USM): State-of-the-art speech AI for 100+ languages

Performer-MPC: Navigation via real-time, on-robot transformers

Distributed differential privacy for federated learning

Teaching old labels new tricks in heterogeneous graphs