Data Curation for Large Scale Detection Pretraining

Vivek Ramanujan , Haotian Zhang , Yinfei Yang , Ali Farhadi

Datacomp: In search of the next generation of multimodal datasets

Samir Yitzhak Gadre , Gabriel Ilharco , Alex Fang , Jonathan Hayase
Advances in Neural Information Processing Systems 36

152
2024
DataComp: In search of the next generation of multimodal datasets

Samir Yitzhak Gadre , Gabriel Ilharco , Alex Fang , Jonathan Hayase
arXiv e-prints arXiv: 2304.14108 -arXiv: 2304.14108

2023
A meta-analysis of overfitting in machine learning

Rebecca Roelofs , Vaishaal Shankar , Benjamin Recht , Sara Fridovich-Keil
NeurIPS 9175 -9185

190
2019
Language models scale reliably with over-training and on downstream tasks

Samir Yitzhak Gadre , Georgios Smyrnis , Vaishaal Shankar , Suchin Gururangan
arXiv preprint arXiv:2403.08540

4
2024
Supplementary: Do Image Classifiers Generalize Across Time?

Vaishaal Shankar , Deva Ramanan , Achal Dave , Benjamin Recht

Data filtering networks

Alex Fang , Albin Madappally Jose , Amit Jain , Ludwig Schmidt
arXiv preprint arXiv:2309.17425

28
2023
TiC-CLIP: Continual Training of CLIP Models

Saurabh Garg , Mehrdad Farajtabar , Hadi Pouransari , Raviteja Vemulapalli
International Conference on Learning Representations (ICLR)

5
2024
Openclip, July 2021

Gabriel Ilharco , Mitchell Wortsman , Ross Wightman , Cade Gordon
If you use this software, please cite it as below 7

159
Openclip, 2021

Gabriel Ilharco , Mitchell Wortsman , Ross Wightman , Cade Gordon
If you use this software, please cite it as below 3 ( 5)

58
Numpywren: Serverless linear algebra

Vaishaal Shankar , Karl Krauth , Qifan Pu , Eric Jonas
arXiv preprint arXiv:1810.09679

64
2018
Convolutional kitchen sinks for transcription factor binding site prediction

Alyssa Morrow , Vaishaal Shankar , Devin Petersohn , Anthony Joseph
arXiv preprint arXiv:1706.00125

24
2017
Ground Control to Major Tom: the importance of field surveys in remotely sensed data analysis

Ian Bolliger , Tamma Carleton , Solomon Hsiang , Jonathan Kadish
arXiv preprint arXiv:1710.09342

2
2017
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum

Hadi Pouransari , Chun-Liang Li , Jen-Hao Rick Chang , Pavan Kumar Anasosalu Vasu
arXiv preprint arXiv:2405.13226

1
2024
Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation

Yuhui Zhang , Brandon McKinzie , Zhe Gan , Vaishaal Shankar
arXiv preprint arXiv:2311.16201

2023
Robust multimodal models have outlier features and encode more concepts

Jonathan Crabbé , Pau Rodríguez , Vaishaal Shankar , Luca Zappella
arXiv preprint arXiv:2310.13040

2023
Gradients for the Loss!

Fotis Iliopoulos , Vrettos Moulous , Vaishaal Shankar , Max Simchowitz

Back to the future: Malware detection with temporally consistent labels

Brad Miller , Alex Kantchelian , S Afroz , R Bachwani
Under submission

7
2015
Convolutions of random patches as a generalizable featurization for multi-domain prediction using remote sensing imagery

B. Recht , I. W. Bolliger , V. Shankar , T. Carleton
AGU Fall Meeting Abstracts 2018

2018