Vision + Language
-
VisionArena: 230K Real World User-VLM Conversations with Preference Labels
Christopher Chou*, Lisa Dunlap*, Koki Mashita, Krishna Mandal, Trevor Darrell, Ion Stoica, Joseph E. Gonzalez, Wei-Lin Chiang
TL;DR It’s the data release for Chatbot Arena, a platform for crowdsourcing preference votes.
-
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models
-
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline
Tianle Li*, Wei-Lin Chiang*, Evan Frick, Lisa Dunlap, Tianhao Wu, Banghua Zhu, Joseph E. Gonzalez, Ion Stoica
[arXiv] Paper Code Blog Dataset Leaderboard
TL;DR Filter large, messy NLP datasets into a smaller set of high-quality prompts using LLMs
-
Describing Differences in Image Sets with Natural Language
Lisa Dunlap*, Yuhui Zhang*, Xiaohan Wang, R. Zhong, Trevor Darrell, Jacob Steinhardt, Joseph E. Gonzalez, Serena Yeung-Levy
[CVPR 2024 (oral)] Paper Code Website
TL;DR Set Difference Captioning - describing differences in two large sets of images with language - has many impactful ML & data science applications
-
See, Say, and Segment: Teaching LMMs to Overcome False Premises
-
Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation (ALIA)
-
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence
-
Using Language to Extend to Unseen Domains (LADS)
-
Deep Mixture of Experts Via Shallow Embedding
X. Wang, F. Yu, L. Dunlap, R. Wang, Y. A. Ma, A. Mirhoseini, T. Darrell, and J. E. Gonzalez.
[UAI 2019] Paper
TL;DR lots of MoEs + sparse gating network = better accuracy and less computation
ML Systems
-
Improve Model Inference Cost with Image Gridding
S. Krishnaswamy, L. Dunlap, L. Chen, M. Zaharia, J. Zou, J. Gonzalez
[ICML 2023 DMLR workshop] Paper
TL;DR reduce vision model API costs by gridding your images together
-
Hypersched: Dynamic resource allocation for model development on a deadline
R. Liaw, R. Bhardwaj, L. Dunlap, A. Tumanov, J. E. Gonzalez, I. Stoica
[SoCC 2019] Paper
TL;DR when HP tuning on a time deadline, dynamically allocate resources to jobs
Misc
-
Machine Log Parsing with Named Entity Recognition
L. Dunlap, A. Starosta, K. Curtis, Z. Wang, C. Sarkar, R. Sriharsha.
[Nvidia GTC 2021] Blog
TL;DR NER models work surprisingly well for log parsing
-
Habitat-dependent search behavior in the Colorado Checkered Whiptail (Aspidoscelis neotesselata)
K. Utsumi, C. Kusaks, R. Pedersen, C. Staley, L. Dunlap, S. G. Smith, M. A. Eifler, D. A. Eifler.
[Western North American Naturalist 2019] Paper
TL;DR whiptails behave differently in shrub grassland vs. pine-juniper woodland