« Back
Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
github.com
Submitted by neehao 11 days ago