Tetromino Pixel

"Tetromino Pixel" is an umbrella term for research that uses pixel-level information from images and videos to accomplish higher-level tasks. Current work emphasizes deep learning models, including transformers, U-Nets, and diffusion models, that process visual data and integrate it with other modalities such as text and 3D point clouds, for applications such as image captioning, object detection, 3D reconstruction, and robotic control. This research matters for advancing multimodal AI, improving the efficiency and interpretability of computer vision systems, and enabling new capabilities in areas such as autonomous navigation and medical image analysis.

Papers

May 15, 2023

Not All Pixels Are Equal: Learning Pixel Hardness for Semantic Segmentation
Xin Xiao, Daiguo Zhou, Jiagao Hu, Yi Hu, Yongchao Xu
Semantic Segmentation Tetromino Pixel Segmentation Loss

April 27, 2023

Discovering Object-Centric Generalized Value Functions From Pixels
Somjit Nath, Gopeshh Raaj Subbaraj, Khimya Khetarpal, Samira Ebrahimi Kahou
Deep Reinforcement Learning Scientific Discovery Value Function Tetromino Pixel Fast Adaptation Dimensional Input Useful Representation

April 24, 2023

Beyond the Pixel: a Photometrically Calibrated HDR Dataset for Luminance and Color Prediction
Christophe Bolduc, Justine Giroux, Marc Hébert, Claude Demers, Jean-François Lalonde
High Dynamic Range Tetromino Pixel Lighting Estimation Luminance Consistency Color Prediction

April 14, 2023

A Unified HDR Imaging Method with Pixel and Patch Level
Qingsen Yan, Weiye Chen, Song Zhang, Yu Zhu, Jinqiu Sun, Yanning Zhang
High Dynamic Range Tetromino Pixel Class Relevant Patch

March 20, 2023

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
Song Park, Sanghyuk Chun, Byeongho Heo, Wonjae Kim, Sangdoo Yun
Supervised ImageNet Large Scale Tetromino Pixel K TOKEN Storage Efficient

March 16, 2023

A Picture is Worth a Thousand Words: Language Models Plan from Pixels
Anthony Z. Liu, Lajanugen Logeswaran, Sungryull Sohn, Honglak Lee
Large Language Model Pre Trained Language Model Task Planning Affordance Learning Word List Tetromino Pixel Long Horizon Task Web Screenshots Sequential Planning

February 28, 2023

Learning Sparse Control Tasks from Pixels by Latent Nearest-Neighbor-Guided Explorations
Ruihan Zhao, Ufuk Topcu, Sandeep Chinchali, Mariano Phielipp
Deep Reinforcement Learning Environment Exploration Sparse Reward Tetromino Pixel Model Free Reinforcement Learning Sparse Reward Environment Intermediate Latent Sparse Semantic Vision Based Reinforcement Learning

February 17, 2023

Mixed Traffic Control and Coordination from Pixels
Michael Villarreal, Bibek Poudel, Jia Pan, Weizi Li
Mobile Robot Tetromino Pixel Human Driven Vehicle Traffic Management Prior Coordination Traffic Management System Mixed Traffic Control

January 11, 2023

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
Jiapeng Zhu, Ceyuan Yang, Yujun Shen, Zifan Shi, Bo Dai, Deli Zhao, Qifeng Chen
Pixel Level Tetromino Pixel GAN Inversion GAN Training Latent Code GAN Generator Controllable Image Synthesis

December 28, 2022

Pixel Relationships-based Regularizer for Retinal Vessel Image Segmentation
Lukman Hakim, Takio Kurita
Loss Function Image Segmentation Tetromino Pixel Gradient Regularization Retinal Vessel Segmentation Pixel Wise Loss Pixel Relation

December 21, 2022

Generalized Decoding for Pixel, Image, and Language
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao
Human Language Pixel Level Tetromino Pixel Accurate Decoding Open Vocabulary Segmentation

December 20, 2022

Which Pixel to Annotate: a Label-Efficient Nuclei Segmentation Framework
Wei Lou, Haofeng Li, Guanbin Li, Xiaoguang Han, Xiang Wan
Segmentation Model Tetromino Pixel Nucleus Segmentation Manual Annotation Nucleus Instance Segmentation

December 15, 2022

CLIPPO: Image-and-Language Understanding from Pixels Only
Michael Tschannen, Basil Mustafa, Neil Houlsby
Contrastive Loss Multimodal Model Tetromino Pixel Multimodal Task Text Contrastive Learning Clipped Stochastic Gradient Descent

December 13, 2022

Pixel is All You Need: Adversarial Trajectory-Ensemble Active Learning for Salient Object Detection
Zhenyu Wu, Lin Wang, Wei Wang, Qing Xia, Chenglizhao Chen, Aimin Hao, Shuo Li
Weakly Supervised Tetromino Pixel Salient Object Detection Adversarial Trajectory

November 14, 2022

PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation
Mu Chen, Zhedong Zheng, Yi Yang, Tat-Seng Chua
Semantic Segmentation Domain Adaptation Tetromino Pixel Image Domain

November 1, 2022

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions
Alexey Skrynnik, Zoya Volovikova, Marc-Alexandre Côté, Anton Voronov, Artem Zholus, Negar Arabzadeh, Shrestha Mohanty, Milagro Teruel, Ahmed Awadallah, Aleksandr Panov, Mikhail Burtsev, Julia Kiseleva
Language Model Reinforcement Learning Learning Abstract Natural Language Instruction Tetromino Pixel Sub Goal Pre Trained Reinforcement Learning

October 2, 2022

Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind, Thiago D. Simao, Tal Kachman, Nils Jansen
Safe Reinforcement Learning Tetromino Pixel Observable Markov Decision Process Partial Observability Reward Maximization Safety Critic

September 24, 2022

Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Sai Rajeswar, Pietro Mazzaglia, Tim Verbelen, Alexandre Piché, Bart Dhoedt, Aaron Courville, Alexandre Lacoste
Reinforcement Learning Tetromino Pixel Reinforcement Learning Benchmark Unsupervised Reinforcement Learning

September 23, 2022

Image Classification using Sequence of Pixels
Gajraj Kuldeep
Image Classification Recurrent Neural Network Long Short Term Memory LSTM Network Tetromino Pixel Sequence of Sequence

July 30, 2022

Temporal extrapolation of heart wall segmentation in cardiac magnetic resonance images via pixel tracking
Arash Rabbani, Hao Gao, Dirk Husmeier
Tetromino Pixel Cardiac Magnetic Resonance Scene Extrapolation Ventricle Segmentation