Tetromino Pixel

"Tetromino Pixel" is an umbrella term for research that uses pixel-level information from images and videos to perform higher-level tasks. Current work relies on deep learning models, including transformers, U-Nets, and diffusion models, to process visual data and combine it with other modalities such as text and 3D point clouds, with applications in image captioning, object detection, 3D reconstruction, and robotic control. The research matters for advancing multimodal AI, improving the efficiency and interpretability of computer vision systems, and enabling new capabilities in areas such as autonomous navigation and medical image analysis.

Papers

July 27, 2022

Adaptive sampling for scanning pixel cameras
Yusuf Duman, Jean-Yves Guillemaut, Simon Hadfield
Semantic Segmentation Adaptive Importance Image Quality Tetromino Pixel Low Power High Speed Imaging

July 26, 2022

PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers
Namgyu Kang, Byeonghyeon Lee, Youngjoon Hong, Seok-Bae Yun, Eunbyung Park
Physics Informed Neural Network Tetromino Pixel PDE Solver Cell Representation Inverse PDE Problem

July 14, 2022

Language Modelling with Pixels
Phillip Rust, Jonas F. Lotz, Emanuele Bugliarello, Elizabeth Salesky, Miryam de Lhoneux, Desmond Elliott
Language Model Pretrained Language Model Tetromino Pixel Text Encoder Language Modelling Code Switched

July 3, 2022

Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin, Philip J. Ball, Steve Roberts, Oya Celiktutan
Policy Reinforcement Learning Tetromino Pixel Convolutional Encoder DeepMind Control Suite Catastrophic Overfitting Policy Deep Reinforcement Learning Training Instability

June 13, 2022

Pixel to Binary Embedding Towards Robustness for CNNs
Ikki Kishida, Hideki Nakayama
Convolutional Neural Network Native Robustness Adversarial Perturbation Tetromino Pixel Binary Embedding

June 10, 2022

Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels?
Xiang Li, Jinghuan Shang, Srijan Das, Michael S. Ryoo
Reinforcement Learning Self Supervised Learning Tetromino Pixel Online Reinforcement Learning Self Supervised Loss Contrastive Reinforcement Learning

June 8, 2022

Deep Hierarchical Planning from Pixels
Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel
World Model Sparse Reward Tetromino Pixel Hierarchical Reinforcement Learning Latent Intent

June 4, 2022

From Pixels to Objects: Cubic Visual Attention for Visual Question Answering
Jingkuan Song, Pengpeng Zeng, Lianli Gao, Heng Tao Shen
Visual Question Answering Arbitrary Object Tetromino Pixel Channel Attention Visual Question Answering Model 3D Attention

May 23, 2022

Denoising-based image reconstruction from pixels located at non-integer positions
Ján Koloda, Jürgen Seiler, André Kaup
Image Reconstruction Image Processing Tetromino Pixel Motion Compensation Positional Label

April 11, 2022

Evaluating Vision Transformer Methods for Deep Reinforcement Learning from Pixels
Tianxin Tao, Daniele Reda, Michiel van de Panne
Vision Transformer Deep Reinforcement Learning Transformer Architecture Tetromino Pixel Image Based Reinforcement Learning

April 3, 2022

Region-aware Attention for Image Inpainting
Zhilin Huang, Chujun Qin, Zhenyu Weng, Yuesheng Zhu
Tetromino Pixel Pixel Wise Region Attention Learnable Region

April 2, 2022

PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation
Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji
Image Generation Tetromino Pixel Pixel Level Synthesis

March 23, 2022

Pixel VQ-VAEs for Improved Pixel Art Representation
Akash Saravanan, Matthew Guzdial
Jina Embeddings Tetromino Pixel Vq Vae

March 7, 2022

Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation
Alexander Long, Alan Blair, Herke van Hoof
Reinforcement Learning Algorithm Value Function Tetromino Pixel Data Efficient Episodic Reinforcement Learning Polygonal Environment Lazy Learning

March 2, 2022

Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation
Weicai Ye, Xinyue Lan, Ge Su, Hujun Bao, Zhaopeng Cui, Guofeng Zhang
Optical Flow Panoptic Segmentation Tetromino Pixel Inter Frame Human Instance Video Panoptic Segmentation C BIoU Tracker

February 4, 2022

Pixle: a fast and effective black-box attack based on rearranging pixels
Jary Pomponi, Simone Scardapane, Aurelio Uncini
Adversarial Attack Black Box Adversarial Sample Tetromino Pixel Adversarial Text

December 21, 2021

December 17, 2021

Image Inpainting Using AutoEncoder and Guided Selection of Predicted Pixels
Mohammad H. Givkashi, Mahshid Hadipour, Arezoo PariZanganeh, Zahra Nabizadeh, Nader Karimi, Shadrokh Samavi
Deep Neural Network U Net Tetromino Pixel Sequential Selection Missing Pixel

December 12, 2021

Magnifying Networks for Images with Billions of Pixels
Neofytos Dimitriou, Ognjen Arandjelovic
Computer Vision High Resolution Image Tetromino Pixel Gigapixel Image