1. Image Processing
- Deghosting - Generating Content for HDR Deghosting from Frequency View
 
- Shadow Removal - HomoFormer: Homogenized Transformer for Image Shadow Removal
 
- Deblurring - Unsupervised Blind Image Deblurring Based on Self-Enhancement
- Latency Correction for Event-guided Deblurring and Frame Interpolation
- LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network
- ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation
- Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains
 ⭐code
- AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring
 ⭐code
 ⭐code
- A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning
 ⭐code
 
- Dehazing - ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing
- Depth Information Assisted Collaborative Mutual Promotion Network for Single Image Dehazing
- A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint
 ⭐code
 
- Denoising - Real-World Mobile Image Denoising Dataset with Efficient Baselines
- GenesisTex: Adapting Image Denoising Diffusion to Texture Space
- Robust Image Denoising through Adversarial Frequency Mixup
- Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios
- Masked and Shuffled Blind Spot Denoising for Real-World Images
- LAN: Learning to Adapt Noise for Image Denoising
- Unmixing Diffusion for Self-Supervised Hyperspectral Image Denoising
- Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation
- Transfer CLIP for Generalizable Image Denoising
- Residual Denoising Diffusion Models
 ⭐code
- Equivariant plug-and-play image reconstruction
 ⭐code
- Patch2Self2: Self-supervised Denoising on Coresets via Matrix Sketching
- Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics
- Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images
 👍Chinese summary
 
- Deraining - Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining
 ⭐code
 
- Reflection Removal - Revisiting Single Image Reflection Removal In the Wild
- Language-guided Image Reflection Separation
 
- Retouching - Close Imitation of Expert Retouching for Black-and-White Photography
 
- Image Enhancement - Color Shift Estimation-and-Correction for Image Enhancement
- FlowIE: Efficient Image Enhancement via Rectified Flow
- Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring
- Specularity Factorization for Low-Light Enhancement
- Zero-Reference Low-Light Enhancement via Physical Quadruple Priors
 ⭐code
- Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach
 ⭐code
- Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance
- Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
 
- Image Restoration - Learning Diffusion Texture Priors for Image Restoration
- CoDe: An Explicit Content Decoupling Framework for Image Restoration
- Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration
- Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
- Look-Up Table Compression for Efficient Image Restoration
- HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models
- DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
- Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance
 ⭐code
- Deep Equilibrium Diffusion Restoration with Parallel Sampling
 ⭐code
- Distilling Semantic Priors from SAM to Efficient Image Restoration Models
- Boosting Image Restoration via Priors from Pre-trained Models
- Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration
- Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
 ⭐code
- Restoration by Generation with Constrained Priors
 🏠project
- Multimodal Prompt Perceiver: Empower Adaptiveness Generalizability and Fidelity for All-in-One Image Restoration
- Improving Image Restoration through Removing Degradations in Textual Representations
 ⭐code
 
- Image Inpainting - Brush2Prompt: Contextual Prompt Generator for Object Inpainting
- Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
- NeRFiller: Completing Scenes via Generative 3D Inpainting
- MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior
- Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
 ⭐code
 
- Image Outpainting - Shadow-Enlightened Image Outpainting
 
- Image Quality - Blind Image Quality Assessment Based on Geometric Order Learning
- Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization
- Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment
- TextCraftor: Your Text Encoder Can be Image Quality Controller
- Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement
 
- Adverse Weather Removal - Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal
 ⭐code
- Language-driven All-in-one Adverse Weather Removal
 
- Atmospheric Turbulence Removal - NB-GTR: Narrow-Band Guided Turbulence Removal
 
- Image Portrait Relighting - SwitchLight: Co-design of Physics-driven Architecture and Pre-training Framework for Human Portrait Relighting
 🏠project
 
- Image Downscaling - Deep Generative Model based Rate-Distortion for Image Downscaling Assessment
 
- Image Correction - Rolling Shutter Correction with Intermediate Distortion Flow Estimation
 
- Image Colorization - Learning Inclusion Matching for Animation Paint Bucket Colorization
 ⭐code
- Automatic Controllable Colorization via Imagination
 ⭐code
 
- Motion (De)blurring - Motion Blur Decomposition with Cross-shutter Guidance
- Spike-guided Motion Deblurring with Unknown Modal Spatiotemporal Alignment
 ⭐code
- Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization
- Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring
- Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring
 ⭐code
 
- Video Inpainting - AVID: Any-Length Video Inpainting with Diffusion Model
 ⭐code
 🏠project
- Towards Language-Driven Video Inpainting via Multimodal Large Language Models
 🏠project
 
- Video Dehazing - Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance
 
- Video De-rendering - Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering
 
- Video Deblurring - Frequency-aware Event-based Video Deblurring for Real-World Motion Blur
- Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring
 ⭐code
 🏠project
- FMA-Net: Flow Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring
 ⭐code
 🏠project
- DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
 🏠project
 
- Video Enhancement - Binarized Low-light Raw Video Enhancement
 
- Video Quality Assessment - PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild
- Learned Scanpaths Aid Blind Panoramic Video Quality Assessment
- Modular Blind Video Quality Assessment
- KVQ: Kwai Video Quality Assessment for Short-form Videos
- CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement
 ⭐code
 
- Nighttime Color Constancy - NightCC: Nighttime Color Constancy via Adaptive Channel Masking
 
- Lighting Estimation - Towards a Perceptual Evaluation Framework for Lighting Estimation
 
2. Image Segmentation
- Matching Anything by Segmenting Anything
 ⭐code
- Unsupervised Universal Image Segmentation
- MESA: Matching Everything by Segmenting Anything
- MRFS: Mutually Reinforcing Image Fusion and Segmentation
- RobustSAM: Segment Anything Robustly on Degraded Images
- Hierarchical Histogram Threshold Segmentation - Auto-terminating High-detail Oversegmentation
- Multi-Space Alignments Towards Universal LiDAR Segmentation
- CoralSCOP: Segment any COral Image on this Planet
- SANeRF-HQ: Segment Anything for NeRF in High Quality
 🏠project
- ASAM: Boosting Segment Anything Model with Adversarial Tuning
- ODIN: A Single Model for 2D and 3D Segmentation
 ⭐code
- FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
 👍Abstract
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
- Universal Segmentation at Arbitrary Granularity with Language Instruction
- Segment and Caption Anything
 🏠project
- COCONut: Modernizing COCO Segmentation
 ⭐code
- Multi-view Aggregation Network for Dichotomous Image Segmentation
 ⭐code
- OMG-Seg: Is One Model Good Enough For All Segmentation?
 🏠project
- Unsegment Anything by Simulating Deformation
- BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
 ⭐code
- VRP-SAM: SAM with Visual Reference Prompt
- PEM: Prototype-based Efficient MaskFormer for Image Segmentation
- Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM
 ⭐code
- CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
 🏠project
- Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
 ⭐code
- CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers
- Continual Segmentation with Disentangled Objectness Learning and Class Recognition
 ⭐code
- Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms
 ⭐code
- Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
 ⭐code
 🏠project
- Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model
- A Simple Recipe for Language-guided Domain Generalized Segmentation
 🏠project
- Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts
 ⭐code
- Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation
 ⭐code
 👍Does the Segment Anything Model (SAM) generalize poorly? A domain adaptation strategy addresses it

- Open-Vocabulary Segmentation - Transferable and Principled Efficiency for Open-Vocabulary Segmentation
- USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation
- Open-Vocabulary Segmentation with Semantic-Assisted Calibration
 ⭐code
- OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation
 
- Video Segmentation - UniVS: Unified and Universal Video Segmentation with Prompts as Queries
 ⭐code
- Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence
 🏠project
- Learning to Segment Referred Objects from Narrated Egocentric Videos
- Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
 ⭐code
 
- Semantic Segmentation - Open Set Domain Adaptation for Semantic Segmentation
- ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention
- MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation
- TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
- ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers
- HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation
- Contextrast: Contextual Contrastive Learning for Semantic Segmentation
- SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
 ⭐code
- Frequency-Adaptive Dilated Convolution for Semantic Segmentation
 ⭐code
- GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation
- Improving Bird's Eye View Semantic Segmentation by Task Decomposition
 ⭐code
- UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
- Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
- Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball
- 3D Semantic Segmentation - Hierarchical Intra-modal Correlation Learning for Label-free 3D Semantic Segmentation
- OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation
 
- Point Cloud Semantic Segmentation - Rethinking Few-shot 3D Point Cloud Semantic Segmentation
 ⭐code
- PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation
 
- Unsupervised Semantic Segmentation - Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation
- EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation
 ⭐code
 🏠project
 
- Few-Shot Semantic Segmentation - APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation
- Unlocking the Potential of Pre-trained Vision Transformers for Few-Shot Semantic Segmentation through Relationship Descriptors
 
- Zero-Shot Semantic Segmentation - Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation
 
- Semi-Supervised Semantic Segmentation - Training Vision Transformers for Semi-Supervised Semantic Segmentation
- Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling
- AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
 ⭐code
- CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation
 ⭐code
- Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation
 ⭐code
- RankMatch: Exploring the Better Consistency Regularization for Semi-supervised Semantic Segmentation
 
- Weakly Supervised Semantic Segmentation - Class Tokens Infusion for Weakly Supervised Semantic Segmentation
- Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation
- DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
 ⭐code
- Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation
 ⭐code
- Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation
 ⭐code
- PSDPM: Prototype-based Secondary Discriminative Pixels Mining for Weakly Supervised Semantic Segmentation
- From SAM to CAMs: Exploring Segment Anything Model for Weakly Supervised Semantic Segmentation
 
- Domain Generalized Semantic Segmentation - Collaborating Foundation Models for Domain Generalized Semantic Segmentation
 ⭐code
- Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning
 ⭐code
- Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
 ⭐code
 
- Text-Supervised Semantic Segmentation - Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation
 
- Open-World Semantic Segmentation - Open-World Semantic Segmentation Including Class Similarity
 ⭐code
 
- Open-Vocabulary Semantic Segmentation - Open-Vocabulary 3D Semantic Segmentation with Foundation Models
- Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
- CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
- Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
- SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
 ⭐code
- Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
 ⭐code
 
 
- Panoptic Segmentation - Semantics Distortion and Style Matter: Towards Source-free UDA for Panoramic Segmentation
- ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
 ⭐code
- PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
 ⭐code
- Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
 ⭐code
 
- Instance Segmentation - Extreme Point Supervised Instance Segmentation
- Mudslide: A Universal Nuclear Instance Segmentation Method
- Semantic-aware SAM for Point-Prompted Instance Segmentation
- SAI3D: Segment Any Instance in 3D Scenes
 🏠project
- DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
- FISBe: A Real-World Benchmark Dataset for Instance Segmentation of Long-Range Thin Filamentous Structures
 ⭐code
- Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge
- Open-Vocabulary Instance Segmentation - MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
 🏠project
 
- 3D Instance Segmentation - BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation
 ⭐code
- Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
- Edge-Aware 3D Instance Segmentation Network with Intelligent Semantic Prior
- UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes
 
- Scene Segmentation - No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation
 ⭐code
- MirageRoom: 3D Scene Segmentation with 2D Pre-trained Models by Mirage Projection
 
- Action Segmentation - Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos
- Coherent Temporal Synthesis for Incremental Action Segmentation
- Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment
- Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation
- FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Action Segmentation
 ⭐code
 
- Referring Image Segmentation - LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation
- Mask Grounding for Referring Image Segmentation
 🏠project
- Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
- Prompt-Driven Referring Image Segmentation with Instance Contrasting
 
- Referring Expression Segmentation - Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation
 ⭐code
 
- VOS - Point-VOS: Pointing Up Video Object Segmentation
 🏠project
- Dual Prototype Attention for Unsupervised Video Object Segmentation
 ⭐code
- Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
 ⭐code
- Putting the Object Back into Video Object Segmentation
 🏠project
- Event-assisted Low-Light Video Object Segmentation
- Guided Slot Attention for Unsupervised Video Object Segmentation
- LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
 ⭐code
- RMem: Restricted Memory Banks Improve Video Object Segmentation
 
- VSS - Infer from What You Have Seen Before: Temporally-dependent Classifier for Semi-supervised Video Semantic Segmentation
- Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes
 
- VIS - VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
 
- Matting - In-Context Matting
 ⭐code
- Unifying Automatic and Interactive Matting with Pretrained ViTs
- MaGGIe: Masked Guided Gradual Human Instance Matting
- EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting
 
- Few-Shot Segmentation - Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation
- Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation
 ⭐code
- Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
- Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation
- LLaFS: When Large Language Models Meet Few-Shot Segmentation
- Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
 ⭐code
- Addressing Background Context Bias in Few-Shot Segmentation through Iterative Modulation
 
- Zero-Shot Segmentation - Diffuse Attend and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
 ⭐code
 
- Crack Segmentation - Mind Marginal Non-Crack Regions: Clustering-Inspired Representation Learning for Crack Segmentation
 
- Interactive Segmentation - GraCo: Granularity-Controllable Interactive Segmentation
 📺video
 👍Abstract
- MFP: Making Full Use of Probability Maps for Interactive Image Segmentation
 ⭐code
 
- Amodal Segmentation - pix2gestalt: Amodal Segmentation by Synthesizing Wholes
 ⭐code
 🏠project
 
- 3D Segmentation - OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning
- PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation
- Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation
- LASO: Language-guided Affordance Segmentation on 3D Object