已发表成果:
WOK 论文 118 篇;中文核心 1 篇;
Unsupervised Domain Adaptation on Person Reidentification Via Dual-Level Asymmetric Mutual Learning
CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation
HRSAM: Efficiently Segment Anything in High-Resolution Images
Local Manifold Learning for No-Reference Image Quality Assessment
HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection
Depth-Guided Semi-Supervised Instance Segmentation
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
UniPTS: A Unified Framework for Proficient Post-Training Sparsity
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
Multi-Modal Prompt Learning on Blind Image Quality Assessment
NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation
DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis
Occluded Person Re-identification via Saliency-Guided Patch Transfer
Attention Disturbance and Dual-Path Constraint Network for Occluded Person Re-identification
Weakly Supervised Open-Vocabulary Object Detection
DMAD: Dual Memory Bank for Real-World Anomaly Detection
ISTR: Mask-Embedding-Based Instance Segmentation Transformer
Adaptive Zone Learning for Weakly Supervised Object Localization
Few-Shot Object Detection via Classify-Free RPN
Global Selection and Local Attention Network for Referring Image Segmentation
Hierarchical Focused Feature Pyramid Network for Small Object Detection
Weakly Supervised Open-Vocabulary Object Detection
A Unified Framework for 3D Point Cloud Visual Grounding
Pseudo-label Alignment for Semi-supervised Instance Segmentation
Attack Can Benefit: An Adversarial Approach to Recognizing Facial Expressions under Noisy Annotations
Practical Cross-System Shilling Attacks with Limited Access to Data
InterFormer: Real-time Interactive Image Segmentation
Geometric-aware Pretraining for Vision-centric 3D Object Detection
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Active Teacher for Semi-Supervised Object Detection
DistilPose: Tokenized Pose Regression with Heatmap Distillation
Unsupervised Domain Adaptation on Person Re-Identification via Dual-level Asymmetric Mutual Learning
Bilateral Knowledge Interaction Network for Referring Image Segmentation
Prioritized Subnet Sampling for Resource-Adaptive Supernet Training
CAM R-CNN: End-to-End Object Detection with Class Activation Maps
You Only Segment Once: Towards Real-Time Panoptic Segmentation
Beyond the Label Distribution Prior for Long-Tailed Recognition
DistilPose: Tokenized Pose Regression with Heatmap Distillation
Category-aware Allocation Transformer for Weakly Supervised Object Localization
InterFormer Real-time Interactive Image Segmentation
Pseudo-label Alignment for Semi-supervised Instance Segmentation
Self-Paced Partial Domain-Aware Learning for Face Anti-Spoofing
CANDY: Category-Kernelized Dynamic Convolution for Instance Segmentation
Cycle Encoding of a StyleGAN Encoder for Improved Reconstruction and Editability
Cycle Encoding of a StyleGAN Encoder for Improved Reconstruction and Editability
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
GuidedMix-Net: Semi-Supervised Semantic Segmentation by Using Labeled Images as Reference
Deepwalk-aware graph convolutional networks
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
Towards Robust Adversarial Training via Dual-label Supervised and Geometry Constraint
SeqTR: A Simple yet Universal Network for Visual Grounding
ARM: Any-Time Super-Resolution Method
Pruning Networks With Cross-Layer Ranking & k-Reciprocal Nearest Filters
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks
Active Teacher for Semi-Supervised Object Detection
SeqTR: A Simple Yet Universal Network for Visual Grounding
Knowledge Condensation Distillation
Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain
ARM: Any-Time Super-Resolution Method
Dual-Level Collaborative Transformer for Image Captioning
Architecture Disentanglement for Deep Neural Networks
EC-DARTS: Inducing Equalized and Consistent Optimization into DARTS
Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance
Filter Sketch for Network Pruning
Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis
Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation
Image-to-image Translation via Hierarchical Style Disentanglement
SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance
Exploring Language Prior for Mode-Sensitive Visual Attention Modeling
Link-aware semi-supervised hypergraph
Multi-task collaborative network for joint referring expression comprehension and segmentation
Cyclic guidance for weakly supervised joint detection and segmentation
Towards optimal structured CNN pruning via generative adversarial learning
Learning Similarity-specific Dictionary for Zero-shot Fine-grained Recognition
Towards Cross-modality Topic Modelling via Deep Topical Correlation Analysis
Hypergraph induced convolutional manifold networks
Generalized zero-shot vehicle detection in remote sensing imagery via coarse-to-fine framework
Many-to-One Gesture-to-Command Flexible Mapping Approach for Smart Teaching Interface Interaction
Cross-Modality Microblog Sentiment Prediction via Bi-Layer Multimodal Hypergraph Learning
Toward Optimal Manifold Hashing via Discrete Locally Linear Embedding
Multimodal media data understanding and analysis
Editorial Note: Multimodal Data Fusion, Learning and Applications
Weakly supervised vehicle detection in satellite images via multi-instance discriminative learning
Superpixel-based coastline extraction in SAR images with speckle noise removal
Joint Depth and Semantic Inference from a Single Image via Elastic Conditional Random Field
Vehicle Detection in High-Resolution Aerial Images Based on Fast Sparse Representation Classification and Multiorder Feature
Road Network Extraction via Aperiodic Directional Structure Measurement
A novel features ranking metric with application to scalable visual and bioinformatics data classification
Vehicle Detection in High-Resolution Aerial Images via Sparse Representation and Superpixels
Convolutional Deep Belief Networks for Single-Cell/Object Tracking in Computational Biology and Computer Vision
Semi-supervised feature learning for hyperspectral image classification
Vehicle detection from high-resolution aerial images based on superpixel and color name features
Robust Individual-Cell/Object Tracking via PCANet Deep Network in Biomedicine and Computer Vision
Deep neural networks-based vehicle detection in satellite images
Person re-identification based on multi-instance multi-label learning
Hypergraph regularized sparse feature learning
Interactive on-device Mobile Landmark Recognition with compact binary codes
Robust depth-based object tracking from a moving binocular camera
Localizing web videos using social images
Multimodal learning for view-based 3D object classification
Estimation of human body shape and cloth field in front of a kinect
Human behavior recognition based on 3D features and hidden markov models
Robust vehicle detection by combining deep features with exemplar classification
Question microblog identification and answer recommendation
Single/cross-camera multiple-person tracking by graph matching
Vehicle detection from highway satellite images via transfer learning
Question Popularity Analysis and Prediction in Community Question Answering Services
Shape completion for depth image via repeated objects registration
Robust latent semantic exploration for image retrieval in social media
A Topic Clustering Approach to Finding Similar Questions from Large Question and Answer Archives
News videos anchor person detection by shot clustering
High-capacity reversible watermarking scheme of 2D-vector data
双标签监督的几何约束对抗训练
软件学报,1000-9825,2022-04-15.