已发表成果:
WOK 论文 537 篇;中文核心 11 篇;其它论文 1 篇;专利发明 8 个;
An efficient blur kernel estimation method for blind image Super-Resolution
You only compress once: Towards effective and elastic BERT compression via exploit-explore stochastic nature gradient
Deep hybrid transformer network for robust modulation classification in wireless communications
Continual Face Forgery Detection via Historical Distribution Preserving
Adaptive Fuzzy Positive Learning for Annotation-Scarce Semantic Segmentation
CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection
Beyond Inter-Item Relations: Dynamic Adaptive Mixture-of-Experts for LLM-Based Sequential Recommendation
StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model
EASYINV: TOWARD FAST AND BETTER DDIM INVERSION
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation
Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
Image Captioning via Dynamic Path Customization
3D-GRES: Generalized 3D Referring Expression Segmentation
MOVE AND ACT: ENHANCED OBJECT MANIPULATION AND BACKGROUND INTEGRITY FOR IMAGE EDITING
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
AccDiffusion: An Accurate Method for Higher-Resolution Image Generation
ERQ: Error Reduction for Post-Training Quantization of Vision Transformers
Multi-branch Collaborative Learning Network for 3D Visual Grounding
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
ANYSR: REALIZING IMAGE SUPER-RESOLUTION AS ANY-SCALE, ANY-RESOURCE
Oracle Bone Inscriptions Multi-modal Dataset
HRSAM: Efficiently Segment Anything in High-Resolution Images
Identity-Aware Variational Autoencoder for Face Swapping
HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection
Local Manifold Learning for No-Reference Image Quality Assessment
UIO-LLMS: UNBIASED INCREMENTAL OPTIMIZATION FOR LONG-CONTEXT LLMS
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Depth-Guided Semi-Supervised Instance Segmentation
Evaluating and Analyzing Relationship Hallucinations in LVLMs
AnyTrans: Translate AnyText in the Image with Large Scale Models
VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval
SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
Image Captioning via Dynamic Path Customization
UniPTS: A Unified Framework for Proficient Post-Training Sparsity
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion
Optg: Optimizing Gradient-Driven Criteria in Network Sparsity
Boosting Multimodal Large Language Models with Visual Tokens Withdrawal for Rapid Inference
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
GraCo: Granularity-Controllable Interactive Segmentation
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
Multi-Modal Prompt Learning on Blind Image Quality Assessment
NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation
ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization
CycleTrans: Learning Neutral Yet Discriminative Features via Cycle Construction for Visible-Infrared Person Re-Identification
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
Deep Instruction Tuning for Segment Anything Model
DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis
Learning Image Demoiréing from Unpaired Real Data
Toward Open-Set Human Object Interaction Detection
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models
AFFINEQUANT: AFFINE TRANSFORMATION QUANTIZATION FOR LARGE LANGUAGE MODELS
DMAD: Dual Memory Bank for Real-World Anomaly Detection
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling
EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs
An Efficient Blur Kernel Estimation Method for Blind Image Super-Resolution
Shadow-aware dynamic convolution for shadow removal
A closer look at branch classifiers of multi-exit architectures
Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration
Feature Denoising Diffusion Model for Blind Image Quality Assessment
Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation
Cross-Modality Perturbation Synergy Attack for Person Re-identification
Learning Image Demoiréing from Unpaired Real Data
Defense Against Adversarial Attacks Using Topology Aligning Adversarial Training
Weakly-Supervised RGBD Video Object Segmentation
Preface
Preface
Preface
Preface
Preface
Preface
Preface
Preface
Preface
Two-Stage Deep Learning Segmentation for Tiny Brain Regions
Training-Free Transformer Architecture Search With Zero-Cost Proxy Guided Evolution
Uncovering the Over-Smoothing Challenge in Image Super-Resolution: Entropy-Based Quantification and Contrastive Optimization
EXPLORING TARGET REPRESENTATIONS FOR MASKED AUTOENCODERS
AFFINEQUANT: AFFINE TRANSFORMATION QUANTIZATION FOR LARGE LANGUAGE MODELS
DYNAMIC SPARSE NO TRAINING ?: TRAINING-FREE FINE-TUNING FOR SPARSE LLMS
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization
Adaptive Zone Learning for Weakly Supervised Object Localization
FUNCTIONALLY SIMILAR MULTI-LABEL KNOWLEDGE DISTILLATION
GreedyAgent: Crafting Efficient Agents for Meta-learning from Learning Curves via Greedy Algorithm Selection
厦门大学纪荣嵘教授团队在深度伪造检测领域取得新进展
信息网络安全,1671-1122,2024-11-10.