Updated on 2025/08/13 09:12:35

Table of Contents
  1. 小目标
  2. 注意力机制
  3. 背景差分

小目标

Publish Date Title Authors PDF Code
2025/07/28 Tracking Moose using Aerial Object Detection Christopher Indris et al. 2507.21256v1 NULL
2025/07/28 An Improved YOLOv8 Approach for Small Target Detection of Rice Spikelet Flowering in Field Environments Beizhang Chen et al. 2507.20506v1 NULL
2025/07/25 Revisiting DETR for Small Object Detection via Noise-Resilient Query Optimization Xiaocheng Fang et al. 2507.19059v1 NULL
2025/07/17 RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images Xiaozheng Jiang et al. 2507.13120v1 NULL
2025/07/17 SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery Peijun Wang et al. 2507.12727v1 NULL
2025/07/16 InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization Haoyuan Liu et al. 2507.12420v1 NULL
2025/07/09 A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level Johanna Orsholm et al. 2507.06972v1 NULL
2025/07/01 High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery Hongxing Peng et al. 2507.00825v2 NULL
2025/06/30 Event-based Tiny Object Detection: A Benchmark Dataset and Baseline Nuo Chen et al. 2506.23575v1 NULL
2025/06/15 MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection Yuxiang Wang et al. 2506.12697v1 NULL
2025/06/11 CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects Tao Liu et al. 2506.09897v1 NULL
2025/05/28 Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection Guiping Cao et al. 2505.21868v1 NULL
2025/05/27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et al. 2505.21457v1 NULL
2025/05/27 Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing Dehao Wang et al. 2505.21049v1 NULL
2025/05/22 MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection Yichen Li et al. 2505.16442v1 NULL
2025/05/15 Application of YOLOv8 in monocular downward multiple Car Target detection Shijie Lyu 2505.10016v2 NULL
2025/05/09 Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection Zhangchi Hu et al. 2505.05741v2 NULL
2025/05/05 DPNet: Dynamic Pooling Network for Tiny Object Detection Luqi Gong et al. 2505.02797v1 NULL
2025/04/30 Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors Richard Schmit 2505.00044v1 NULL
2025/04/29 Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection Siwei Wang et al. 2504.20602v1 NULL
2025/04/25 MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View Liugang Lu et al. 2504.18136v1 NULL
2025/04/18 HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection YangChen Zeng 2504.13469v1 NULL
2025/04/14 Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware Muhammad Fasih Tariq et al. 2504.09900v1 NULL
2025/04/11 SO-DETR: Leveraging Dual-Domain Features and Knowledge Distillation for Small Object Detection Huaxiang Zhang et al. 2504.11470v1 NULL
2025/03/29 Context in object detection: a systematic literature review Mahtab Jamali et al. 2503.23249v1 NULL
2025/03/26 Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications Mahya Nikouei et al. 2503.20516v1 NULL
2025/03/24 LGI-DETR: Local-Global Interaction for UAV Object Detection Zifa Chen 2503.18785v1 NULL
2025/03/18 YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection via Prior-Guided Enhancement and Multi-Branch Feature Interaction Ziyu Lin et al. 2503.13883v3 NULL
2025/03/06 DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO Aditya Prashant Naidu et al. 2503.04698v1 NULL
2025/03/06 ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem Yu-Hsi Chen et al. 2503.04500v2 NULL

注意力机制

Publish Date Title Authors PDF Code
2025/08/11 THAT: Token-wise High-frequency Augmentation Transformer for Hyperspectral Pansharpening Hongkun Jin et al. 2508.08183v1 NULL
2025/08/11 CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data Chongke Bi et al. 2508.08173v1 NULL
2025/08/11 Advancing Knowledge Tracing by Exploring Follow-up Performance Trends Hengyu Liu et al. 2508.08019v1 NULL
2025/08/11 A Physics-informed Deep Operator for Real-Time Freeway Traffic State Estimation Hongxin Yu et al. 2508.08002v1 NULL
2025/08/11 Learning Satellite Attitude Dynamics with Physics-Informed Normalising Flow Carlo Cena et al. 2508.07841v1 NULL
2025/08/11 DiTVR: Zero-Shot Diffusion Transformer for Video Restoration Sicheng Gao et al. 2508.07811v1 NULL
2025/08/11 Anatomy-Aware Low-Dose CT Denoising via Pretrained Vision Models and Semantic-Guided Contrastive Learning Runze Wang et al. 2508.07788v1 NULL
2025/08/11 Voice Pathology Detection Using Phonation Sri Raksha Siva et al. 2508.07587v1 NULL
2025/08/11 Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing Joonghyuk Shin et al. 2508.07519v1 NULL
2025/08/10 Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers Xin Ma et al. 2508.07246v1 NULL
2025/08/10 Enhancing Rumor Detection Methods with Propagation Structure Infused Language Model Chaoqun Cui et al. 2508.07209v1 NULL
2025/08/10 DySK-Attn: A Framework for Efficient, Real-Time Knowledge Updating in Large Language Models via Dynamic Sparse Knowledge Attention Kabir Khan et al. 2508.07185v1 NULL
2025/08/09 Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning Lijie Yang et al. 2508.07101v1 NULL
2025/08/09 Structure-Preserving Digital Twins via Conditional Neural Whitney Forms Brooks Kinch et al. 2508.06981v1 NULL
2025/08/09 Intrinsic Explainability of Multimodal Learning for Crop Yield Prediction Hiba Najjar et al. 2508.06939v1 NULL
2025/08/08 Early Detection of Pancreatic Cancer Using Multimodal Learning on Electronic Health Record Mosbah Aouad et al. 2508.06627v2 NULL
2025/08/08 WGAST: Weakly-Supervised Generative Network for Daily 10 m Land Surface Temperature Estimation via Spatio-Temporal Fusion Sofiane Bouaziz et al. 2508.06485v1 NULL
2025/08/08 MotionSwap Om Patil et al. 2508.06430v1 NULL
2025/08/08 Automatic Semantic Alignment of Flow Pattern Representations for Exploration with Large Language Models Weihan Zhang et al. 2508.06300v1 NULL
2025/08/08 An Interpretable Multi-Plane Fusion Framework With Kolmogorov-Arnold Network Guided Attention Enhancement for Alzheimer’s Disease Diagnosis Xiaoxiao Yang et al. 2508.06157v1 NULL
2025/08/08 Mask & Match: Learning to Recognize Handwritten Math with Self-Supervised Attention Shree Mitra et al. 2508.06107v1 NULL
2025/08/08 Adaptive Heterogeneous Graph Neural Networks: Bridging Heterophily and Heterogeneity Qin Chen et al. 2508.06034v1 NULL
2025/08/08 Crisp Attention: Regularizing Transformers via Structured Sparsity Sagar Gandhi et al. 2508.06016v1 NULL
2025/08/08 ECMF: Enhanced Cross-Modal Fusion for Multimodal Emotion Recognition in MER-SEMI Challenge Juewen Hu et al. 2508.05991v1 NULL
2025/08/08 DAFMSVC: One-Shot Singing Voice Conversion with Dual Attention Mechanism and Flow Matching Wei Chen et al. 2508.05978v1 NULL
2025/08/07 Temporal Cluster Assignment for Efficient Real-Time Video Segmentation Ka-Wai Yung et al. 2508.05851v1 NULL
2025/08/07 An Effective Approach for Node Classification in Textual Graphs Rituparna Datta et al. 2508.05836v1 NULL
2025/08/07 Discrepancy-Aware Contrastive Adaptation in Medical Time Series Analysis Yifan Wang et al. 2508.05572v1 NULL
2025/08/07 Deformable Attention Graph Representation Learning for Histopathology Whole Slide Image Analysis Mingxi Fu et al. 2508.05382v1 NULL
2025/08/07 FDC-Net: Rethinking the association between EEG artifact removal and multi-dimensional affective computing Wenjia Dong et al. 2508.05231v2 NULL

背景差分

Publish Date Title Authors PDF Code
2025/08/01 Noise Reduction Method for Radio Astronomy Single Station Observation Based on Wavelet Transform and Mathematical Morphology Ming-wei Qin et al. 2508.00386v1 NULL
2025/07/31 Quantum-enhanced dark matter detection using Schrödinger cat states Pan Zheng et al. 2507.23538v1 NULL
2025/07/18 Deep Image Reconstruction for Background Subtraction in Heavy-Ion Collisions Umar Sohail Qureshi et al. 2507.14036v1 NULL
2025/07/16 Frequency-responsive RCS characteristics and scaling implications for ISAC development Saúl Fenollosa et al. 2507.12235v1 NULL
2025/07/14 Global sky background images for JWST/NIRISS Wide-Field Slitless Spectroscopy Gaël Noirot 2507.10650v1 NULL
2025/07/02 Reliable Magnetometry for Antiferromagnets and Thin Films: Correcting Substrate Artifacts in Mn3Sn/MgO Systems Katarzyna Gas et al. 2507.01385v1 NULL
2025/06/22 In pursuit of the low-energy Solar neutron flux Prithish Halder et al. 2506.17985v1 NULL
2025/06/17 Comparison of Two Methods for Stationary Incident Detection Based on Background Image Deepak Ghimire et al. 2506.14256v1 NULL
2025/06/09 Optical tweezers as a tool for quantitative imaging Ilya M Beskin et al. 2506.08186v1 NULL
2025/06/03 COSMOS-Web: MIRI Data Reduction and Number Counts at 7.7$μ$m using JWST Santosh Harish et al. 2506.03306v1 NULL
2025/06/03 COSMOS-Web: Comprehensive Data Reduction for Wide-Area JWST NIRCam Imaging Maximilien Franco et al. 2506.03256v1 NULL
2025/05/23 A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory – Performance Pedro Jose Bauza-Ruiz et al. 2505.17996v1 NULL
2025/05/21 A Methodology to Evaluate Strategies Predicting Rankings on Unseen Domains Sébastien Piérard et al. 2505.15595v1 NULL
2025/05/12 SAEN-BGS: Energy-Efficient Spiking AutoEncoder Network for Background Subtraction Zhixuan Zhang et al. 2505.07336v1 NULL
2025/04/30 Higher derivative corrections to Kerr-AdS black hole thermodynamics Wei Guo et al. 2504.21724v1 NULL
2025/04/25 Outlier-aware Tensor Robust Principal Component Analysis with Self-guided Data Augmentation Yangyang Xu et al. 2504.18323v1 NULL
2025/03/24 MEGA Mass Assembly with JWST: The MIRI EGS Galaxy and AGN Survey Bren E. Backhaus et al. 2503.19078v2 NULL
2025/03/24 The PHANGS-HST-Halpha Survey: Warm Ionized Gas Physics at High Angular resolution in Nearby GalaxieS with the Hubble Space Telescope Rupali Chandar et al. 2503.18791v1 NULL
2025/03/21 UV LIGHTS. New tools for revealing the low surface brightness regime in the ultraviolet Ignacio Ruiz Cejudo et al. 2503.17446v1 NULL
2025/03/14 Black Hole Action in Einstein’s other Gravity Michal Stano 2503.11746v1 NULL
2025/03/10 Identification and Removal of System-Induced Autofluorescence in Miniaturized Fiber-optic Fluorescence Endoscopes Lei Xiang et al. 2503.07921v1 NULL
2025/03/04 Quantitative exploration of the similarity of gamma-ray pulsar light curves C. R. García et al. 2503.02750v1 NULL
2025/03/04 LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset Wenqi Guo et al. 2503.02910v3 NULL
2025/03/03 An Approach for Air Drawing Using Background Subtraction and Contour Extraction Ramkrishna Acharya 2503.01497v1 NULL
2025/03/01 Detection of Customer Interested Garments in Surveillance Video using Computer Vision Earnest Paul Ijjina et al. 2503.00442v1 NULL
2025/03/01 Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups Nicholas Pfaff et al. 2503.00370v2 NULL
2025/02/21 Equivariant localization in supergravity in odd dimensions Edoardo Colombo et al. 2502.15624v1 NULL
2025/02/13 ALMA-IMF XVII – Census and lifetime of high-mass prestellar cores in 14 massive protoclusters M. Valeille-Manet et al. 2502.09426v1 NULL
2025/02/06 Drone Beam Mapping of the TONE Radio Dish Array Emily R. Kuhn et al. 2502.03759v1 NULL
2025/01/14 Background subtraction method is not only much simpler, but also as applicable as covariant counterterm method Wei Guo et al. 2501.08214v3 NULL