Updated on 2024/06/27 08:51:11
小目标
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024/06/20 | Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines | Xinyi Ying et al. | 2406.14482v1 | link |
2024/06/12 | MWIRSTD: A MWIR Small Target Detection Dataset | Nikhil Kumar et al. | 2406.08063v1 | link |
2024/06/12 | Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing | Sina Tayebati et al. | 2406.07833v1 | link |
2024/06/09 | A DeNoising FPN With Transformer R-CNN for Tiny Object Detection | Hou-I Liu et al. | 2406.05755v3 | link |
2024/06/08 | Select-Mosaic: Data Augmentation Method for Dense Small Object Scenes | Hao Zhang et al. | 2406.05412v1 | link |
2024/05/18 | Visible and Clear: Finding Tiny Objects in Difference Map | Bing Cao et al. | 2405.11276v1 | NULL |
2024/05/02 | SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients | Tushar Verma et al. | 2405.01699v2 | NULL |
2024/04/25 | Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection | Mehmet Kerem Turkcan et al. | 2404.16944v1 | link |
2024/04/20 | Efficient and Concise Explanations for Object Detection with Gaussian-Class Activation Mapping Explainer | Quoc Khanh Nguyen et al. | 2404.13417v1 | NULL |
2024/04/09 | YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images | Chenguang Liu et al. | 2404.06180v2 | link |
2024/04/05 | SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers | Weile Li et al. | 2404.04179v1 | link |
2024/04/04 | DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Yi-Xin Huang et al. | 2404.03507v2 | NULL |
2024/03/16 | HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection | Shibiao Xu et al. | 2403.10778v1 | link |
2024/03/06 | FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion | Hao Wang et al. | 2403.03463v1 | NULL |
2024/02/22 | YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5 | Peng Gao et al. | 2402.14309v1 | NULL |
2024/02/20 | YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection | Xiaoyu Tang et al. | 2402.12641v1 | link |
2024/02/01 | Vehicle Perception from Satellite | Bin Zhao et al. | 2402.00703v1 | link |
2024/01/16 | Robust Tiny Object Detection in Aerial Images amidst Label Noise | Haoran Zhu et al. | 2401.08056v1 | link |
2024/01/16 | Small Object Detection by DETR via Information Augmentation and Adaptive Feature Fusion | Ji Huang et al. | 2401.08017v1 | NULL |
2024/01/08 | Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning | Chen Zhao et al. | 2401.04105v2 | link |
2023/12/15 | Small Bird Detection using YOLOv7 with Test-Time Augmentation | Kosuke Shigematsu | 2401.01018v1 | NULL |
2023/12/05 | Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline | Xiaoqi Zhao et al. | 2312.02528v2 | link |
2023/11/14 | Deep Learning-Based Object Detection in Maritime Unmanned Aerial Vehicle Imagery: Review and Experimental Comparisons | Chenjie Zhao et al. | 2311.07955v2 | NULL |
2023/11/13 | Enhancing Lightweight Neural Networks for Small Object Detection in IoT Applications | Liam Boyle et al. | 2311.07163v1 | NULL |
2023/11/08 | S$^3$AD: Semi-supervised Small Apple Detection in Orchard Environments | Robert Johanson et al. | 2311.05029v1 | NULL |
2023/10/22 | The Importance of Anti-Aliasing in Tiny Object Detection | Jinlai Ning et al. | 2310.14221v1 | link |
2023/10/21 | Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images | Bissmella Bahaduri et al. | 2310.13876v3 | link |
2023/10/09 | DANet: Enhancing Small Object Detection through an Efficient Deformable Attention Network | Md Sohag Mia et al. | 2310.05768v2 | NULL |
2023/09/28 | HIC-YOLOv5: Improved YOLOv5 For Small Object Detection | Shiyi Tang et al. | 2309.16393v2 | link |
2023/09/27 | Joint-YODNet: A Light-weight Object Detector for UAVs to Achieve Above 100fps | Vipin Gautam et al. | 2309.15782v1 | NULL |
注意力机制
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024/06/25 | Brain Tumor Classification using Vision Transformer with Selective Cross-Attention Mechanism and Feature Calibration | Mohammad Ali Labbaf Khaniki et al. | 2406.17670v1 | NULL |
2024/06/25 | MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Michelle Adeline et al. | 2406.17654v1 | NULL |
2024/06/25 | Point Tree Transformer for Point Cloud Registration | Meiling Wang et al. | 2406.17530v1 | NULL |
2024/06/25 | Dual-Space Knowledge Distillation for Large Language Models | Songming Zhang et al. | 2406.17328v1 | link |
2024/06/25 | Multimodal Cross-Task Interaction for Survival Analysis in Whole Slide Pathological Images | Songhan Jiang et al. | 2406.17225v1 | link |
2024/06/24 | BrainMAE: A Region-aware Self-supervised Learning Framework for Brain Signals | Yifan Yang et al. | 2406.17086v1 | NULL |
2024/06/24 | FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models | Haonan Qiu et al. | 2406.16863v1 | link |
2024/06/24 | Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers | Chao Lou et al. | 2406.16747v1 | NULL |
2024/06/24 | Demystifying the Effect of Receptive Field Size in U-Net Models for Medical Image Segmentation | Vincent Loos et al. | 2406.16701v1 | NULL |
2024/06/24 | Multi-Modal Vision Transformers for Crop Mapping from Satellite Image Time Series | Theresa Follath et al. | 2406.16513v1 | NULL |
2024/06/24 | OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser | Jingze Shi et al. | 2406.16495v2 | link |
2024/06/24 | Wavelet Attention GRU for Efficient Industrial Gas Recognition with Novel Metrics | Ding Wang | 2406.16997v1 | NULL |
2024/06/24 | Lesion-Aware Cross-Phase Attention Network for Renal Tumor Subtype Classification on Multi-Phase CT Scans | Kwang-Hyun Uhm et al. | 2406.16322v1 | NULL |
2024/06/23 | CAVM: Conditional Autoregressive Vision Model for Contrast-Enhanced Brain Tumor MRI Synthesis | Lujun Gui et al. | 2406.16074v1 | NULL |
2024/06/23 | DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Yueru Luo et al. | 2406.16072v1 | link |
2024/06/23 | RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization | Mingshu Zhao et al. | 2406.16004v1 | link |
2024/06/22 | LaneSegNet Design Study | William Stevens et al. | 2406.15946v1 | NULL |
2024/06/22 | Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration | Zhongzhi Yu et al. | 2406.15765v1 | link |
2024/06/22 | TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers | Yakun Song et al. | 2406.15752v1 | NULL |
2024/06/21 | Generating Music with Structure Using Self-Similarity as Attention | Sophia Hager et al. | 2406.15647v2 | NULL |
2024/06/21 | Open-Vocabulary Temporal Action Localization using Multimodal Guidance | Akshita Gupta et al. | 2406.15556v1 | NULL |
2024/06/21 | GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation | Chubin Zhang et al. | 2406.15333v1 | link |
2024/06/21 | Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild | Nadav Orzech et al. | 2406.15331v1 | NULL |
2024/06/21 | Fine-grained Attention in Hierarchical Transformers for Tabular Time-series | Raphael Azorin et al. | 2406.15327v1 | link |
2024/06/21 | A Wavelet Guided Attention Module for Skin Cancer Classification with Gradient-based Feature Fusion | Ayush Roy et al. | 2406.15128v1 | link |
2024/06/21 | FA-Net: A Fuzzy Attention-aided Deep Neural Network for Pneumonia Detection in Chest X-Rays | Ayush Roy et al. | 2406.15117v1 | link |
2024/06/21 | SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning | Matthias Weissenbacher et al. | 2406.15025v1 | NULL |
2024/06/21 | Optimised Grouped-Query Attention Mechanism for Transformers | Yuang Chen et al. | 2406.14963v1 | NULL |
2024/06/21 | Pathformer: Recursive Path Query Encoding for Complex Logical Query Answering | Chongzhi Zhang et al. | 2406.14880v1 | NULL |
2024/06/20 | Boosting Hyperspectral Image Classification with Gate-Shift-Fuse Mechanisms in a Novel CNN-Transformer Approach | Mohamed Fadhlallah Guerri et al. | 2406.14120v1 | NULL |
背景差分
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024/05/28 | Improving mid-infrared thermal background subtraction with Principal Component Analysis | Hélène Rousseau et al. | 2405.18043v1 | NULL |
2024/05/24 | SMILES Initial Data Release: Unveiling the Obscured Universe with MIRI Multi-band Imaging | Stacey Alberts et al. | 2405.15972v1 | NULL |
2024/05/21 | Monte Carlos for tau lepton – Standard Model and New Physics signatures | Z. Was | 2405.12570v2 | NULL |
2024/04/09 | JADES Data Release 3 – NIRSpec/MSA spectroscopy for 4,000 galaxies in the GOODS fields | Francesco D’Eugenio et al. | 2404.06531v1 | NULL |
2024/04/03 | Characterization of contaminants in the Lyman-alpha forest auto-correlation with DESI | J. Guy et al. | 2404.03003v2 | NULL |
2024/02/21 | QCD corrections to the Darwin coefficient in inclusive semileptonic $B\rightarrow X_u \ell \barν_\ell$ decays | Daniel Moreno | 2402.13805v2 | NULL |
2024/02/15 | Hand Shape and Gesture Recognition using Multiscale Template Matching, Background Subtraction and Binary Image Analysis | Ketan Suhaas Saichandran | 2402.09663v1 | NULL |
2024/02/12 | First Result for Dark Matter Search by WINERED | Wen Yin et al. | 2402.07976v1 | NULL |
2024/02/08 | Multiplicity Based Background Subtraction for Jets in Heavy Ion Collisions | Tanner Mengel et al. | 2402.10945v1 | NULL |
2024/02/01 | MUSTAN: Multi-scale Temporal Context as Attention for Robust Video Foreground Segmentation | Praveen Kumar Pokala et al. | 2402.00918v1 | NULL |
2024/01/11 | Enhancing Sensing-Assisted Communications in Cluttered Indoor Environments through Background Subtraction | Andrea Ramos et al. | 2401.05763v1 | NULL |
2024/01/05 | Traffic Cameras to detect inland waterway barge traffic: An Application of machine learning | Geoffery Agorku et al. | 2401.03070v1 | NULL |
2023/12/19 | The stellar mass function of quiescent galaxies in 2 < z < 2.5 protoclusters | Adit H. Edward et al. | 2312.12380v1 | NULL |
2023/12/14 | Solar flare catalog from 3 years of Chandrayaan-2 XSM observations | Aravind Bharathi Valluvan et al. | 2312.09191v2 | link |
2023/12/11 | Efficiency of solar microflares in accelerating electrons when rooted in a sunspot | Jonas Saqri et al. | 2312.06856v2 | NULL |
2023/12/11 | Non-negative matrix factorization approach to sky subtraction for optical spectroscopy | Fedor Kolganov et al. | 2312.06761v1 | NULL |
2023/12/04 | Cable Slack Detection for Arresting Gear Application using Machine Vision | Ari Goodman et al. | 2312.02320v1 | NULL |
2023/12/02 | Separating the spectral counterparts in NGC 1275/Perseus cluster in X-rays | Elena Fedorova et al. | 2312.01174v1 | NULL |
2023/10/25 | Spectral Background-Subtracted Activity Maps | Carsten Denker et al. | 2310.16747v1 | NULL |
2023/10/12 | Analytical estimation of the signal to noise ratio efficiency in axion dark matter searches using a Savitzky-Golay filter | A. K. Yi et al. | 2310.07967v2 | NULL |
2023/09/28 | The Hyper Suprime-Cam extended Point Spread Functions and applications | L. P. Garate-Nuñez et al. | 2309.16244v2 | link |
2023/09/27 | Learning Spatial-Temporal Regularized Tensor Sparse RPCA for Background Subtraction | Basit Alawode et al. | 2309.15576v1 | NULL |
2023/09/15 | A Ground Segmentation Method Based on Point Cloud Map for Unstructured Roads | Zixuan Li et al. | 2309.08164v1 | NULL |
2023/09/12 | Reference Frames and Black Hole Thermodynamics | Franco Fiorini et al. | 2309.06293v2 | NULL |
2023/08/23 | Computational models of object motion detectors accelerated using FPGA technology | Pedro Machado | 2310.06842v1 | NULL |
2023/08/02 | Measurement of the $B_s^0 \to μμ$ Effective Lifetime with the ATLAS Detector | ATLAS Collaboration | 2308.01171v2 | NULL |
2023/06/29 | Effect of Background Signal on Momentum Imaging | Sukanta Das et al. | 2306.16708v1 | NULL |
2023/06/02 | Inaccuracies and biases of the Gaussian size deconvolution for extracted sources and filaments | Alexander Men’shchikov | 2306.01563v2 | NULL |
2023/05/29 | Human Body Shape Classification Based on a Single Image | Cameron Trotter et al. | 2305.18480v1 | NULL |
2023/05/26 | Awesome SOSS: Transmission Spectroscopy of WASP-96b with NIRISS/SOSS | Michael Radica et al. | 2305.17001v2 | NULL |