This section covers methods for recognizing, analyzing, and transforming images, speech, and other data patterns. Select a subsection for a more precise classification.
Articles under code 004.93
289 publications
Patient-Level Diagnosis of Acute Myeloid Leukemia via Deep Learning Analysis of Bone Marrow Smear
Yuqi Ma, Tianyi Wang, Weihua Meng, Hongru Chen, Fajin Tao, Qunxian Lu, Lin An, Xiaodong Mo, Gen Yang
· 2026
Spatially Selective Self-Training for Unsupervised Building Change Detection
Wafaa I. M. Hussin, Zhi Lu, Anas M. I. Mohammed, Xiang Zhou, Ratiba A. H. Abubaker, Zhenming Peng
· 2026
Depth from Dual Differential Defocus and Stereo Consensus
Junjie Luo, Wei Xu, Dylan Chu, Emma Alexander, Qi Guo
· 2026
AtlasGS: Brain MRI Spatial Resolution Harmonization With Shared Gaussian Geometry
Yifan Gao, Peiran Xu, Yimeng He, Haoran Li, Ziyang Long, Yufeng Wang, Ju Dong Yang, Debiao Li
· 2026
DexPIE: Stable Dexterous Policy Improvement from Real-World Experience
Ruizhe Liao, Wenrui Chen, Liangji Zeng, Haoran Lin, Fan Yang, Kailun Yang, Yaonan Wang
· 2026
CineDance: Towards Next-Generation Multi-Shot Long-Form Cinematic Audio-Video Generation
Yuheng Chen, Teng Hu, Yuji Wang, Qingdong He, Zhucun Xue, Qianyu Zhou, Xiangtai Li, Lizhuang Ma, Jiangning Zhang, Dacheng Tao
· 2026
X-Palm: Paired Multispectral-to-Smartphone Dataset for Cross-Domain Palmprint Authentication
Jamal Seyedmohammadi, Pai Chet Ng, Angelo Genovese, Zhixiang Chi, Jeannie Lee, Konstantinos N. Plataniotis
· 2026
Vendor-agnostic 4D Phase Contrast MRI: a complete open-source pipeline for velocities, displacement, and strain analysis
Marta B. Maggioni, Sabine M. Räuber, Katarina Puš, Bostjan Šimunič, Xeni Deligianni, Regina M. M. Schlaeger, Francesco Santini
· 2026
EgoTactile: Learning Grasp Pressure for Everyday Objects from Egocentric Video
Yuan Zeng, Yujia Shi, Tiao Tan, Xingting Li, Yaqi Qin, Zongqing Lu, Wenming Yang, Jing-Hao Xue, Qingmin Liao
· 2026
SOMA: From Surface Observations to Muscle Anatomy
Eduardo Alvarado, Emily Kim, Gerrit Nolte, Friedemann Runte, Mario Botsch, Marc Habermann, Christian Theobalt
· 2026
RFDT-Channel: RGB-LiDAR-Based RF Digital Twin Scene Construction for 28 GHz Indoor Ray-Tracing Channel Simulation
Chengyang Yao, Cunhua Pan, Jiaming Zeng, Yuquan Sun, Haoyang Weng, Haojian Wang, Hong Ren, Jiangzhou Wang
· 2026
ResNet-34 with Lightweight Decoder for Accurate and Efficient Segmentation of Fetal Brain MRI
Ashiqur Rahman, Muhammad E. H. Chowdhury, Md. Abu Sayed, Md. Sharjis Ibne Wadud, Abu Naser Md. Arafat, Mehedi Hasan Prince
· 2026
Leaf Spectral Reflectance Prediction Using Multi-Head Attention Neural Networks
Parastoo Farajpoor, Alireza Pourreza, Mohammadreza Narimani, Ashraf El-Kereamy, Matthew W. Fidelibus
· 2026
Mathematical framework for perception-driven parameter choice in image denoising
Saara Isoranta, Emilia L. K. Blåsten, Lílian Ferreira de Freitas, Jukka Häkkinen, Markus Juvonen, Samuli Siltanen
· 2026
A unified deep learning framework for contrast-phase-specific virtual monochromatic imaging
Antony Jerald, Hemant K Aggarwal, Brian Nett, Avinash Gopal, Phaneendra K Yalavarthy, Bipul Das, Rajesh Langoju
· 2026
Closed-Form Spectral Regularization for Multi-Task Model Merging
Yongxian Wei, Runxi Cheng, Xingxuan Zhang, Li Shen, Chun Yuan, Peng Cui, Dacheng Tao
· 2026
CULTURESCORE: Evaluating Cultural Faithfulness in Video Generation Models
Anku Rani, Wei Dai, Shravan Nayak, Pattie Maes, Mahdi M. Kalayeh, Paul Pu Liang
· 2026
AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization
Yu Li, Menghan Xia, Gongye Liu, Xintao Wang, Conglang Zhang, Lei Ke, Yuxuan Lin, Ruihang Chu, Pengfei Wan, Kun Gai, Yujiu Yang
· 2026
Differences in Detection: Explainability Where it Matters
Johannes Theodoridis, Johannes Maucher, Andreas Schilling
· 2026
Streaming Video Generation with Streaming Force Control
Hanhui Wang, Yiming Xie, Haiwen Feng, Zhaoyang Lv, Shenlong Wang, Huaizu Jiang
· 2026
UniSHARP: Universal Sharp Monocular View Synthesis
Meixi Song, Dizhe Zhang, Hao Ren, Ruiyang Zhang, Bo Du, Ming-Hsuan Yang, Lu Qi
· 2026
FM-fMRI: Event Conditioned Flow Matching for Rest-to-Task fMRI Time-Series Synthesis
Peiyu Duan, Jiyao Wang, Nicha C. Dvornek, Junlin Yang, Ziqi Gao, Lawrence H. Staib, James S. Duncan
· 2026
Measuring Prediction Uncertainty in Neural Cellular Automata
Ario Sadafi, Michael Deutges, Nassir Navab, Carsten Marr
· 2026
In-Context Multiple Instance Learning
Alexander Möllers, Marvin Sextro, Julius Hense, Gabriel Dernbach, Klaus-Robert Müller
· 2026
Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators
Chenming Zhu, Jingli Lin, Yilin Long, Peizhou Cao, Tai Wang, Jiangmiao Pang, Xihui Liu
· 2026
Complexity-Balanced Diffusion Splitting
Noam Issachar, Dani Lischinski, Raanan Fattal
· 2026
Symb-xMIL: Symbolic Explanations for Multiple Instance Learning in Digital Pathology
Yanqing Luo, Julius Hense, Niklas Prenißl, Andreas Mock, Klaus-Robert Müller, Thomas Schnake, Mina Jamshidi Idaji
· 2026
SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing
Haowang Cui, Rui Chen, Tao Luo, Tao Guo, Zheng Qin, Jiaze Wang
· 2026
Gender Artifacts from Art History to Text-to-Image Generation
Piera Riccio, Miriam Doh, Benedikt Höltgen, Noa Garcia, Nanne van Noord
· 2026
LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing
Jianzong Wu, Hao Lian, Jiongfan Yang, Dachao Hao, Ye Tian, Yunhai Tong, Jingyuan Zhu, Biaolong Chen, Qiaosong Qi, Aixi Zhang, Wanggui He, Mushui Liu, Jinlong Liu, Hao Jiang
· 2026
Closing the Alignment-Maturity Gap in Federated Prototype Learning
Mario Casado-Diez, Alejandro Dopico-Castro, Verónica Bolón-Canedo, Bertha Guijarro-Berdiñas
· 2026
Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization
Jingyun Liang, Min Wei, Shikai Li, Yizeng Han, Hangjie Yuan, Lei Sun, Weihua Chen, Fan Wang
· 2026
Absorption and Phase-Contrast Microtomography Using Direct X-ray Detection With COTS CMOS Sensors
Damian L. Corzi, Jose Lipovetzky, Fabricio Alcalde Bessia, German Mato, Andres Cicuttin, Maria L. Crespo, Martin Perez, Mariano Gomez Berisso
· 2026
MORPHOS: Autoregressive 4D Generation with Temporal Structured Latents
Minkyung Kwon, Jinhyeok Choi, Youngjin Shin, Jaeyeong Kim, JongMin Lee, Seungryong Kim
· 2026
GloResNet: A lightweight 3D CNN with global topological features for preterm brain injury prediction
Boyu Yuan, Jiamiao Lu, Weichuan Zhang, Benqing Wu, Tuo Wang, Changshan Wang, Changming Sun, Liang Guo
· 2026
Not All Points Are Equal: Uncertainty-Aware 4D LiDAR Scene Synthesis
Xiang Xu, Alan Liang, Youquan Liu, Xian Sun, Linfeng Li, Lingdong Kong, Ziwei Liu, Qingshan Liu
· 2026
VEDAL: Variational Error-Driven Asynchronous Learning for 3D Gaussian Splatting Pruning
Aoduo Li, Jiancheng Li, Huan Ye, Hongjian Xu, Shiting Wu, Xiujun Zhang, Zimeng Li, Xuhang Chen
· 2026
Multi-modal Video Representation Alignment for Robust Self-supervised Driver Distraction Detection
David J. Lerch, Livien Majer, Zeyun Zhong, Manuel Martin, Frederik Diederichs, Rainer Stiefelhagen
· 2026
Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains
Garvin Guo, Donglei Yu, Yu Chen, Xiang Wang, Shuai Li, Xinpei Zhao, Huaxing Liu, Qinghao Wang, Minpeng Liao
· 2026
STAMBRIDGE: Spectral-Temporal Amplitude-aware Mid-Feature Bridge for EEG Visual Decoding
Jiahe Meng, Weiming Zeng, Yueyang Li, Bo Chai, Hongjie Yan, Zhiguo Zhang, Wai Ting Siok, Nizhuan Wang
· 2026
Motion-Robust Deep Reconstruction for Free-Breathing Cardiac Cine MRI
Mahmut Yurt, Kanghyun Ryu, Zhitao Li, Xucheng Zhu, Xianglun Mao, Martin Janich, Marcus Alley, Kawin Setsompop, John Pauly, Shreyas Vasanawala, Ali Syed
· 2026
SOCO: Benchmarking Semantic Object Correspondence in Vision Foundation Models
Olaf Dünkel, Basavaraj Sunagad, Haoran Wang, David T. Hoffmann, Christian Theobalt, Adam Kortylewski
· 2026
Linear Scaling Video VLMs for Long Video Understanding
Cristobal Eyzaguirre, Jiajun Wu, Juan Carlos Niebles
· 2026
Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
Jiazheng Xing, Hangjie Yuan, Lingling Cai, Xinyu Liu, Yujie Wei, Fei Du, Hai Ci, Tao Feng, Jiasheng Tang, Weihua Chen, Fan Wang, Yong Liu
· 2026
Representation Forcing for Bottleneck-Free Unified Multimodal Models
Yuqing Wang, Zhijie Lin, Ceyuan Yang, Yang Zhao, Fei Xiao, Hao He, Qi Zhao, Zihan Ding, Fuyun Wang, Shuai Wang, Youliang Zhang, Haoqi Fan, Xihui Liu
· 2026
How can embedding models bind concepts?
Arnas Uselis, Darina Koishigarina, Seong Joon Oh
· 2026
A Clinically Validated Foundation Model for Comprehensive Lung Pathology Interpretation
Zhengrui Guo, Zhengyu Zhang, Jiabo Ma, Yihui Wang, Fengtao Zhou, Yingxue Xu, Ling Liang, Chenglong Zhao, Qi Xie, Jinbang Li, Shujing Guo, Fangyi Han, Zhijian Cen, Ziyi Liu, Cheng Jin, Junlin Hou, Zhixuan Chen, Yu Cai, Lijuan Qu, Shifu Chen, Yueping Liu, Zhe Wang, Xiuming Zhang, Muyan Cai, Li Liang, Hao Chen
· 2026
SwInception -- Local Attention Meets Convolutions
David Hagerman, Roman Naeem, Jakob Lindqvist, Carl Lindström, Fredrik Kahl, Lennart Svensson
· 2026
Genetically Aligned Patient Representations Improve Hematological Diagnosis
Muhammed Furkan Dasdelen, Fatih Ozlugedik, Ilaria Looser, Rao Muhammad Umer, Christian Pohlkamp, Carsten Marr
· 2026
NeuROK: Generative 4D Neural Object Kinematics
Chen Geng, Guangzhao He, Yue Gao, Yunzhi Zhang, Shangzhe Wu, Jiajun Wu
· 2026
VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
Hidir Yesiltepe, Jiazhen Hu, Tuna Han Salih Meral, Adil Kaan Akan, Kaan Oktay, Hoda Eldardiry, Pinar Yanardag
· 2026
Large Depth Completion Model from Sparse Observations
Zhu Yu, Zhengyi Zhao, Runmin Zhang, Lingteng Qiu, Kejie Qiu, Yisheng He, Siyu Zhu, Zilong Dong, Si-Yuan Cao, Hui-Liang Shen
· 2026
SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
Zhuguanyu Wu, Ruihao Gong, Yang Yong, Yushi Huang, Xiangyu Fan, Lei Yang, Dahua Lin, Xianglong Liu
· 2026
CCS: Clinical Consensus Selection for Radiology Report Generation
Xi Zhang, Yingshu Li, Zaiqiao Meng, Jake Lever, Edmond S. L. Ho
· 2026
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
Ziwen Xu, Haiwen Hong, Linsong Yu, Benglei Cui, Longtao Huang, Hui Xue, Ningyu Zhang
· 2026
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
Min Zhao, Hongzhou Zhu, Bokai Yan, Zihan Zhou, Yimin Chen, Wenqiang Sun, Kaiwen Zheng, Guande He, Xiao Yang, Chongxuan Li, Fan Bao, Jun Zhu
· 2026
LoMo: Local Modality Substitution for Deeper Vision-Language Fusion
Feng Han, Zhixiong Zhang, Zheming Liang, Yibin Wang, Jiaqi Wang
· 2026
Deep Learning Strain Estimation: Is Physics-Based Simulation the Solution?
Thierry Judge, Nicolas Duchateau, Andreas Østvik, Khuram Faraz, Anders Austlid Taskén, Sigve Karlsen, Thor Edvardsen, Harald Brunvand, Md Abulkalam Azad, Havard Dalen, Bjørnar Grenne, Gabriel Kiss, Pierre-Yves Courand, Lasse Lovstakken, Pierre-Marc Jodoin, Olivier Bernard
· 2026
Janus-LoRA: A Balanced Low-Rank Adaptation for Continual Learning
Cheng Chen, Pengpeng Zeng, Yuyu Guo, Lianli Gao, Hengtao Shen, Jingkuan Song
· 2026
GEM: Generative Supervision Helps Embodied Intelligence
Ruowen Zhao, Bangguo Li, Zuyan Liu, Yinan Liang, Junliang Ye, Fangfu Liu, Diankun Wu, Zhengyi Wang, Xumin Yu, Yongming Rao, Han Hu, Jun Zhu
· 2026
7 Tesla Quantitative MRI and Machine Learning for Exploratory Motor Subtype Stratification and Diagnosis in Parkinson's Disease
Anne Louise Kristoffersen, Runa Geirmundsdatter Unsgård, Marc-Antoine Fortin, Ingrid Gylterud Kvålsgard, Kjersti Eline Stige, Thanh Pierre Doan, Erik Magnus Berntsen, Charalampos Tzoulis, Pål Erik Goa
· 2026
LV-OSD: Language-Vision-Complementary Open-Set Object Detection
Yupeng Zhang, Ruize Han, Wei Feng, Song Wang, Liang Wan
· 2026
EchoAvatar: Real-time Generative Avatar Animation from Audio Streams
Bohong Chen, Yumeng Li, Yinglin Xu, Youyi Zheng, Yanlin Weng, Kun Zhou
· 2026
No Safe Dose: How Training Data Drives Unsafe Image Generation
Felix Friedrich, Lukas Helff, Niharika Hegde, Patrick Schramowski, Kristian Kersting
· 2026
A novel ordinal multi-view aggregation scheme for oak defoliation
Francisco Bérchez-Moreno, Ricardo Enrique Hernández-Lambraño, David Guijo-Rubio, Víctor Manuel Vargas, Francisco José Ruiz-Gómez, Juan Carlos Fernández, Pablo González-Moreno
· 2026
CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning
Zihan Lin, Songhe Deng, Shuwei He, Danxiang Zhu, Dan Zhang, Yishu Lei, Xianlong Luo, Shikun Feng, Rui Liu
· 2026
ChartAct: A Benchmark for Dynamic Chart Understanding
Muye Huang, Wu Lin, Lingling Zhang, Hang Yan, Zhiyuan Wang, Yumeng Fu, Zesheng Yang, Jun Liu
· 2026
SAM3-Assisted Training of Lightweight YOLO Models for Precision Pig Farming
Marcos Vinicius Mendes Faria, Thiago Borges Pereira, Isabella C. F. S. Condotta, Thiago Meireles Paixão, Francisco de Assis Boldt
· 2026
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation
Kaining Ying, Hengrui Hu, Siyu Ren, Jiamu Li, Fengjiao Chen, Ziwen Wang, Xuezhi Cao, Xunliang Cai, Henghui Ding
· 2026
F-RNG: Feed-Forward Relightable Neural Gaussians
Guangming Fu, Jiahui Fan, Jian Yang, Miloš Hašan, Beibei Wang
· 2026
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence
Xiang An, Yin Xie, Feilong Tang, Yunyao Yan, Huajie Tan, Didi Zhu, Changrui Chen, Xiuwei Zhao, Bin Qin, Kaicheng Yang, Yifei Shen, Yuanhan Zhang, Kaichen Zhang, Wenkang Zhang, Zheng Cheng, Nansen Zhang, Chunsheng Wu, Chunjiang Ge, Zimin Ran, Dehua Song, Chunyuan Li, Shikun Feng, Ming Hu, Zhangquan Chen, Junbo Niu, Bo Li, Ziyong Feng, Ziwei Liu, Zongyuan Ge, Jiankang Deng
· 2026
Astronomical Image Data Reduction for Moving Object Detection
Kevin Allekotte, Pablo De Cristóforis, Mario Melita, Marta Mejail
· 2013
Dynamic MRI Reconstruction Via Dual Deep Priors and Low-Rank Plus Sparse Modeling
Yongliang Sun, Siddhant Gautam, Chaoyan Huang, Nicole Seiberlich, Ismail Alkhouri, Saiprasad Ravishankar
· 2026
General Hazard Detection
Stephanie Ng, CP Lim, SueJen Looi, Hendrik Zurlinden, David Nguyen, Lei Wei, Saeid Nahavandi, Hailing Zhou
· 2026
Enhancing Blood Cells Classification using Hybrid Quantum Neural Networks
Guilherme Cruz, Nouhaila Innan, Alberto Marchisio, Gabriel Falcao, Muhammad Shafique
· 2026
GFSR: Geometric Fidelity and Spatial Refinement for Reliable Lane Detection
Tiancheng Wang, Zhaolu Ding, Richeng Xu, Tianhui Zheng, Hui Liu, Hanyu Xuan, Zhiliang Wu, Guanghui Yue
· 2026
SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models
Zizhao Tong, Hongfeng Lai, Zeqing Wang, Zhaohu Xing, Kexu Cheng, Haoran Xu, Zhao Pu, Shangwen Zhu, Ruili Feng, Jian Zhao, Yan Zhang, Hao Tang, Yeying Jin, Ling Shao
· 2026
Robustness of breast lesion segmentation under MRI undersampling improves with k-space-aware deep learning
Lukas T. Rotkopf, Marco Schlimbach, Julius C. Holzschuh, Heinz-Peter Schlemmer, Jens Kleesiek, Moritz Rempe
· 2026
3D LULC classification using multispectral LiDAR and deep learning: current and prospective schemes
Narges Takhtkeshha, Aldino Rizaldy, Markus Hollaus, Juha Hyyppä, Fabio Remondino, Gottfried Mandlburger
· 2026
MotiMotion: Motion-Controlled Video Generation with Visual Reasoning
Lee Hsin-Ying, Hanwen Jiang, Yiqun Mei, Jing Shi, Ming-Hsuan Yang, Zhixin Shu
· 2026
Cambrian-P: Pose-Grounded Video Understanding
Jihan Yang, Zifan Zhao, Xichen Pan, Shusheng Yang, Junyi Zhang, Bingyi Kang, Hu Xu, Saining Xie
· 2026
EchoSR: Efficient Context Harnessing for Lightweight Image Super-Resolution
Hanli Zhao, Binhao Wang, Shihao Zhao, Tao Wang, Kaihao Zhang, Wanglong Lu
· 2026
See Silhouettes in Motion with Neuromorphic Vision
Pei Zhang, Shijie Lin, Zhou Ge, Jinpeng Chen, Wei Pu
· 2026
SdcNet for object recognition
Yunlong Ma, Chunyan Wang
· 2022
Learning Normalized Energy Models for Linear Inverse Problems
Nicolas Zilberstein, Santiago Segarra, Eero Simoncelli, Florentin Guth
· 2026
Dynamic resolution switching for live streaming
Xin Xiong, Yixu Chen, Hai Wei, Yongjun Wu, Sriram Sethuraman
· 2026
NeuroQA: A Large-Scale Image-Grounded Benchmark for 3D Brain MRI Understanding
Mohammad H. Abbasi, Favour Nerrise, Shaurnav Ghosh, Ridvan Yesiloglu, Yuncong Mao, Bailey Trang, Mohammad Asadi, Merryn Daniel, Gustavo Chau Loo Kung, Ken Chang, Pavan Pinkesh Shah, Adam Turnbull, Kyan Younes, Seena Dehkharghani, Ehsan Adeli
· 2026
Time-varying rPPG signal separation via block-sparse signal model
Kosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto
· 2026
Computer Vision Based Object Detection and Recognition System for Image Searching
Tanvir Ahamed Nayeem, S M Motaharuzzaman, Anika Tabassum Hoque, Md. Habibur Rahman
· 2022
Automatic Discovery of Disease Subgroups by Contrasting with Healthy Controls
Robin Louiset, Edouard Duchesnay, Benoit Dufumier, Antoine Grigis, Pietro Gori
· 2026
Deformba: Vision State Space Model with Adaptive State Fusion
Hongyu Ke, Jack Morris, Yongkang Liu, Satoshi Kitai, Kentaro Oguchi, Yi Ding, Haoxin Wang
· 2026
Object detection based on spatiotemporal background models
Satoshi Yoshinaga, Atsushi Shimada, Hajime Nagahara, Rin-ichiro Taniguchi
· 2014
FGSVQA: Frequency-Guided Short-form Video Quality Assessment
Xinyi Wang, Angeliki Katsenou, Junxiao Shen, David Bull
· 2026
Probability-Conserving Flow Guidance
Parsa Esmati, Junha Hyung, Amirhossein Dadashzadeh, Jaegul Choo, Majid Mirmehdi
· 2026
A framework for abandoned object detection from video surveillance
Rajesh Kumar Tripathi, Anand Singh Jalal, Charul Bhatnagar
· 2013
Physics-in-the-Loop: A Hybrid Agentic Architecture for Validated CAD Engineering Design
Elias Berger, Muhammad Usama, Jan Mehlstäubl, Bernhard Saske, Kristin Paetzold-Byhain
· 2026
Efficient Long-Context Modeling in Diffusion Language Models via Block Approximate Sparse Attention
Wenhu Zhang, Yiming Wu, Huanyu Wang, Yaoyang Liu, Huanzhang Dou, Senqiao Yang, Sitong Wu, Hanbin Zhao, Jiaya Jia
· 2026
Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence
Zebin He, Mingxin Yang, Shuhui Yang, Hanxiao Sun, Xintong Han, Chunchao Guo, Wenhan Luo
· 2026
When Preference Labels Fall Short: Aligning Diffusion Models from Real Data
Weiyan Chen, Weijian Deng, Yao Xiao, Weijie Tu, ZiYi Dong, Ibrahim Radwan, Liang Lin, Pengxu Wei
· 2026
A Framework for Evaluating Zero-Shot Image Generation in Concept-based Explainability
Giacomo Astolfi, Matteo Bianchi, Riccardo Campi, Antonio De Santis, Marco Brambilla
· 2026
An object detection and recognition system for weld bead extraction from digital radiographs
Marcelo Kleber Felisberto, Heitor Silvério Lopes, Tania Mezzadri Centeno, Lúcia Valéria Ramos de Arruda
· 2006
Aurora: Unified Video Editing with a Tool-Using Agent
Yongsheng Yu, Ziyun Zeng, Zhiyuan Xiao, Zhenghong Zhou, Hang Hua, Wei Xiong, Jiebo Luo
· 2026
WavFlow: Audio Generation in Waveform Space
Feiyan Zhou, Luyuan Wang, Shoufa Chen, Zhe Wang, Zhiheng Liu, Yuren Cong, Xiaohui Zhang, Fanny Yang, Belinda Zeng
· 2026
CATA: Continual Machine Unlearning via Conflict-Averse Task Arithmetic
Shen Lin, Junhao Dong, Rongjie Chen, Xiaoyu Zhang, Li Xu, Xiaofeng Chen
· 2026
SPIKE: An Adaptive Dual Controller Framework for Cost-Efficient Long-Horizon Game Agents
Wencan Jiang, Jiangning Zhang, Jianbiao Mei, Jinzhuo Liu, Yu Yang, Xiaobin Hu, Zhucun Xue, Yong Liu, Dacheng Tao
· 2026
Object Recognition
Arcangelo Distante, Cosimo Distante
· 2020
SENSE: Satellite-based ENergy Synthesis for Sustainable Environment
Kailai Sun, Mingyi He, Heye Huang, Can Rong, Alok Prakash, Baoshen Guo, Shenhao Wang, Jinhua Zhao
· 2026
DanceHMR: Hand-Aware Whole-Body Human Mesh Recovery from Monocular Videos
Wenhao Shen, Ming Zhou, Hengyuan Zhang, Siyuan Bian, Youjiang Xu, Xi Lin
· 2026
TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning
ZhiYuan Feng, Yu Deng, Ruichuan An, Zhenhua Liu, Qixiu Li, Keming Wu, Zhiying Du, Weijie Wang, Haoxiao Wang, Shuang Chen, Sicheng Xu, Yaobo Liang, Jiaolong Yang, Baining Guo
· 2026
Spatial Competition for Low-Complexity Learned Image Compression
Théophile Blard, Pierrick Philippe, Théo Ladune, Xiaoran Jiang, Olivier Déforges
· 2026
Learning to Optimize Radiotherapy Plans via Fluence Maps Diffusion Model Generation and LSTM-based Optimization
Isabella Poles, Simon Arberet, Riqiang Gao, Martin Kraus, Marco D. Santambrogio, Florin C. Ghesu, Ali Kamen, Dorin Comaniciu
· 2026
WorldVLN: Autoregressive World Action Model for Aerial Vision-Language Navigation
Baining Zhao, Jiacheng Xu, Weicheng Feng, Xin Zhang, Zhaolu Wang, Haoyang Wang, Shilong Ji, Ziyou Wang, Jianjie Fang, Zhiheng Zheng, Weichen Zhang, Yu Shang, Wei Wu, Chen Gao, Xinlei Chen, Yong Li
· 2026
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
Xiaoxuan He, Siming Fu, Zeyue Xue, Weijie Wang, Ruizhe He, Yuming Li, Dacheng Yin, Shuai Dong, Haoyang Huang, Hongfa Wang, Nan Duan, Bohan Zhuang
· 2026
Object recognition using discriminative parts
Ying-Ho Liu, Anthony J.T. Lee, Fu Chang
· 2012
3D Segmentation Using Viewpoint-Dependent Spatial Relationships
Ayaka Nanri, Klara Reichard, Mert Kiray, Federico Tombari, Benjamin Busam, Asako Kanezaki
· 2026
The Velocity Deficit: Initial Energy Injection for Flow Matching
Linze Li, Zong-Wei Hong, Shen Zhang, Bo Lin, Jinglun Li, Yao Tang, Jiajun Liang
· 2026
HDRFace: Rethinking Face Restoration with High-Dimensional Representation
Zirui Wang, Xianhui Lin, Yi Dong, Bo Wei, Gangjian Zhang, Siteng Ma, Zebiao Zheng, Xing Liu, Hong Gu, Minjing Dong
· 2026
Learning Direct Control Policies with Flow Matching for Autonomous Driving
Marcello Ceresini, Federico Pirazzoli, Andrea Bertogalli, Lorenzo Cipelli, Filippo D'Addeo, Anthony Dell'Eva, Alessandro Paolo Capasso, Alberto Broggi
· 2026
Holistic object detection and image understanding
Gonzalo Vaca-Castano, Niels DaVitoria Lobo, Mubarak Shah
· 2019
Splitting the Contour of a Graphic Object's Image into Fragments in Classification Problems
Титов Алексей Иванович, Корсунов Николай Иванович, Щербинина Наталья Владимировна
· 2025
8 more articles in subsections