Springer Science and Business Media LLC

Công bố khoa học tiêu biểu

* Dữ liệu chỉ mang tính chất tham khảo

Sắp xếp:  
Self-attention Guidance Based Crowd Localization and Counting
Springer Science and Business Media LLC - - 2024
Zhouzhou Ma, Guanghua Gu, Wenrui Zhao
Most existing studies on crowd analysis are limited to the level of counting, which cannot provide the exact location of individuals. This paper proposes a self-attention guidance based crowd localization and counting network (SA-CLCN), which can simultaneously locate and count crowds. We take the form of object detection, using the original point annotations of crowd datasets as supervision to tr...... hiện toàn bộ
Robust Local Light Field Synthesis via Occlusion-aware Sampling and Deep Visual Feature Fusion
Springer Science and Business Media LLC - - 2023
Wenpeng Xing, Jie Chen, Yike Guo
AbstractNovel view synthesis has attracted tremendous research attention recently for its applications in virtual reality and immersive telepresence. Rendering a locally immersive light field (LF) based on arbitrary large baseline RGB references is a challenging problem that lacks efficient solutions with existing novel view synthesis techniques. In this work, we a...... hiện toàn bộ
Multimodal Pretraining from Monolingual to Multilingual
Springer Science and Business Media LLC - Tập 20 - Trang 220-232 - 2023
Liang Zhang, Ludan Ruan, Anwen Hu, Qin Jin
Multimodal pretraining has made convincing achievements in various downstream tasks in recent years. However, since the majority of the existing works construct models based on English, their applications are limited by language. In this work, we address this issue by developing models with multimodal and multilingual capabilities. We explore two types of methods to extend multimodal pretraining m...... hiện toàn bộ
Stability and Generalization of Hypergraph Collaborative Networks
Springer Science and Business Media LLC - Tập 21 - Trang 184-196 - 2024
Michael K. Ng, Hanrui Wu, Andy Yip
Graph neural networks have been shown to be very effective in utilizing pairwise relationships across samples. Recently, there have been several successful proposals to generalize graph neural networks to hypergraph neural networks to exploit more complex relationships. In particular, the hypergraph collaborative networks yield superior results compared to other hypergraph neural networks for vari...... hiện toàn bộ
An Empirical Study on Google Research Football Multi-agent Scenarios
Springer Science and Business Media LLC - - Trang 1-22 - 2024
Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang
Few multi-agent reinforcement learning (MARL) researches on Google research football (GRF) focus on the 11-vs-11 multi-agent full-game scenario and to the best of our knowledge, no open benchmark on this scenario has been released to the public. In this work, we fill the gap by providing a population-based MARL training pipeline and hyperparameter settings on multi-agent football scenario that out...... hiện toàn bộ
Chia sẻ trọng số trong các lớp nông thông qua các phép tích chập tương đương nhóm quay Dịch bởi AI
Springer Science and Business Media LLC - Tập 19 - Trang 115-126 - 2022
Zhiqiang Chen, Ting-Bing Xu, Jinpeng Li, Huiguang He
Phép toán tích chập có đặc tính equivariance nhóm dịch chuyển. Để đạt được nhiều tính chất equivariance nhóm hơn, các phép tích chập tương đương nhóm quay (RGEC) được đề xuất nhằm đạt được cả tính chất equivariance nhóm dịch chuyển và quay. Tuy nhiên, các công trình trước đó tập trung nhiều hơn vào số lượng tham số mà thường bỏ qua các chi phí tài nguyên khác. Trong bài báo này, chúng tôi xây dựng...... hiện toàn bộ
#RGEC #chia sẻ trọng số #tính trực giao #mạng nơron sâu #phép tích chập nhóm quay
Machine Learning for Cataract Classification/Grading on Ophthalmic Imaging Modalities: A Survey
Springer Science and Business Media LLC - Tập 19 - Trang 184-208 - 2022
Xiao-Qing Zhang, Yan Hu, Zun-Jie Xiao, Jian-Sheng Fang, Risa Higashita, Jiang Liu
Cataracts are the leading cause of visual impairment and blindness globally. Over the years, researchers have achieved significant progress in developing state-of-the-art machine learning techniques for automatic cataract classification and grading, aiming to prevent cataracts early and improve clinicians’ diagnosis efficiency. This survey provides a comprehensive survey of recent advances in mach...... hiện toàn bộ
ECG Biometrics via Enhanced Correlation and Semantic-rich Embedding
Springer Science and Business Media LLC - Tập 20 - Trang 697-706 - 2023
Kui-Kui Wang, Gong-Ping Yang, Lu Yang, Yu-Wen Huang, Yi-Long Yin
Electrocardiogram (ECG) biometric recognition has gained considerable attention, and various methods have been proposed to facilitate its development. However, one limitation is that the diversity of ECG signals affects the recognition performance. To address this issue, in this paper, we propose a novel ECG biometrics framework based on enhanced correlation and semantic-rich embedding. Firstly, w...... hiện toàn bộ
Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis
Springer Science and Business Media LLC - Tập 20 - Trang 822-836 - 2023
Kai Zhang, Yawei Li, Jingyun Liang, Jiezhang Cao, Yulun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, Luc Van Gool
While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved. In this paper, we attempt to solve this problem f...... hiện toàn bộ
Towards Jumping Skill Learning by Target-guided Policy Optimization for Quadruped Robots
Springer Science and Business Media LLC - - 2024
Chi Zhang, Wei Zou, Ningbo Cheng, Shuomo Zhang
Endowing quadruped robots with the skill to forward jump is conducive to making it overcome barriers and pass through complex terrains. In this paper, a model-free control architecture with target-guided policy optimization and deep reinforcement learning (DRL) for quadruped robot jumping is presented. First, the jumping phase is divided into take-off and flight-landing phases, and optimal strateg...... hiện toàn bộ
Tổng số: 94   
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 10