WebHowever, whether it is possible to build a generic MLP-Like architecture on video domain has not been explored, due to complex spatial-temporal modeling with large computation burden. To fill this gap, we present an efficient self-attention free backbone, namely MorphMLP, which flexibly leverages the concise Fully-Connected ...
ECVA European Computer Vision Association
WebMorphmlp: A self-attention free, mlp-like backbone for image and video. DJ Zhang, K Li, Y Chen, Y Wang, S Chandra, Y Qiao, L Liu, MZ Shou. European Conference on Computer Vision (ECCV), 2024. 17 * 2024: Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition. WebMC-MLP is introduced, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers that is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from … phenix orleans
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal ...
WebNov 1, 2024 · MorphMLP-B only uses 43% GFLOPs of MViT-B but achieves 2.4% top-1 improvement on SSV2, even though MorphMLP-B is pretrained on ImageNet1K while … WebIn this paper, we take a step further to extend our MorphMLP from image to video. To our best knowledge, this is the first self-attention free, MLP-Like backbone architecture in the … WebOur MorphMLP paper was accepted to ECCV 2024!. !. We current release the code and models for: Kintics-400. Something-Something V1. Something-Something V2. ImageNet … phenix oil coventry