2023.04.04 ArXiv Picks
Focus areas:
AIGC
3D computer vision learning
Fine-grained learning
GNN
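Other
Disclaimer
There are many papers and limited time, so this column cannot walk through each one. It only selects papers that match PaperABC's research interests or relate to current hot topics. If your research topic falls under the areas above, this column can serve as your paper update feed or paper reading list.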

Paper list:
ArXiv posted 151 new papers today.
AIGC
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos
https://arxiv.org/pdf/2304.01186.pdf

Work from Tencent; a ControlNet for the video domain.
Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP
https://arxiv.org/pdf/2304.00964.pdf

A text-driven image editing method.
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models
https://arxiv.org/pdf/2304.00916.pdf

This area is getting fiercely competitive; take a look for yourselves, friends.
VLP
Vision-Language Models for Vision Tasks: A Survey
https://arxiv.org/pdf/2304.00685.pdf

A recent survey of vision-language pre-trained models.
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
https://arxiv.org/pdf/2304.00962.pdf

Region-level contrastive learning between point clouds and text, aimed mainly at open-world 3D scene understanding.
AirLoc: Object-based Indoor Relocalization
https://arxiv.org/pdf/2304.00954.pdf

Indoor scene relocalization work from CMU.
Multi-Modal Representation Learning with Text-Driven Soft Masks
https://arxiv.org/pdf/2304.00719.pdf
