2023.04.04 ArXiv Picks

2023-04-06 09:31 Author: PaperABC

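There are many papers and time is limited, so this column cannot walk through each one. It only picks out papers that match PaperABC's research interests or relate to current hot topics. If your research topic is related to these, this column can serve as your paper update feed or paper reading list.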

Paper list:

ArXiv posted 151 new papers today.

Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos

https://arxiv.org/pdf/2304.01186.pdf

Work from Tencent: a ControlNet for the video domain.

Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

https://arxiv.org/pdf/2304.00964.pdf

A text-driven image editing method.

DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

https://arxiv.org/pdf/2304.00916.pdf

The competition in this area is just too intense. Take a look for yourselves, friends.

Vision-Language Models for Vision Tasks: A Survey

https://arxiv.org/pdf/2304.00685.pdf

The latest survey of vision-language pre-trained models.

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

https://arxiv.org/pdf/2304.00962.pdf

Region-level contrastive learning between point clouds and text, aimed mainly at open-world 3D scene understanding.

AirLoc: Object-based Indoor Relocalization

https://arxiv.org/pdf/2304.00954.pdf

Indoor scene localization work from CMU.

Multi-Modal Representation Learning with Text-Driven Soft Masks

https://arxiv.org/pdf/2304.00719.pdf
