欢迎光临散文网 会员登陆 & 注册

2023.04.04 ArXiv精选

2023-04-06 09:31 作者:PaperABC  | 我要投稿
  • 关注领域

    • AIGC

    • 3D computer vision learning

    • Fine-grained learning

    • GNN

    • 其他

  • 声明

    • 论文较多,时间有限,本专栏无法做文章的讲解,只挑选出符合PaperABC研究兴趣和当前热点问题相关的论文,如果你的research topic和上述内容有关,那本专栏可作为你的论文更新源或Paper reading list.

Paper list:

今日ArXiv共更新151


AIGC

Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos

https://arxiv.org/pdf/2304.01186.pdf

腾讯的工作,视频领域的controlNet.


Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

https://arxiv.org/pdf/2304.00964.pdf

文本驱动的图像编辑方法.


DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models

https://arxiv.org/pdf/2304.00916.pdf

太卷了,太卷了,自己看吧,朋友们。



VLP

Vision-Language Models for Vision Tasks: A Survey

https://arxiv.org/pdf/2304.00685.pdf

视觉语言预训练模型的最新综述



RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

https://arxiv.org/pdf/2304.00962.pdf

区域级别的点云和文本的对比学习,主要用于open-world的3D场景理解。


AirLoc: Object-based Indoor Relocalization

https://arxiv.org/pdf/2304.00954.pdf

CMU的室内场景定位工作.


Multi-Modal Representation Learning with Text-Driven Soft Masks

https://arxiv.org/pdf/2304.00719.pdf


2023.04.04 ArXiv精选的评论 (共 条)

分享到微博请遵守国家法律