About me

虚步, a Ph.D. student at BUPT. He is very fortunate to be advised by his advisor. His research is in the area of Vision, Language, and Reasoning, with a focus on Visual Dialogue. He is particularly interested in building a visually-grounded conversational AI (social robot) that can see the world and talk with us in natural language. Other interests include Visual/Language Grounding, Visual Reasoning, Visual Question Generation, and Visually-grounded Referring Expression.
Now I've been working on the GuessWhich, Visual Dialog(VisDial) and Talking-to-Videos(Video-Grounded Dialogue) task, please feel free to contact me with pangweitf@bupt.edu.cn or pangweitf@163.com if you have any questions or concerns.

Visual Dialog (VisDial) task

Visual Dialog needs an AI agent to chat with humans in natural, conversational language about visual content. Specifically, given a specific image, a dialog history, and a follow-up question about the image, the task for the AI agent is to answer the question in free-form natural language.

Paper

Performance

Training

References

https://visualdialog.org/
Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M.F. Moura, Devi Parikh, Dhruv Batra. Visual Dialog. In CVPR 2017.
Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M.F. Moura, Devi Parikh, Dhruv Batra. Visual Dialog: Supplementary Document. In CVPR 2017.
...

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About me

Visual Dialog (VisDial) task

Paper

Performance

Training

References

About

Releases

Packages

afarmer2005/Visual-Dialog-VisDial-

Folders and files

Latest commit

History

Repository files navigation

About me

Visual Dialog (VisDial) task

Paper

Performance

Training

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages