Publications

Published September 22, 2022

DocLayoutTTS: Dataset and Baselines for Layout-informed Document-level Neural Speech Synthesis

Interspeech 2022

Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad Morariu, Rajiv Jain, Dinesh Manocha
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published August 8, 2022

Neural Jacobian Fields: Learning Intrinsic Mappings of Arbitrary Meshes

SIGGRAPH

Noam Aigerman, Kunal Gupta, Vladimir (Vova) Kim, Siddhartha Chaudhuri, Jun Saito, Thibault Groueix
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Graphics (2D & 3D)

Published July 15, 2022

DynamicToC: Persona-based Table of Contents for Consumption of Long Documents

North American Chapter of the Association for Computational Linguistics (NAACL)

Himanshu Maheshwari, Nethraa Shivakumar, Shelly Jain, Tanvi Karandikar, Navita Goyal, Vinay Aggarwal, Sumit Shekhar
  • Document Intelligence
  • Natural Language Processing

Published July 15, 2022

Joint Extraction of Entities, Relations, and Events via Modeling Inter-Instance and Inter-Label Dependencies

NAACL 2022

Minh Van Nguyen, Bonan Min, Franck Dernoncourt, Thien Huu Nguyen
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published July 15, 2022

DocTime: A Document-level Temporal Dependency Graph Parser

NAACL 2022

Puneet Mathur, Vlad Morariu, Verena Kaynig-Fittkau, Jiuxiang Gu, Franck Dernoncourt, Quan Hung Tran, Ani Nenkova, Dinesh Manocha, Rajiv Jain
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published July 15, 2022

MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection

NAACL 2022

Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published July 15, 2022

Event Detection for Suicide Understanding

Findings of NAACL 2022

Luis Fernando Guzman-Nateras, Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
  • AI & Machine Learning
  • Document Intelligence
  • Natural Language Processing

Published July 15, 2022

SemEval 2022 Task 12: Symlink – Linking Mathematical Symbols to their Descriptions

SemEval 2022

Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
  • AI & Machine Learning
  • Document Intelligence

Published July 11, 2022

Fine-grained Image Captioning with CLIP Reward

Findings of NAACL 2022

Jaemin Cho, David Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Natural Language Processing

Published July 11, 2022

Video-based Multimodal Intent Discovery

Findings of NAACL 2022

Adyasha Maharana, Quan Hung Tran, David Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Walter Chang, Mohit Bansal
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Natural Language Processing

Published July 11, 2022

Curriculum Learning for Dense Retrieval Distillation

Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

Hansi Zeng, Hamed Zamani, Vishwa Vinay
  • AI & Machine Learning
  • Content Intelligence

Published July 11, 2022

Offline Evaluation of Ranked Lists using Parametric Estimation of Propensities

Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval

Vishwa Vinay, Manoj Kilaru, David Arbour
  • AI & Machine Learning

Published June 24, 2022

GAN-Supervised Dense Visual Alignment (GANgealing)

Computer Vision and Pattern Recognition (CVPR'22)

William Peebles, Jun-Yan Zhu, Richard Zhang, Antonio Torralba, Alexei Efros, Eli Shechtman
  • AI & Machine Learning
  • Computer Vision, Imaging & Video

Published June 23, 2022

Ensembling Off-the-shelf Models for GAN Training

Computer Vision and Pattern Recognition (CVPR'22)

Nupur Kumari, Richard Zhang, Eli Shechtman, Jun-Yan Zhu
  • AI & Machine Learning
  • Computer Vision, Imaging & Video

Published June 22, 2022

StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation

Computer Vision and Pattern Recognition (CVPR'22)

Roy Or-El, Xuan Luo, Mengyi Shan, Eli Shechtman, Jeong Joon Park, Ira Kemelmacher-Shlizerman
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Graphics (2D & 3D)

Published June 20, 2022

OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis

IEEE CVPR Workshop on Fair, Data Efficient and Trusted Computer Vision

Sumit Shekhar, Bhanu Prakash Reddy Guda, Ashutosh Chaubey, Ishan Jindal, Avneet Jain
  • Computer Vision, Imaging & Video
  • Document Intelligence

Published June 19, 2022

InsetGAN for Full-Body Image Generation

CVPR 2022

Anna Frühstück, Krishna Kumar Singh, Eli Shechtman, Niloy J. Mitra, Peter Wonka, Jingwan (Cynthia) Lu
  • AI & Machine Learning
  • Computer Vision, Imaging & Video

Published June 19, 2022

Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera

CVPR 2022

Jae Shin Yoon, Duygu Ceylan, Tuanfeng Y. Wang, Jingwan (Cynthia) Lu, Jimei Yang, Zhixin Shu, Hyun Soo Park
  • AI & Machine Learning
  • Computer Vision, Imaging & Video

Published June 19, 2022

Inverting and Editing Real Images with Spatially Varying and Automatic Latent Selection

CVPR 2022

Gaurav Parmar, Yijun Li, Jingwan (Cynthia) Lu, Richard Zhang, Jun-Yan Zhu, Krishna Kumar Singh
  • AI & Machine Learning
  • Computer Vision, Imaging & Video

Published June 19, 2022

GLASS: Geometric Latent Augmentation for Shape Spaces

CVPR

Sanjeev Muralikrishnan, Siddhartha Chaudhuri, Noam Aigerman, Vladimir (Vova) Kim, Matt Fisher, Niloy J. Mitra
  • AI & Machine Learning
  • Computer Vision, Imaging & Video
  • Graphics (2D & 3D)