
Online forms are widely used to collect data from human and have a multi-billion market. Many software products provide online services for creating semi-structured forms where questions and descriptions are organized by pre-defined structures. However, the design and creation process of forms is still tedious and requires expert knowledge. To assist form designers, in this work we present FormLM to model online forms (by enhancing pre-trained language model with form structural information) and recommend form creation ideas (including question / options recommendations and block type suggestion). For model training and evaluation, we collect the first public online form dataset with 62K online forms. Experiment results show that FormLM significantly outperforms general-purpose language models on all tasks, with an improvement by 4.71 on Question Recommendation and 10.6 on Block Type Suggestion in terms of ROUGE-1 and Macro-F1, respectively.
17 pages, EMNLP 2022 Main Conference
FOS: Computer and information sciences, Artificial intelligence, Web Data Extraction, Geometry, Data science, Web Data Extraction and Crawling Techniques, Artificial Intelligence, FOS: Mathematics, Information retrieval, Macro, Computer Science - Computation and Language, Natural language processing, Statistical Machine Translation and Natural Language Processing, Computer science, Language Modeling, Process (computing), Programming language, World Wide Web, Computer Science, Physical Sciences, Computation and Language (cs.CL), Semantic Web and Ontology Development, Block (permutation group theory), Software, Mathematics, Information Systems
FOS: Computer and information sciences, Artificial intelligence, Web Data Extraction, Geometry, Data science, Web Data Extraction and Crawling Techniques, Artificial Intelligence, FOS: Mathematics, Information retrieval, Macro, Computer Science - Computation and Language, Natural language processing, Statistical Machine Translation and Natural Language Processing, Computer science, Language Modeling, Process (computing), Programming language, World Wide Web, Computer Science, Physical Sciences, Computation and Language (cs.CL), Semantic Web and Ontology Development, Block (permutation group theory), Software, Mathematics, Information Systems
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
