Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models

Name: Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models
Keywords: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Formal Languages and Automata Theory (cs.FL), Computer Science - Formal Languages and Automata Theory, Computation and Language (cs.CL), Machine Learning (cs.LG)

Tristan Vanderbruggen; Chunhua Liao; Peter Pirkelbauer; Pei-Hung Lin

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2023

Data sources: arXiv.org e-Print Archive

https://dx.doi.org/10.48550/ar...

Article . 2023

License: CC BY

Data sources: Datacite

DBLP

Article

Data sources: DBLP

Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2023Embargo end date: 01 Jan 2023Publisher:arXivJournal:CoRR, volume abs/2306.10196

Authors: Tristan Vanderbruggen; Chunhua Liao; Peter Pirkelbauer; Pei-Hung Lin;

doi: 10.48550/arxiv.2306.10196

arXiv: 2306.10196

Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

In recent months, Language Models (LMs) have become a part of daily discourse, with focus on OpenAI and the potential of Artificial General Intelligence (AGI). Furthermore, the leaking of LLama's weights to the public has led to an influx of innovations demonstrating the impressive capabilities of generative LMs. While we believe that AGI is still a distant goal, we recognize the potential of LMs in solving tasks such as searching complex documents, compiling reports with basic analysis, and providing assistance in problem-solving. In this paper, we propose formalizing the execution model of language models. We investigate current execution models, to find that this formalism has received little attention, and present our contribution: the first formalized execution model for LMs. We introduce a new algorithm for sampling the predictions of LMs, which we use to build a reliable and inspectable execution model. We introduce a low-level language to write "cognitive program" for this execution model. We hope to shed light on the need for execution models for LMs and encourage further research in this area.

Submitted to CGO-24

Related Organizations

Lawrence Berkeley National Laboratory
United States
Lawrence Livermore National Laboratory
United States

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Formal Languages and Automata Theory (cs.FL), Computer Science - Formal Languages and Automata Theory, Computation and Language (cs.CL), Machine Learning (cs.LG)

1 Research products, page 1 of 1

AutoCP software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models

Structured Thoughts Automaton: First Formalized Execution Model for Auto-Regressive Language Models

1 Research products, page 1 of 1

AutoCP software on GitHub