Downloads provided by UsageCounts
Pengi is an Audio Language Model that leverages Transfer Learning by framing all audio tasks as text-generation tasks. It takes as input, an audio recording, and text, and generates free-form text as output. The unified architecture of Pengi enables open-ended tasks and close-ended tasks without any additional fine-tuning or task-specific extensions. The code repository is: microsoft/Pengi: An Audio Language model for Audio Tasks (github.com)
FOS: Computer and information sciences, Sound (cs.SD), zero-shot, acoustic scenes, Audio and Speech Processing (eess.AS), sound events, FOS: Electrical engineering, electronic engineering, information engineering, Pengi, audio captioning, audio question answering, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
FOS: Computer and information sciences, Sound (cs.SD), zero-shot, acoustic scenes, Audio and Speech Processing (eess.AS), sound events, FOS: Electrical engineering, electronic engineering, information engineering, Pengi, audio captioning, audio question answering, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 4 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 47 | |
| downloads | 47 |

Views provided by UsageCounts
Downloads provided by UsageCounts