Jeffrey A. Bilmes: Software and Data
Software
4/15/2021: We had a major server crash and some of the links below are no longer working. Due to COVID it has been difficult to get things restored, but I am hoping to have all of the links below working soon.- The graphical models toolkit GMTK hosted on github, as well as long PDF documentation
- The Vocal Joystick Software for windows.
- The PhiPAC automatically tuning matrix-matrix multiply library (and the first auto-tuning matrix multiply dense linear algebra library).
- The Measure propagation code and web page, a graph Laplacian Manifold approximation based semi-supervised learning algorithm that uses an objective function based on KL-divergence (code also includes fast C++ parallel implementation).
- The Buried Markov Model (BMM) code (includes mixtures of sparse linear conditional multi-time Gaussian models). Sorry, no documentation.
- Multi-party meeting scheduling with simple preference aggregation rules.
- Extensions to the old Berkeley parallel make software (or what is known as pmake). The original pmake utility is described here. We have made a number of significant extensions to pmake including a full gnu autoconf configuration, many new resource constraints (including dynamic), and other features (as well as removed some old ones that were no longer needed). The complete source code is at at pmake-3.0-alpha. Note that this is an alpha release, and is basically working but there are no plans for additional work to be done on this (at least by me or my group), nor can I answer any further questions about this (see the source code).
Data
- Corpus definitions and baseline systems for both the SVitchboard-II and FiSVer-I datasets can be found at this link. The paper describing it is here
- Cooperative Cut image data , a set of difficult to segment images (with elongated or narrow structures, and contrast gradients) along with ground truth labellings, and that were used in the following paper
- Vocal Joystick Vowel Corpus
- A small amount of hand-aligned French/English data, useful for statistical machine translation systems, done by Karim Filali.
- The COSINE multi-channel real-world in-situ noisy speech corpus. (now available for download).
- The Semi-Supervised Switchboard Transcription (S3TP) project and its data. In the 1990s, the switchboard transcription project gave us 1.5 hours of frame-by-frame phonetically transcribed switchboard conversational speech data. Here, we have used a modern semi-supervised learning algorithm to phonetically label at the frame level the remaining 250 hours of SWB-I, and we call this the semi-supervised switchboard transcription project (or s3tp). The data and algorithms are available here.