SSHOC Speech-to-text Workshop

Date: 16 April 2021 – 11:00 to 12:30 Location: Online

Survey Infrastructures systematically interview tens of thousands of individuals across Europe each year. Respondents are selected at random from all walks of life, and the hour-long interviews provide a range of data which has value for researchers and subsequently policy makers.

While complex life histories or events may be coded into the structured taxonomies required for cutting-edge sociological research, a large proportion of the information conveyed in an interview is lost. A respondent’s tone of voice, linguistic fluidity, and depth of vocabulary for example can provide insights about cognitive function, socio-economic status or verbal reasoning skills.

Making use of this lost data requires the integration of social survey and linguistic infrastructures. Such integration underpins the EOSC vision. As such, the basis for the work within SSHOC on analysing voice recorded interviews seeks to provide both a proof of concept and a framework for future research that explores this approach.

SSHOC (Social Sciences and Humanities Open Cloud) is EU funded project aiming at creating the social sciences and humanities area of the European Open Science Cloud (EOSC) thereby facilitating access to flexible, scalable research data and related services streamlined to the precise needs of the SSH community.

Agenda

  • Judith Koops from the Generations and Gender Programme, will provide an overview of the project. She will focus on the advantages of collaboration between the different infrastructures and new insights generated over the course of the project.
  • Joris Mulder from the LISS panel will demonstrate the tools used for collecting audio data through existing survey software in online interviews. He will provide an evaluation of the challenges encountered in this project as well as the way these issues were solved. 
  • Henk van den Heuvel from the Speech and Tech team will then describe the tools used for analysis of Oral History data which could be adapted for analysis of survey interviews. In particular he will address the so-called Transcription Chain, which is based on automatic speech-to-text conversion. The resulting text can, after manual correction, be processed by NLP tools to obtain more insights into its linguistic structure, or for topic detection or text summarisation, amongst others.
  • Giovanni Borghesan from the European Values Study will lead the interactive session where participants will discuss potential applications for the tools, the use of the data for new avenues of scientific research, as well as ways to improve the collection, processing and archiving of audio data.
Shopping Cart

Site Title & Logo

Site Icon

Site Icon European Values Study

Buttons

Colors

Brand
Alt Brand
Heading
Text
Primary
Secondary
Border
Subtle BG
Extra

Typography

Headings

A a B b C c D d E e F f G g H h I i J j K k L l M m N n O o P p Q q R r S s T t U u V v W w X x Y y Z z

Here's how the body text will look like on your website. You can customize the typography to match your brand personality. Whether you aim for a modern and sleek appearance or a more traditional and elegant feel, the right typography sets the tone for your content.

inherit / 48px 36px 32px / 1.4em

Heading 1

inherit / 40px 30px 26px / 1.3em

Heading 2

inherit / 32px 25px 22px / 1.3em

Heading 3

inherit / 24px 20px 18px / 1.2em

Heading 4

inherit / 20px 17px 15px / 1.2em

Heading 5

inherit / 17px 15px 13px / 1.25em

Heading 6

Explore different font families, sizes, weights, and styles to find the perfect combination that encapsulates the essence of your brand. With each adjustment, see how your message transforms, becoming a powerful reflection of your identity and vision.

Quote

The future will belongs to those who believe in the beauty of their dreams.


Elanor Rosevelt

Unordered List

  • List Item 1
  • List Item 2
  • List Item 3
Scroll to Top

Discover more from European Values Study

Subscribe now to keep reading and get access to the full archive.

Continue reading