76 - Increasing In-Class Similarity by Retrofitting Embeddings with Demographics, with Dirk Hovy
NLP Highlights27 Nov 2018

76 - Increasing In-Class Similarity by Retrofitting Embeddings with Demographics, with Dirk Hovy

EMNLP 2018 paper by Dirk Hovy and Tommaso Fornaciari. https://www.semanticscholar.org/paper/Improving-Author-Attribute-Prediction-by-Linguistic-Hovy-Fornaciari/71aad8919c864f73108aafd8e926d44e9df51615 In this episode, Dirk Hovy talks about natural language as social phenomenon which can provide insights about those who generate it. For example, this paper uses retrofitted embeddings to improve on two tasks: predicting the gender and age group of a person based on their online reviews. In this approach, authors embeddings are first generated using Doc2Vec, then retrofitted such that authors with similar attributes are closer in the vector space. In order to estimate the retrofitted vectors for authors with unknown attributes, a linear transformation is learned which maps Doc2Vec vectors to the retrofitted vectors. Dirk also used a similar approach to encode geographic information to model regional linguistic variations, in another EMNLP 2018 paper with Christoph Purschke titled “Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting” [link: https://www.semanticscholar.org/paper/Capturing-Regional-Variation-with-Distributed-Place-Hovy-Purschke/6d9babd835d0cdaaf175f098bb4fd61fd75b1be0].

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(145)

Are LLMs safe?

Are LLMs safe?

Curious about the safety of LLMs? 🤔 Join us for an insightful new episode featuring Suchin Gururangan, Young Investigator at Allen Institute for Artificial Intelligence and Data Science Engineer at A...

29 Feb 202442min

"Imaginative AI" with Mohamed Elhoseiny

"Imaginative AI" with Mohamed Elhoseiny

This podcast episode features Dr. Mohamed Elhoseiny, a true luminary in the realm of computer vision with over a decade of groundbreaking research. As an Assistant Professor at KAUST, Dr. Elhoseiny's ...

8 Jan 202423min

142 - Science Of Science, with Kyle Lo

142 - Science Of Science, with Kyle Lo

Our first guest with this new format is Kyle Lo, the most senior lead scientist in the Semantic Scholar team at Allen Institute for AI (AI2), who kindly agreed to share his perspective on #Science of ...

28 Des 202348min

141 - Building an open source LM, with Iz Beltagy and Dirk Groeneveld

141 - Building an open source LM, with Iz Beltagy and Dirk Groeneveld

In this special episode of NLP Highlights, we discussed building and open sourcing language models. What is the usual recipe for building large language models? What does it mean to open source them? ...

29 Jun 202329min

140 - Generative AI and Copyright, with Chris Callison-Burch

140 - Generative AI and Copyright, with Chris Callison-Burch

In this special episode, we chatted with Chris Callison-Burch about his testimony in the recent U.S. Congress Hearing on the Interoperability of AI and Copyright Law. We started by asking Chris about ...

6 Jun 202351min

139 - Coherent Long Story Generation, with Kevin Yang

139 - Coherent Long Story Generation, with Kevin Yang

How can we generate coherent long stories from language models? Ensuring that the generated story has long range consistency and that it conforms to a high level plan is typically challenging. In this...

24 Mar 202345min

138 - Compositional Generalization in Neural Networks, with Najoung Kim

138 - Compositional Generalization in Neural Networks, with Najoung Kim

Compositional generalization refers to the capability of models to generalize to out-of-distribution instances by composing information obtained from the training data. In this episode we chatted with...

20 Jan 202348min

137 - Nearest Neighbor Language Modeling and Machine Translation, with Urvashi Khandelwal

137 - Nearest Neighbor Language Modeling and Machine Translation, with Urvashi Khandelwal

We invited Urvashi Khandelwal, a research scientist at Google Brain to talk about nearest neighbor language and machine translation models. These models interpolate parametric (conditional) language m...

13 Jan 202335min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
jss
liberal-halvtime
rekommandert
sinnsyn
forskningno
villmarksliv
tomprat-med-gunnar-tjomlid
rss-paradigmepodden
fjellsportpodden
nevropodden
tidlose-historier
kvinnehelsepodden
dekodet-2
grunnstoffene
rss-zahid-ali-hjelper-deg
diagnose
rss-inn-til-kjernen-med-sunniva-rose
rss-rekommandert