Multi-modal Deep Learning for Complex Document Understanding with Doug Burdick - #541

Multi-modal Deep Learning for Complex Document Understanding with Doug Burdick - #541

Today we’re joined by Doug Burdick, a principal research staff member at IBM Research. In a recent interview, Doug’s colleague Yunyao Li joined us to talk through some of the broader enterprise NLP problems she’s working on. One of those problems is making documents machine consumable, especially with the traditionally archival file type, the PDF. That’s where Doug and his team come in. In our conversation, we discuss the multimodal approach they’ve taken to identify, interpret, contextualize and extract things like tables from a document, the challenges they’ve faced when dealing with the tables and how they evaluate the performance of models on tables. We also explore how he’s handled generalizing across different formats, how fine-tuning has to be in order to be effective, the problems that appear on the NLP side of things, and how deep learning models are being leveraged within the group. The complete show notes for this episode can be found at twimlai.com/go/541

Episoder(779)

Graph Analytic Systems with Zachary Hanif - TWiML Talk #188

Graph Analytic Systems with Zachary Hanif - TWiML Talk #188

In this, the final episode of our Strata Data Conference series, we’re joined by Zachary Hanif, Director of Machine Learning at Capital One’s Center for Machine Learning. We start our discussion wit...

8 Okt 201854min

Diversification in Recommender Systems with Ahsan Ashraf - TWiML Talk #187

Diversification in Recommender Systems with Ahsan Ashraf - TWiML Talk #187

In this episode of our Strata Data conference series, we’re joined by Ahsan Ashraf, data scientist at Pinterest. We discuss his presentation, “Diversification in recommender systems: Using topical var...

4 Okt 201844min

The Fastai v1 Deep Learning Framework with Jeremy Howard - TWiML Talk #186

The Fastai v1 Deep Learning Framework with Jeremy Howard - TWiML Talk #186

In today's episode we're presenting a special conversation with Jeremy Howard, founder and researcher at Fast.ai. This episode is being released today in conjunction with the company’s announcement of...

2 Okt 20181h 11min

Federated ML for Edge Applications with Justin Norman - TWiML Talk #185

Federated ML for Edge Applications with Justin Norman - TWiML Talk #185

In this episode we’re joined by Justin Norman, Director of Research and Data Science Services at Cloudera Fast Forward Labs. In my chat with Justin we start with an update on the company before diving...

27 Sep 201847min

Exploring Dark Energy & Star Formation w/ ML with Viviana Acquaviva - TWiML Talk #184

Exploring Dark Energy & Star Formation w/ ML with Viviana Acquaviva - TWiML Talk #184

In today’s episode of our Strata Data series, we’re joined by Viviana Acquaviva, Associate Professor at City Tech, the New York City College of Technology. In our conversation, we discuss an ongoing p...

26 Sep 201840min

Document Vectors in the Wild with James Dreiss - TWiML Talk #183

Document Vectors in the Wild with James Dreiss - TWiML Talk #183

In this episode of our Strata Data series we’re joined by James Dreiss, Senior Data Scientist at international news syndicate Reuters. James and I sat down to discuss his talk from the conference “Doc...

24 Sep 201840min

Applied Machine Learning for Publishers with Naveed Ahmad - TWiML Talk #182

Applied Machine Learning for Publishers with Naveed Ahmad - TWiML Talk #182

In today’s episode we’re joined by Naveed Ahmad, Senior Director of data engineering and machine learning at Hearst Newspapers. In our conversation, we discuss into the role of ML at Hearst, including...

20 Sep 201839min

Anticipating Superintelligence with Nick Bostrom - TWiML Talk #181

Anticipating Superintelligence with Nick Bostrom - TWiML Talk #181

In this episode, we’re joined by Nick Bostrom, professor at the University of Oxford and head of the Future of Humanity Institute, a multidisciplinary institute focused on answering big-picture questi...

17 Sep 201844min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
i-retten
stopp-verden
forklart
popradet
nokon-ma-ga
dine-penger-pengeradet
det-store-bildet
fotballpodden-2
rss-gukild-johaug
aftenbla-bla
hanna-de-heldige
rss-ness
bt-dokumentar-2
e24-podden
frokostshowet-pa-p5
rss-dannet-uten-piano
rss-penger-polser-og-politikk