Shadow Profiles on Social Networks
Data Skeptic13 Feb 2015

Shadow Profiles on Social Networks

Emre Sarigol joins me this week to discuss his paper Online Privacy as a Collective Phenomenon. This paper studies data collected from social networks and how the sharing behaviors of individuals can unintentionally reveal private information about other people, including those that have not even joined the social network! For the specific test discussed, the researchers were able to accurately predict the sexual orientation of individuals, even when this information was withheld during the training of their algorithm.

The research produces a surprisingly accurate predictor of this private piece of information, and was constructed only with publically available data from myspace.com found on archive.org. As Emre points out, this is a small shadow of the potential information available to modern social networks. For example, users that install the Facebook app on their mobile phones are (perhaps unknowningly) sharing all their phone contacts. Should a social network like Facebook choose to do so, this information could be aggregated to assemble "shadow profiles" containing rich data on users who may not even have an account.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(601)

[MINI] Sample Sizes

[MINI] Sample Sizes

There are several factors that are important to selecting an appropriate sample size and dealing with small samples. The most important questions are around representativeness - how well does your sam...

18 Sep 201513min

The Model Complexity Myth

The Model Complexity Myth

There's an old adage which says you cannot fit a model which has more parameters than you have data. While this is often the case, it's not a universal truth. Today's guest Jake VanderPlas explains th...

11 Sep 201530min

[MINI] Distance Measures

[MINI] Distance Measures

There are many occasions in which one might want to know the distance or similarity between two things, for which the means of calculating that distance is not necessarily clear. The distance between ...

4 Sep 201512min

ContentMine

ContentMine

ContentMine is a project which provides the tools and workflow to convert scientific literature into machine readable and machine interpretable data in order to facilitate better and more effective ac...

28 Aug 201553min

[MINI] Structured and Unstructured Data

[MINI] Structured and Unstructured Data

Today's mini-episode explains the distinction between structured and unstructured data, and debates which of these categories best describe recipes.

21 Aug 201513min

Measuring the Influence of Fashion Designers

Measuring the Influence of Fashion Designers

Yusan Lin shares her research on using data science to explore the fashion industry in this episode. She has applied techniques from data mining, natural language processing, and social network analys...

14 Aug 201524min

[MINI] PageRank

[MINI] PageRank

PageRank is the algorithm most famous for being one of the original innovations that made Google stand out as a search engine. It was defined in the classic paper The Anatomy of a Large-Scale Hypertex...

7 Aug 20158min

Data Science at Work in LA County

Data Science at Work in LA County

In this episode, Benjamin Uminsky enlightens us about some of the ways the Los Angeles County Registrar-Recorder/County Clerk leverages data science and analysis to help be more effective and efficien...

29 Juli 201541min

Populärt inom Vetenskap

allt-du-velat-veta
dumma-manniskor
p3-dystopia
ufo-sverige
rss-ufobortom-rimligt-tvivel
kapitalet-en-podd-om-ekonomi
svd-nyhetsartiklar
hacka-livet
paranormalt-med-caroline-giertz
ufo-sverige-2
rss-spraket
sexet
rss-vetenskapsradion
medicinvetarna
det-morka-psyket
rss-vetenskapsradion-2
dumforklarat
rss-dennis-world
rss-tidslinjen-podcast
rss-tidsmaskinen