The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions

Zhu, Tao; Phipps, David; Pridgen, Adam; Crandall, Jedidiah R.; Wallach, Dan S.

Computer Science > Computers and Society

arXiv:1303.0597 (cs)

[Submitted on 4 Mar 2013 (v1), last revised 10 Jul 2013 (this version, v2)]

Title:The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions

Authors:Tao Zhu, David Phipps, Adam Pridgen, Jedidiah R. Crandall, Dan S. Wallach

View PDF

Abstract:Weibo and other popular Chinese microblogging sites are well known for exercising internal censorship, to comply with Chinese government requirements. This research seeks to quantify the mechanisms of this censorship: how fast and how comprehensively posts are this http URL analysis considered 2.38 million posts gathered over roughly two months in 2012, with our attention focused on repeatedly visiting "sensitive" users. This gives us a view of censorship events within minutes of their occurrence, albeit at a cost of our data no longer representing a random sample of the general Weibo population. We also have a larger 470 million post sampling from Weibo's public timeline, taken over a longer time period, that is more representative of a random sample.
We found that deletions happen most heavily in the first hour after a post has been submitted. Focusing on original posts, not reposts/retweets, we observed that nearly 30% of the total deletion events occur within 5- 30 minutes. Nearly 90% of the deletions happen within the first 24 hours. Leveraging our data, we also considered a variety of hypotheses about the mechanisms used by Weibo for censorship, such as the extent to which Weibo's censors use retrospective keyword-based censorship, and how repost/retweet popularity interacts with censorship. We also used natural language processing techniques to analyze which topics were more likely to be censored.

Comments:	arXiv admin note: substantial text overlap with arXiv:1211.6166
Subjects:	Computers and Society (cs.CY); Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
Cite as:	arXiv:1303.0597 [cs.CY]
	(or arXiv:1303.0597v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.1303.0597

Submission history

From: Tao Zhu [view email]
[v1] Mon, 4 Mar 2013 04:15:03 UTC (546 KB)
[v2] Wed, 10 Jul 2013 01:21:03 UTC (533 KB)

Computer Science > Computers and Society

Title:The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators