Gather all the information from every single document.

TagWorks was designed by a sociologist and data scientist to efficiently analyze large sets of documents in rich detail. This web-based system can help you finish your giant data labeling project up to ten times faster.


What can TagWorks do?


Complexity at scale

With TagWorks you can tackle complex, large-scale projects with ease. Efficiently annotate, tag, and classify tens of thousands or millions of documents with hundreds of labels.


Up to 10x faster

Complete what would normally be a decade-long project in a year. TagWorks removes the need to train wave after wave of research assistants, saving you time and money.


The power of the crowd

Easily enlist thousands of crowd workers to extract the information you need. Every annotator will be tested and pre-qualified before they work on your project.


Eliminates task management

TagWorks automates worker, document, and task management, saving you hundreds of hours otherwise spent training team members and directing traffic.


Validated results

Unique interfaces make it simple for you to review and validate your results. Easily share reliability statistics and data provenance with your peers and reviewers.


Web-based

TagWorks is completely web-based, so no software installation or maintenance is required. Crowd workers and collaborators can join your project with the click of a mouse.


How does TagWorks work?

  1. Gather your thousands of documents and upload them to your TagWorks server.

  2. Our team helps you convert your conceptual scheme into an assembly line of annotation tasks.

  3. You and your team perform a few dozen tasks to test and refine your assembly line. Your best work will establish a “gold standard” set of high-quality tags that will be used to qualify online workers.

  4. Open your data labeling factory to thousands of online crowd workers or volunteers.

  5. TagWorks automatically finds agreement among annotators, and produces the validation statistics you need to publish your results.
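
To make steps 3 and 5 more concrete, here is a minimal Python sketch of the two ideas: scoring a worker against a gold standard, and finding agreement among annotators. It is an illustration only, not TagWorks code. The function names, the simple majority vote, and the 60% agreement threshold are all assumptions chosen for the example; the methods TagWorks actually uses are not described here.

```python
from collections import Counter


def gold_accuracy(worker_answers, gold):
    """Fraction of gold-standard tasks a worker labeled correctly.

    Both arguments are dicts mapping a task id to a label (hypothetical format).
    Only gold tasks the worker actually attempted are scored.
    """
    scored = [task for task in gold if task in worker_answers]
    if not scored:
        return 0.0
    correct = sum(worker_answers[task] == gold[task] for task in scored)
    return correct / len(scored)


def consensus_label(labels, min_agreement=0.6):
    """Return (label, share) for the most common label among annotators.

    The label is None when its share of the votes falls below
    min_agreement (an assumed threshold for this sketch).
    """
    if not labels:
        return None, 0.0
    label, count = Counter(labels).most_common(1)[0]
    share = count / len(labels)
    return (label if share >= min_agreement else None), share


# Hypothetical example data: one worker checked against two gold tasks,
# and three annotators labeling the same passage.
gold = {"task-1": "protest", "task-2": "arrest"}
worker = {"task-1": "protest", "task-2": "march"}
print(gold_accuracy(worker, gold))
print(consensus_label(["protest", "protest", "arrest"]))
```

In a sketch like this, a worker whose gold accuracy falls below some cutoff would not be qualified, and passages without a consensus label would be sent to more annotators or flagged for your review.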


Who uses TagWorks?


Columbia University’s History Lab is using TagWorks to annotate an archive of over 1 million diplomatic reports.


The University of Texas School of Information is using TagWorks to build an AI capable of cataloguing all the software scientists use to conduct their research.


The Public Editor project is using TagWorks to transparently assess the credibility of news articles and news organizations.


Where did TagWorks come from?


Sociologist Nick Adams wanted to better understand the complex social interactions during the Occupy movement by scrutinizing thousands of news articles. His goal was to annotate more than 8,000 articles with labels identifying over 300 separate variables. With all these data, he would be able to refine multi-level time series models of interactions among police, protesters, and city governments. But it was not easy.


The labeling process was complex, tedious, and required the close management of many research assistants. But automated approaches to language could not come close to identifying all the information relevant to theories of protest policing. Many ambitious researchers have faced similar challenges. They always had to simplify their approach or reduce their document set.


But Adams persisted, and reached a breakthrough. He figured out how to divide the whole annotation process into simpler tasks that online workers could do without face-to-face training. The new assembly line approach could be managed by software and validated by algorithms, so researchers could stick to what they do best.

Adams built a prototype and recruited veteran software engineer Norman Gilmore. Together they’re leading the TagWorks team to make sure you can answer questions even bigger than your data.


When is TagWorks the right solution?


When it is

If you have a large (or even gigantic) set of documents and/or you have an intricate conceptual scheme with dozens or even hundreds of labels, TagWorks is the solution you have been looking for.

When it isn’t

If you have fewer than 500 documents to analyze and/or you have a simple conceptual scheme with a handful of labels, there are a number of other tools that can meet your needs.


If you’d like to learn more about the suitability of TagWorks for your project, get in touch and we can set up a free consultation.


Get started

Interested in using TagWorks? Complete the form below and we’ll chat soon to schedule a free consultation.
