Marvin Center Floor Diagram (PDF)

Pre-conference event links:

  • Tutorial: Sunday, June 10, 2012, 8:00 am - 5:00 pm Location: Room 302 (Third Floor)
  • Doctoral Consortium: Sunday, June 10, 2012, 8:00 am - 5:00 pm Location: Room 308 (Third Floor)

JCDL 2012 Conference events

Monday, June 11th

8:00 am - 4:00 pm Registration Location: Lobby (Third Floor)
8:00 am - 9:00 am Breakfast Location: Grand Ballroom (Third Floor)

9:00 am - 9:15 am Opening Remarks Location: Betts Theatre (First Floor)

Monday, June 11th, 9:15 am - 10:15 am, Betts Theatre (First Floor)
Keynote: All You Cared about Is Gone and All Your Friends Are Dead: The Fun Frolic of Preservation Activism

Keynote Speaker:  Jason Scott

Yes, the endless progression of all things towards loss, destruction and disappearance is a bit of a bummer, but that's no reason to be down about it! Against the wave of thrown-up hands and jettisoned heritage comes The Archive Team, a rogue band of activist preservationists who are attacking the problem from all sides, logic and futility be tossed. Archive Team's mascot, Jason Scott, will walk through the group's approach to the problem and the solutions found so far, and lay out the groundwork for where this wheels-off-the-cart mineshaft is leading.


10:15 am - 10:45 am Break Location: Grand Ballroom (Third Floor)

Monday, June 11th, 10:45 am - 12:15 pm (two concurrent sessions)

Session 1
Session chair:
Robert Sanderson
Location: Continental Ballroom (Third Floor)

Session 2
Session chair:
Xiao Hu
Location: Betts Theatre (First Floor)

Session 1 papers

Paper (Full)
On the Institutional Archiving of Social Media
Catherine Marshall; Frank Shipman

Paper (Short)
To Envisage and Design the Transition from a Digital Archive System Developed for Domain Experts to One for Non-domain Users

Maristella Agosti; Nicola Orio

Paper (Short)
Visualizing Digital Collections at Archive-It 

Kalpesh Padia; Yasmin Alnoamany; Michele Weigle

Paper (Short)
Data, Data Use, & Inquiry: A New Point of View on Data Curation

Jillian Wallis; Laura Wynholds; Christine Borgman; Ashley Sands; Sharon Traweek

Paper (Short)
Digital Preservation and Knowledge Discovery Based on Documents from an International Health Science Program
Dharitri Misra; Robert Hall; Susan Payne; George Thoma

Session 2 papers

Paper (Full)
Identifying Core Concepts in Educational Resources
James Foster; Md. Arafat Sultan; Holly Devaul; Ifeyinwa Okoye; Tamara Sumner

Paper (Short)
Teacher Sociality and Information Diffusion in Educational Digital Libraries
Ogheneovo Dibie; Keith Maull; Tamara Sumner

Paper (Short)
Is it Time to Change the OER Repositories Role?
Christo Dichev; Darina Dicheva

Paper (Short)
Deduced Social Networks for Educational Digital Library
Monika Akbar; Clifford Shaffer; Edward Fox

Paper (Short)
A Tale of Two Studies: Is Dissemination Working?

Flora McMartin; Joe Tront; Wesley Shumar; Sarah Holsted






12:15 pm - 1:30 pm Lunch   Location: Grand Ballroom (Third Floor)

Monday, June 11th, 1:30 pm - 3:00 pm (two concurrent sessions)

Session 3
Bibliographic Networks
Session chair: Kazunari Sugiyama
Location: Continental Ballroom (Third Floor)

Session 4
Moderator: Leslie Johnston, Library of Congress
Location: Betts Theatre (First Floor)

Session 3 papers

Paper (Full)
To Better Stand on the Shoulder of Giants: Learning to Identify Potentially Influential Literature     
Rui Yan (Nominated for Best Student Paper)

Paper (Full)
BibRank: a Language-Based Model for Co-Ranking Entities in Bibliographic Networks      
Laure Soulier; Lamjed Ben Jabeur; Lynda Tamine; Wahiba Bahsoun

Paper (Full)
Modeling and Exploiting Heterogeneous Bibliographic Networks for Expertise Ranking
Hongbo Deng; Jiawei Han; Michael R. Lyu; Irwin King
(Nominated for Vannevar Bush Best Paper)

Session 4 panel

Big Data Is Already Here, and It’s Not Always What We Think


Libraries have over 20 years of experience in managing large numbers of digital files and in indexing catalog records and full-text documents. In the last several years, research libraries in general have been facing a growing expectation that large digital library collections and record sets can be mined and analyzed for research purposes. Many libraries now have several collections of unstructured or semi-structured content that can be measured in hundreds of terabytes. This panel will address trends and forecasts for the introduction, adoption and maturity of tools and services and configurations of technical environments to support processing, management, mining, indexing, and analysis of large volumes of unstructured digital content.


3:00 pm - 3:30 pm Break Location: Grand Ballroom (Third Floor)

Monday, June 11th, 3:30 pm - 5:00 pm (two concurrent sessions)

Session 5
Session chair:
Unmil P. Karadkar
Location: Continental Ballroom (Third Floor)

Session 6
Session chair:
Sally Jo Cunningham
Location: Betts Theatre (First Floor)

Session 5 papers

Paper (Full)
Live Television in a Digital Library   
Maxime Rouast; David Bainbridge

Paper (Full)
Transforming Japanese Archives into Accessible Digital Books   
Tatsuya Ishihara; Toshinari Itoko; Daisuke Sato; Asaf Tzadok; Hironobu Takagi

Paper (Full)
IPKB: A Digital Library for Invertebrate Paleontology  
Yuanliang Meng; Junyan Li; Patrick Denton; Yuxin Chen; Bo Luo; Paul Selden; Xue-Wen Chen



Session 6 papers

Paper (Full)
Descriptive Metadata, Iconclass, and Digitized Emblem Literature       
Timothy W. Cole; Myung-Ja Han; Jordan Vannoy

Paper (Full)
Generating Ground Truth for Music Mood Classification Using Mechanical Turk  
Jin Ha Lee; Xiao Hu
(Nominated for Vannevar Bush Best Paper)

Paper (Short)
Categorization of Computing Education Resources into the ACM Computing Classification System
Yinlin Chen; Paul Bogen; Edward Fox; Haowei Hsieh; Lillian Cassel

Paper (Short)
Re-ranking Bibliographic Records for Personalized Library Search       
Tadashi Nomoto



5:15 pm - 8:30 pm Posters & demonstrations session and Reception

5:15 pm – Minute Madness Location: Betts Theatre (First Floor)

6:15 pm – Poster viewing, networking, and mingling Location: Grand Ballroom and Terrace (Third Floor)

8:30 pm – Reception ends


Tuesday, June 12th

8:00 am - 4:00 pm Registration Location: Lobby (Third Floor)
8:00 am - 9:00 am Breakfast Location: Grand Ballroom (Third Floor)

Tuesday, June 12th, 9:00 am - 10:00 am, Betts Theatre (First Floor)
Keynote: The Reality of Reproducibility for in silico Science
Keynote Speaker: Carole Goble

Reproducibility, at least in principle if not in practice, underpins the scientific method. For an experimental finding to be reproducible its materials must be available and its methods clear, accurate and transparent. In in silico science the materials and methods are digital: datasets, digital publications and software. For the materials the case is being made for the public availability and reusability of scientific data, and open access to publications. But what about curation and preservation of digital experimental method? To truly reproduce results we need methodological transparency. One increasingly important class of digital method is the scientific workflow: an executable description of procedures that define the sequence of computational steps in an automated data analysis. However, things are never simple. Workflows are executable software sensitive to their components and their environments. Decay in a component leads to decay of the workflow.

Consequently, Wf4Ever is an EU-funded project that aims to develop technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows as a step towards scientific reproducibility.

In this talk I will explore the reality of in silico science reproducibility. In particular I will: highlight the confusions and spectrums of what we mean by reproducibility; differentiate between the preservation of workflows and conservation of workflows; and sketch the role provenance has to play. I will present our first steps towards a reproducibility framework based on suitable, semantically encoded Research Objects and the services that support their creation and management. I will draw on our extensive experience of Linked Data; our scientific domains ranging from Astrophysics, Genomics, and Biodiversity, to Digital document preservation and Social Science; and our software and services: the Taverna Scientific Workflow System; the myExperiment workflow community sharing environment; the BioCatalogue of community contributed Web Services in the Life Sciences; and the dLibra digital document repository.


10:00 am - 10:30 am Break Location: Grand Ballroom (Third Floor)

Tuesday, June 12th, 10:30 am - 12:00 pm (two concurrent sessions)

Session 7
Session chair: Luis Francisco-Revilla
Location: Continental Ballroom (Third Floor)


Session 8
Moderator: Brett Bobley, Director, Office of Digital Humanities, NEH
Location: Betts Theatre (First Floor)

Session 7 papers

Paper (Full)
Content-based Layouts for Exploratory Metadata Search in Scientific Research Data
Jürgen Bernard; Tobias Ruppert; Maximilian Scherer; Jörn Kohlhammer; Tobias Schreck (Nominated for Best Student Paper)

Paper (Full)
A Comparison of Techniques for Detecting Abnormal Change in Blogs
Paul Bogen; Richard Furuta; Frank Shipman

Paper (Short)
Refactoring HUBzero for Linked Data    
Michael Witt; Yongyang Yu

Paper (Short)
Treating Data Like Software: A Case for Production Quality Data
Jennifer Schopf






Session 8 roundtable

The Digging into Data Challenge: A Roundtable Discussion

Roundtable Participants:

Stuart Dempster, Director, The Strategic Content Alliance, JISC

E. Thomas Ewing, Professor of History, Virginia Tech, NEH PI for An Epidemiology of Information: Data Mining the 1918 Influenza Pandemic

Chuck Henry, President, CLIR

Ray Larson, Professor, School of Information, UC Berkeley, IMLS PI for Integrating Data Mining and Data Management Technologies for Scholarly

Jennifer Serventi, Senior Program Officer, Office of Digital Humanities, NEH

Cassidy R. Sugimoto, Assistant Professor, School of Library and Information Science, Indiana University, NSF PI for Cascades, Islands, or Streams? Time, Topic, and Scholarly Activities in Humanities and Social Science Research

Chuck Thomas, Senior Library Program Officer, IMLS

Elizabeth Tran, Associate Program Officer, NSF


12:00 pm - 1:30 pm Lunch  Location: Grand Ballroom (Third Floor)

Tuesday, June 12th, 1:30 pm - 3:00 pm (two concurrent sessions)

Session 9
Named Entities
Session chair:
George Buchanan
Location: Continental Ballroom (Third Floor)

Session 10
Books and Reading
Session chair:
Rudi Schmiede
Location: Betts Theatre (First Floor)

Session 9 papers

Paper (Full)
Active Associative Sampling for Author Name Disambiguation     
Marcos Goncalves; Rodrigo Silva; Anderson Ferreira; Adriano Veloso; Alberto Laender

Paper (Full)
AckSeer: A Repository and Search Engine for Automatically Extracted Acknowledgments from Digital Libraries     
Madian Khabsa; Pucktada Treeratpituk; C. Lee Giles

Paper (Short)
Similar Researcher Search in Academic Environments     
Sujatha Das Gollapalli; Prasenjit Mitra; C. Lee Giles

Paper (Short)
An Analysis of the Named Entity Recognition Problem in Digital Library Metadata
Nuno Freire; Jose Borbinha; Pável Calado

Session 10 papers

Paper (Full)
The "City of Lit" Digital Library - A Case Study of Interdisciplinary Research and Collaboration
Haowei Hsieh; Bridget Draxler; Nicole Dudley; Jon Winet

Paper (Full)
Student Researchers, Citizen Scholars, and the Trillion Word Library
Gregory Crane; Bridget Almas; Alison Babeu

Paper (Short)
Learning Topics and Related Passages in Books 
David Newman; Youn Noh; Kat Hagedorn; Arun Balagopalan

Paper (Short)
Emphasis on Examining Results in Fiction Searches Contributes to Finding Good Novels   
Suvi Oksanen; Pertti Vakkari



3:00 pm - 3:30 pm Break Location: Grand Ballroom (Third Floor)

Tuesday, June 12th, 3:30 pm - 5:00 pm (two concurrent sessions)

Session 11
Concepts and Topics
Session chair:
David Mimno
Location: Continental Ballroom (Third Floor)

Session 12
Session chair:
Martin Klein
Location: Betts Theatre (First Floor)

Session 11 papers

Paper (Full) 
Event-centric Search and Exploration in Document Collections
Jannik Strötgen; Michael Gertz (Nominated for Best Student Paper)

Paper (Short) 
Dynamic Online Views of Meta-Indexes   
Michael Huggett; Edie Rasmussen

Paper (Short) 
Topic Models for Taxonomies    
Anton Bakalov; Andrew McCallum; Hanna Wallach; David Mimno

Paper (Full) 
Concept Chaining Utilizing Meronyms in Text Characterization   
Lori Watrous-deVersterre; Chong Wang; Min Song




Session 12 papers

Paper (Full)
Improving Multi-faceted Book Search by Incorporating Sparse Latent Semantic Analysis of Click-through Logs     
Deng Yi; Yin Zhang; Haihan Yu; Yanfei Yin; Jing Pan; Baogang Wei

Paper (Full) 
Exploiting Real-time Information Retrieval in Microblogosphere 
Feng Liang; Runwei Qiang; Jianwu Yang

Paper (Short) 
Personalized Query Expansion in the QIC System
Prat Tanapaisankit; Lori Watrous-deVersterre; Min Song

Paper (Short) 
Investigating Keyphrase Indexing with Text Denoising   
Rushdi Shams; Robert Mercer



7:00 pm - 9:00 pm Awards Banquet Location: Sequoia Restaurant

6:00 pm – Bus departs en route to restaurant
7:00 pm – Reception, Banquet, and Awards Presentation

  • Vannevar Bush Best Paper Award
  • Best Student Paper Award

9:00 pm – Banquet ends


Wednesday, June 13th

8:00 am - 11:30 am Registration Location: Lobby (Third Floor)
8:00 am - 9:00 am Breakfast Location: Grand Ballroom (Third Floor)

Wednesday, June 13th, 9:00 am - 10:30 am (two concurrent sessions)

Session 13
Session chair:
Edie Rasmussen
Location: Continental Ballroom (Third Floor)

Session 14
User Behavior
Session chair:
Pertti Vakkari
Location: Betts Theatre (First Floor)

Session 13 papers

Paper (Full)
A Hybrid Two-stage Approach for Discipline-independent Canonical Representation Extraction from References     
Sung Hee Park; Roger Ehrich; Edward Fox

Paper (Full)
Web-based Citation Parsing, Correction, and Augmentation
Liangcai Gao; Xixi Qi; Zhi Tang; Xiaofan Lin; Ying Liu

Paper (Short)
Improving Algorithm Search Using the Algorithm Co-citation Network     
Suppawong Tuarob; Prasenjit Mitra; C. Lee Giles

Paper (Short)
Evaluating and Ranking Patents Using Weighted Citations
Sooyoung Oh; Zhen Lei; Prasenjit Mitra; John Yen

Session 14 papers

Paper (Full)
Book Selection Behaviour in the Physical Library: Implications for eBook Collection    
Annika Hinze; Nicholas Vanderschantz; Claire Timpany; Sally Jo Cunningham; Dana McKay

Paper (Full)
How Do People Organize Their Photos in Each Event and How Does It Affect Storytelling, Searching, and Interpretation Tasks?     
Jesse Prabawa Gozali; Min-Yen Kan; Hari Sundaram

Paper (Full)
Co-reading: Investigating Collaborative Group Reading  
Jennifer Pearson; Tom Owen; Harold Thimbleby; George Buchanan
(Nominated for Vannevar Bush Best Paper)





10:30 am - 11:00 am Break Location: Grand Ballroom (Third Floor)

Wednesday, June 13th, 11:00 am - 12:15 pm, Betts Theatre (First Floor)
Closing and Keynote: The Sensible Moment: 1680 - 2012
Keynote Speaker: George Dyson

"There is as it were a continued Chain of Ideas coyled up in the Repository of the Brain, the first end of which is farthest removed from the Center or Seat of the Soul where the Ideas are formed, which is always the Moment present when considered," explained Robert Hooke in 1680, before estimating the number of distinct ideas that could be stored in a single human brain. Over the next three centuries, the costs of data storage (and retrieval) have declined to where the cost of storing (and indexing) information is often less than the cost of making a decision to throw it away. The advent of high-speed random-access memory in the mid-twentieth century highlights how we got to here from there.


Post-conference event link:

  • Workshops: Wednesday, June 13, 1:00 pm - Thursday, June 14, 5:00 pm