Submitted by Portal Editor Ahlam Saleh (email@example.com), Research Librarian at the Health Sciences Library, University of Arizona, Tucson, AZ, on behalf of Chris Kollen.
Thank you to Chris Kollen for being our guest blogger for this post. Chris Kollen is the Data Curation Librarian at the University of Arizona Libraries.
The University of Arizona (UA) Libraries has sponsored various programs during International Open Access Week for the past several years. The theme for this year’s Open Access week is “Open in Action” – encourages all researchers, students, and others to take concrete steps to make their own work more openly available and encourage others to do the same.
Transparency, integrity, and reproducibility are defining aspects of the scientific method. According to a survey conducted by Nature in 2016, 52% of the respondents agreed there is a “significant crisis of reproducibility”. During International Open Access Week this year, the UA Libraries sponsored a program, Data Reproducibility: Integrity and Transparency, an issue of prime interest to UA researchers.
The first speaker, Ivan Oransky, M.D., co-founded Retraction Watch in 2010 with Adam Marcus.
Dr. Oransky’s presentation, Retractions, Post-Publication Peer Review, and Fraud: Scientific
Publishing’s Wild West, discussed the increase in retractions, the reasons for the increase, whether or not fraud is on the increase, the growth of post-publication peer review, and other trends.
The second speaker was Anita Bandrowski from the University of California San Diego and lead for the Resource Reproducibility Initiative. Dr. Bandrowski presentation, To authenticate or Not to Authenticate? addressed the National Institute of Health requirement to authenticate Key Biological Resources for most grant proposals. She discussed what Key Biological Resources are, ongoing efforts to understand how to authenticate them, the journal response, practical steps to improve reporting of research in journal publications, and resources available.
Our third speaker was Christian Collberg, Professor in the Department of Computer Science at the UA. Dr. Collberg’s presentation, Repeatability in Computer Systems Research described a study conducted with Dr. Todd Proebsting on the extent to which Computer Systems researchers share their code and data. They examined papers with results backed by code to see what fraction they would be able to obtain and build the code.
The program was one of the most successful Open Access programs the UA Libraries has sponsored! We are planning on making the recording of the program and the speaker’s slides
available at the University of Arizona’s Campus Repository.
Call for Proposals: Breakout Session and Poster Presentation Abstracts
The e-Science Symposium Planning Committee invites you to submit a proposal for participation at the 9th Annual University of Massachusetts and New England Area Librarian eScience Symposium, to be held on Thursday, April 6th, 2017 at the University of Massachusetts Medical School in Worcester, MA.
This year’s symposium theme “Libraries in Data Science: Addressing Gaps and Bridges” focuses on collaborations and opportunities for librarians becoming involved in data science at their institutions. Check out the confirmed agenda at the 2017 symposium conference page!
We are interested in receiving proposals for presentations, interactive workshops and posters that highlight librarians involved in collaborations with a research group, department, center, or lab on or off their respective academic campus.
Proposals should be tied to one of these four categories:
- Data Repositories (example: developing data repositories; advising researchers on discipline-specific repositories)
- Education/Training (example: data information literacy)
- Funding Agencies (example: writing data management plan templates for NIH)
- Institutional Models (example: solo data librarian vs center/team)
Breakout Sessions: 60 minute presentation or interactive workshop. Selected presenters will receive an honorarium and all travel expenses paid for by the NN/LM NER. All sessions will be video recorded and featured on the e-Science Symposium website. Because there are a limited number of breakout sessions, all presentation and interactive workshop proposals will be considered for a poster presentation.
The deadline for submitting a Breakout Session abstract is Wednesday, December 21st, 2016.
Proposal decisions will be made by Friday, January 20th, 2017
Poster Session: Awards to the Most Informative Poster in Communicating e-Science Librarianship, Poster for Best Example of e-Science in Action, and Best Poster Overall.
The deadline for submitting a Poster Session abstract is Wednesday, February 8th, 2017.
Proposal decisions will be made by Friday, February 24th, 2017
Abstracts must follow the Medical Library Association guidelines for creating a structured abstract, as outlined at http://research.mlanet.org/structured_abstract.html.
Submission Process: Click on Submit Proposal in the left navigation pane to submit your proposal. The submitting/corresponding author will need to log in or create a free eScholarship@UMMS account. Follow the entry instructions for each field. Before you begin the online submission process, please be sure you have the following items ready:
- The submission’s title
- Names, affiliations, and email addresses of all authors
- The abstract
- A list of keywords
Electronic presentation materials (PowerPoints, PDFs, etc) will be required to be submitted by Wednesday, March 22nd, 2017 to be posted on the e-Science Symposium website and stored permanently with a Creative Commons License in the eScholarship@UMMS Open Access Repository following the symposium.
Submitted by Portal Editor Hillary Miller, Scholarly Communications Outreach Librarian, Virginia Commonwealth University, firstname.lastname@example.org
Policies around data sharing have been quickly developing and changing, and much is still unclear. Below you’ll find three recently released resources to help you understand the current state of open data and anticipate the developments that are ahead.
The Scholarly Publishing and Academic Resources Coalition (SPARC) has released a new resource for understanding, comparing, and tracking federal agency policies on article and data sharing. A wide range of policy requirements are covered: implementation timelines, exclusions on sharing certain types of data, when and how data are to be shared, data management planning, data preservation, compliance enforcement, and more. In addition to providing information on the current agency requirements, the resource is also “looking forward” by providing any known information about anticipated policy developments. The tool will be updated when new policies are released and when current plans are revised. This is an excellent resource for librarians and one that can easily be shared with researchers and other stakeholders.
Figshare has shared the results of a survey of 2,000 researchers that aimed to assess the global landscape around open data and sharing practices. “The State of Open Data” highlights researchers’ awareness of open data and their perspective on making their own research data open. The report also includes contributed articles from leading professionals in the open data space that cover the state of open data across the globe. While awareness, use, and sharing of open data among researchers were high, the report identifies areas for improvement in researcher knowledge that librarians can help to fill, including understanding data licensing, compliance with funder policies, and data citation. See this accompanying blog post for all of the key findings of the report.
“Open Data Transition Report: An Action Plan for the Next Administration,” written under the leadership of the Center for Open Data Enterprise, outlines 27 recommendations for the incoming Presidential Administration to consider to keep the momentum of open data moving forward to the benefit of the government, its citizens, researchers, and businesses. The report presents the administration with priority goals and concrete recommendations to achieve these goals. Goal three of the report (beginning on page 46) is to “share scientific research data to spur innovation and scientific discovery.” The recommendations within this section include establishing a new “Federal Research Data Council” to serve as a support structure for increased collaboration in government and research institutions around open data, open science, and open research; developing an Annual Research Data Census for federally funded research; and working with stakeholders across the research enterprise to establish new incentives for data sharing. This is an exciting development for open data, as it is the very first time an incoming Administration has been presented with government-wide recommendations on open data.
Submitted by Laura Palumbo (email@example.com), Chemistry & Physics Librarian and Science Data Specialist at Rutgers University Libraries in New Brunswick, NJ, on behalf of Dr. Cathy Lawson.
Thank you to Cathy Lawson for being our guest blogger for this post. Dr. Lawson is an Associate Professor in the department of Chemistry and Chemical Biology at Rutgers University in New Brunswick, NJ. In her research she “explores the diverse landscape of biological structure-function relationships, with the goal of improving our fundamental understanding of life processes.” I had the good fortune to audit the course she describes below.
In late 2014 the NIH Big Data 2K (BD2K) initiative put out a call for proposals to create open educational resources for sharing, annotating, and curating Biomedical Big Data. My colleagues and I at the Research Collaboratory for Structural Bioinformatics (rcsb.org) realized that the call presented a unique and exciting opportunity to educate people about the work that we do with passion every single day: enabling public, worldwide availability and re-use of the thousands of complex and beautiful molecular structures produced every year by the structural biology community. The RCSB, along with international partners, manages the Protein Data Bank (PDB), the now 45-year old worldwide repository for experimentally determined 3D structures of proteins, nucleic acids and macromolecular complexes, and also manages or co-manages several related archives.
Our proposal was well received and the two-year curriculum development project “Enabling Data Science in Biology” was born (edsb.rcsb.org). We thus began synthesizing an eight module course centered around our “data pipeline” (data creation, deposition, curation, archiving, distribution), drawing on the expertise and many years of experience of our assembled faculty team. The lectures, exercises, and assignments were first delivered as a Spring 2016 graduate course at Rutgers. Professor Helen Berman, who directed the RCSB PDB between 1999 and 2014 and is now Director Emerita, took the lead in the curriculum development and first pilot delivery.
The ultimate goal of the project is to create a set of openly accessible curricular materials that can be repurposed by other scientists, information scientists/librarians, and educators. This effort supports the BD2K mission, enabling training of a new cadre of scientists with knowledge and skills in data archive management and data sharing. We are currently refining the content and will be producing videos in preparation for our second “pilot” in Spring 2017. We also plan to create and deliver these materials in a Massive Open Online Course (MOOC) by the end of next summer.
Posted by Portal Editor Julie Goldman, eScience Coordinator, National Network of Libraries of Medicine, New England Region, University of Massachusetts Medical School, firstname.lastname@example.org
Conference Recap: Imagine! Innovate! Inspire! at NAHSL 2016
Ladies and gentlemen, step right up and see the greatest show on Earth!
If you were looking for a show with all the bells and whistles, NAHSL 2016 had you covered. From ring masters to magicians, flying trapeze artists to lion tamers, there were main events and sideshows for all! And if you weren’t inspired yet, you could create a portrait and imagine another time in the old-fashioned photo booth.
NAHSL 2016 was held in New Haven, at the distinguished Omni Hotel at Yale University.
Lightning rounds, posters, and spotlight sessions highlighted the work of library performers from the New England region. Their work shared imaginative and innovative ideas for transforming the library space and library instruction. Hopefully other conference attendees were inspired to try new things and seek new collaborations.
Here is a recap on the Main Events: The 3-Ringed Circus
Jack Hughes, M.D.
Biomedical Ethics, Yale University
Access, Costs, and Quality: The Iron Triangle of Healthcare
Jack had attendees interacting and imagining a world where healthcare was universally available. He broken down healthcare coverage in the US, which is mostly subsidized private group insurance, and also shared Anti-Obamacare Ads, which were an early Halloween surprise!
Susan Keane Baker, M.H.A.
Patient and Customer Experience Expert & Consultant
Speed Stuns and Other Customer Service Tactics to Differentiate your Library
Susan made attendees think about our interactions and inspired us to “be nice first!” Susan shared real life examples of how libraries can interact positively online with social media, and also how librarians constantly need to refine our customer service skills!
“If you’re gonna do it anyway, be nice about it” – Susan Keane Baker
Check out her pocket guide: Split-Second Kindness: making a difference when time is limited
MIT Laboratory for Social Machines
Data for Humans: A Healthier Revolution
William shared innovative technologies to solve social problems and encouraged attendees to do regular “digital detox.” William makes the case that although print publications and paper are falling out of favor with the rise of digital technology, paper does still perform some tasks better. Powers predicts that paper will not be entirely given up as a medium, and even argues that even novel technologies are striving to be more, not less, like paper.
Read his paper Hamlet’s Blackberry: Why Paper is Eternal
And read the follow up book: Hamlet’s Blackberry: A Practical Philosophy for Building a Good Life in the Digital Age
In the end, the conference attendees were left inspired, ready to imagine and do something innovative! Thank you NAHSL 2016!
NAHSL 2017 will be in Waltham, MA.
Be sure to follow the NAHSL 2017 twitter account to stay up to date.
We are very excited to bring the ACRL RDM Roadshow one-day workshop, “Building your Research Data Management Toolkit: Integrating RDM into Your Liaison Work,” to the New England area!
Where: Boston University, Photonics Center, Colloquium Room, Boston, MA 02215
When: Friday, January 13th 2017
Research data management has emerged as a need among academic researchers and liaisons are building skills in response. This one-day workshop will assist liaisons to identify their existing skills and mindsets that transfer to research data management services and then create a learning plan for the RDM specific knowledge needed to serve their subject disciplines. Tools, hints, and tricks will be shared that facilitate partnerships on campus with disciplinary faculty and with other RDM service providers.
Abigail Goben, MLS, Assistant Professor and Liaison Librarian, University of Illinois at Chicago
Abigail is an Assistant Professor and Liaison Librarian at the University of Illinois at Chicago Library of the Health Sciences, where she serves as liaison to the College of Dentistry and collaborates with faculty on research data management. Her research focuses on institutional data policies; educational needs in research data management; and open access. Goben is engaged with efforts on campus surrounding research data management and data governance for the University of Illinois Hospital and Health Sciences System. Beyond UIC, Goben is active with the Library Information Technology Association and Association for College and Research Libraries.
Megan Sapp Nelson, MLS, Associate Professor of Library Science and Liaison Librarian, Purdue University
Megan is an Associate Professor of Library Sciences and Liaison Librarian at Purdue University, where she serves as liaison to Civil Engineering and related engineering and technology disciplines and Earth, Atmospheric and Planetary Sciences. Her research focuses on data information literacy; teaching professional skills to engineering students; and integration of information literacy and data information literacy within the disciplinary curricula. Sapp Nelson is involved in campus efforts surrounding data information literacy at the undergraduate and graduate levels. Beyond Purdue, Sapp Nelson is active in American Society of Engineering Education’s Engineering Libraries Division. Learn more about Megan in her ACRL Member of the Week profile on ACRL Insider.
Workshop participants will be able to:
- Identify data within the research process and lifecycle in order to articulate the role of the libraries in the management of data to researchers
- Learn how to develop expertise in the nuances of disciplinary requirements for data management in order to educate their faculty and students about data best practices for their discipline
- Articulate specific existing skills that they already possess as librarians which transfer to data services in order to begin building a toolkit of research data management skills
- Identify campus partners in research data management in order to create an environment of research data management support for their faculty
- Articulate the parts of a data management plan in order to describe its role as a living document within a research project
- Apply their relevant prior knowledge of their disciplines in order to create a research data management interview plan in order to facilitate faculty engagement
Who Should Attend
This workshop is intended for liaisons who are seeking to engage with research data management for the first time, or who have a very basic knowledge of research data management. This workshop is an introductory level experience. Attendees are not expected to have previous experience with research data management. The primary audience is subject liaison librarians, secondary audience include senior library administrators, middle management and department heads, and technical services librarians and staff. Other campus partners such as Office of Research, Sponsored Programs, Technology Transfer, IRB, or campus IT may be interested but would be a tertiary audience.
For more information about the ACRL RDM Roadshow, please visit:
In addition to designing and presenting the workshop, Abigail and Megan have designed a research project using surveys to investigate future innovations in professional development for research data management. Participants of the workshop will also have the opportunity to participate in a survey as part of their research on further professional development needs.
Registration is capped at 100 participants, so please register now!
Posted on behalf of Anna Newman, Open Access Specialist, Mugar Memorial Library, Boston University, email@example.com
On November 9, the BU Libraries will be hosting OpenCon 2016 Boston, a satellite event of OpenCon 2016, a global conference about open access, open data, and open education.
OpenCon Boston will feature an update on Open Data to Open Knowledge, a partnership between the City of Boston and the Boston Public Library to make open government data an accessible public resource. Other sessions will include a discussion of information justice and lightning talks on open research tools and approaches. We’re also partnering with the BU Study Group to offer an evening open science/data presentation.
OpenCon Boston is open to all; if you are interested in attending any or all of the sessions, please register at is.gd/openconboston. This is a great opportunity to meet others in the Boston area who are interested in open access and to learn more about open access issues.
Wednesday, November 9, 1:00-4:00pm, Mugar Memorial Library
Welcome & Intro / 1:00-1:15pm
From “Open” to Justice / 1:15-2:00pm
Screening of a presentation by Audrey Watters from OpenCon 2014, followed by a discussion of information justice.
Update on the City of Boston’s Open Data to Open Knowledge Project / 2-2:45pm
An update on Open Data to Open Knowledge, the City of Boston’s Knight Foundation-funded initiative to make open government data an accessible resource.
Open Research Lightning Talks / 3:00-4:00pm
Tentatively scheduled talks include an introduction to ORCID, how and why to make your work available in an institutional repository, a case study of open research, and an overview of open access mandates for grant funding.
BU Study Group Open Science/Data Presentation / 6-7:30pm
With the BU Study Group, we’ll be sponsoring an optional evening presentation/demonstration on an open science/data topic TBD.
Questions about the event can be directed to Anna Newman:
firstname.lastname@example.org or 617-358-8563.
For more information see http://guides.library.umass.edu/RDMR.
Location: University of New Hampshire, Durham, NH
Morning Presentations & Roundtables:
The 1925 Room, Elliott Alumni Center
9 Edgewood Road, Durham, NH 03824
Afternoon Data Visualization Research Lab Tour:
Jere A. Chase Ocean Engineering Lab
Center for Coastal & Ocean Mapping/Joint Hydrographic Center
24 Colovos Road, Durham, NH 0382
Participants may plan to arrive for registration at 9:30 am.
Morning and afternoon refreshments will be provided.
Morning Event: Data Visualization and the Library (10:00 am – 12:00 pm)
“Two Perspectives from Library Collaborators“
Speaker 1: Steve Braun, Northeastern University
Steven Braun is the Data Analytics and Visualization Specialist in the Northeastern University Libraries Digital Scholarship Group. In this role, Steven holds weekly drop-in office hours and schedules consultations to provide help to students, faculty, and staff on any matters related to data analysis or visualization, delivers public and custom course workshops on data and visualization best practices, and works on internal and external visualization projects across many disciplines. Steven has an M.S. in molecular biophysics and biochemistry from Yale University, and a BA in chemistry and Asian studies from St. Olaf College.
Speaker 2: Patrick Rashleigh, Brown University
Patrick Rashleigh is the Data Visualization Coordinator at the Brown University Library, where he engages in the production and dissemination of visual modes of scholarly communication and analysis. In practice, this means working with faculty, students, and colleagues on visually-oriented projects and holding instruction sessions on visualization tools and techniques. It also involves coordinating and supporting innovative applications of the Sidney E. Frank Digital Studio with its display wall, full color 3D printer, and video/audio recording studio. Patrick has an MA in Ethnomusicology from York University, and a BA in English Literature from the University of British Columbia.
Afternoon Tour: Data Visualization Research Lab (12:15 pm – 1:00 pm)
The Data Visualization Research Lab (DVRL) has the goal of carrying out research into advanced interactive visualization. The lab is has a special interest in techniques that can be applied to ocean mapping and ocean technologies in general. There are three broad strategies of research carried out: The Science of Data Visualization, Tool Building, and Visualizations for Education and Outreach. We will tour their facility which is located in the UNH Center for Coastal & Ocean Mapping/Joint Hydrographic Center. This will be an opportunity to see how an institution is providing data vis services to the campus outside of the library.
Lunch 1:00 pm – 2:00 pm
Participants will have lunch on their own. Options include campus dining locations or at another off-campus restaurant (for suggestions). Please be aware that all these options require walking, so participants may want to bring their own lunch.
Afternoon Event: The 4th Research Data Management Roundtable (2:00 pm – 4:30 pm)
Topic: “Event Planning and Outreach”
This is the fourth in a series of informal roundtable discussions on specific research data management topics. In this set of discussions, participants will focus on event planning and outreach related to marketing planned events. We’ll discuss what types of events participants are organizing, who libraries are partnering with, how events are promoted, and much more!
We encourage all to come and participate! Haven’t planned an RDM event yet? Not sure where to start? Come get ideas! Have lots of experience with planning events? We want to hear from you! Come share your flyers, materials, or lessons learned from your efforts!
UNH is located off of NH Route 4, easily accessible from I-95 and Route 101.
An Amtrak station is located directly on the UNH campus. Take the Downeaster!
Please note parking is $10 and there will be a dedicated lot to park in. If you plan to drive, please consider carpooling. You must indicate if you are driving on the registration form.
More information to come!
For questions about registration, contact Tom Hohenstein (email@example.com).
All events are sponsored by the NN/LM NER and University of New Hampshire.
For complete information see http://guides.library.umass.edu/RDMR.
Submitted by e-Science Portal Editor Leah Honor, Education and Clinical Services Librarian, Lamar Soutter Library, University of Massachusetts Medical School, Leah.Honor@umassmed.edu
A few weeks ago I began attending weekly group viewings of the BD2K Guide to the Fundamentals of Data Science. While we are only two sessions in, and still doing a lot of high level review, the series promises to delve in greater detail into many of aspects of big data, including indexing, management, analysis, and modeling.
The first session, an introduction to the series and the topic in general, was presented by Dr. Mark Munsen – Professor of Biomedical Informatics at Stanford University, and reviewed many of the common themes heard when talking about big data: the data lifecycle, the emergence of big data and what that has meant for biomedical research and clinical practice, and the need for management and preservation of all this data. I was struck by his discussion on the idea of open data, which is often held up as a worthy goal, and one promoted by everyone from librarians to national funding agencies, being overly simplistic. Just because data is open in the sense that it is available, does not mean it is FAIR: findable, accessible, interoperable, and reusable. Having data shared in proprietary formats, or without sufficient contextual information to make it understandable, may meet the technical requirements of ‘open’ but it is in no way fair. Dr. Munsen also brought up a few well known examples of big data and data management gone wrong: the Mars Climate Orbiter, having been built and programmed using the metric system for measurements, crashes after it is unable to decode directions based on the English system; the 2012 Nature commentary by Begly and Ellis exposing the dismal rates of reproducibility on so called landmark studies1.
The second speaker was Dr. William Hersch from Oregon Health & Science University, introducing the first topic – Data Indexing and Retrieval, which will be the focus of the next 5 sessions. While still a rather broad overview, from this you can see how the series will continuously tighten the focus on the topics of interest, presenting various viewpoints as specialists from a broad range of disciplines are brought in to cover their areas of expertise. Dr. Hersch went over metadata and versioning control, indexing vocabularies and ontologies, provenance, and search vs. retrieval; all of which are topics of sessions to come.
The series continues every Friday at 12pm EST, and you can view it live, or catch up on any sessions after the fact on the series website: http://www.bigdatau.org/data-science-seminars
1Begley, C. Glenn, and Lee M. Ellis. “Drug development: Raise standards for preclinical cancer research.” Nature 483.7391 (2012): 531-533. http://dx.doi.org/10.1038/483531a
The e-Science Portal for New England Librarians has a new group of librarians taking over the editorial duties for both the portal and the Community Blog! The Editorial Board works in collaboration to provide and manage information that meet the scope and purpose of the Portal for Librarians:
Portal Scope Statement: The e-Science portal is a resource for librarians, library students, information professionals, and interested individuals to learn about and discuss:
- Library roles in e-Science
- Fundamentals of domain sciences
- Emerging trends in supporting networked scientific research
The e-Science portal is intended for librarians and individuals interested in how research and academic organizations generate, share, store and/or use data for scientific research, with an emphasis on research in the health, biological, and physical sciences.
e-Science Program Coordinator
NN/LM New England Region, University of Massachusetts Medical School
Content Sections: About, Connect with Others, Professional Education
Julie Goldman, BS, MLIS, is the eScience Coordinator for the National Network of Libraries of Medicine, New England Region. Julie started at the University of Massachusetts Medical School as a Library Fellow in the Lamar Soutter Library. She is actively working on resources and events for e-science librarians. Julie is the Managing Editor for the Journal of eScience Librarianship and also served on the teaching staff for “Scientific Research Data Management,” a course in the LIS curriculum at Simmons College in Boston. Julie is also Co-PI on a NIH BD2K grant to develop a massive open online course (MOOC) for biomedical research data management. Julie earned her Masters in Library and Information Science from Simmons College and a Bachelor’s of Science in Marine Biology from the University of New Hampshire.
Education and Clinical Services Librarian/Informationist
University of Massachusetts Medical School
Content Section: Research Environment
Leah Honor is an Education and Clinical Services Librarian at the Lamar Soutter Library of the University of Massachusetts Medical School. She works closely with the graduate schools of medicine, nursing, and biological sciences, as well as basic science researchers and clinicians, providing reference, research, and instruction services. She acts as an Informationist liaison to the Child and Adolescent NeuroDevelopment Initiative, researching shared data identification, reuse, and citation standards. She received her Masters of Library and Information Science at Pratt Institute.
Scholarly Communication Outreach Librarian
Virginia Commonwealth University
Content Section: Data Literacy
Hillary Miller is the Scholarly Communications Outreach Librarian at Virginia Commonwealth University where she provides guidance, delivers instruction, and implements a program of outreach initiatives relating to the creation, use, and dissemination of scholarly works. Her research areas include the evolving digital contexts for scholarship, research, and creative expression; social cognitive theory-based approaches to understanding copyright behavior; and legal and social issues surrounding the use of information. She received her Masters of Science in Library Science at The University of North Carolina at Chapel Hill.
Chemistry & Physics Librarian/Science Data Specialist
Content Sections: Physical Sciences & Engineering, Data Management Co-Editor
Laura Palumbo is the Chemistry & Physics Librarian and Science Data Specialist at Rutgers University Libraries in New Brunswick. She liaises with departments in the physical sciences, and provides research assistance and instruction in information literacy and critical thinking, as well as in research data management. She coordinates efforts, identifies opportunities, and develops services for the access and preservation of data for the Rutgers New Brunswick scientific research community. She conducts workshops with campus partners on various aspects of research data sharing and reuse, bringing together faculty and students from departments across the New Brunswick campuses. She is active in several professional organizations, and serves as the Assistant Editor of Practical Academic Librarianship, an international open access journal publication of the Special Libraries Association. Prior to her work in libraries, she was a civil engineer in government agencies and consulting firms.
University of Arizona
Content Sections: Life Sciences & Open Science, Data Management Co-Editor
Ahlam Saleh is the Research Librarian at the University of Arizona Health Sciences Library where she provides assistance and training to students, staff, faculty, and researchers on literature searches, resource selection, reference management, and NIH Public Access Policy compliance. She also helps to coordinate bringing other library service programs such as scholarly communications and data management, to the health sciences campus community. Ahlam obtained here master’s degree in library science from SUNY Buffalo where she also obtained her medical degree. Prior to the University of Arizona, Ahlam served as a Reference Librarian for eight years at the University of Pittsburgh Health Sciences Library System.
Please welcome this great group of science librarians, and look for their postings
here on the e-Science Community Blog!
The good, The bad, and The…
A DataCure Webinar Series
Please join our panel of presenters in this exciting two-part webinar series to hear about how academic libraries are taking the lead in developing cross-campus collaborations in establishing research data committees to spearhead institutional efforts related to data stewardship and digital projects. This interactive session will lead participants through the various steps needed in order to initiate a similar effort within their institutional context.
Amy Koshoffer, Science Informationist, UC Libraries, University of Cincinnati
Amy Koshoffer holds degrees in Ecology, Ethology and Evolution from University of Illinois, Library and Information Science from Kent State University and Mathematics from University of Cincinnati. Amy worked as a senior research assistant in the UC College of Medicine in the Departments of Cancer Biology and most recently Dermatology. Her research group focused on understanding melanocyte biology especially the mechanisms that trigger vitiligo and other hypopigmentary disorders. Amy is currently the Science Informationist for UCLibraries where her work focuses on providing data manageme nt support for researchers and developing new research data services.
Renaine Julian, Data Research Librarian, Florida State University Libraries
Renaine Julian is the STEM Data & Research Librarian at Florida State University. In this role, he works with researchers to help find, use, evaluate, and manage research data. Renaine also serves as the subject librarian for the FSU/FAMU College of Engineering, Earth, Ocean and Atmospheric Science as well as Urban & Regional Planning. He holds a MLIS, a MS in Urban and Regional Planning as well as a BS in Political Science, all from Florida State University. Renaine’s interests include: research data management, scholarly communication, and citizen science.
Cinthya Ippoliti, Associate Dean for Research and Learning Services, Oklahoma State University
Cinthya Ippoliti is the Associate Dean for Research and Learning Services at Oklahoma State University where she provides administrative leadership for the library’s academic liaison program as well as services for undergraduate and graduate students. Previously, she was Head of Teaching and Learning Services at the University of Maryland Libraries where she was in charge of the spaces, services, and programming offered by the Terrapin Learning Commons in addition to coordinating the libraries’ First Year instruction program.
Betty Rozum, Data Services Coordinator & Undergraduate Research Librarian, Utah State University
Betty Rozum is the Data Services Coordinator and Undergraduate Research Librarian at Utah State University, a position she has held since its creation in July 2015. Her position is responsible for coordinating efforts within the USU Office of Research and Graduate Studies, USU Information Technology Office, and the Library to manage research data and publication deposits and help ensure compliance with federal mandates. Part of this role includes coordinating the campus wide data committee. Along with these tasks, Betty serves as the campus resource for data management, providing assistance and education about best practices. Prior to this position, Betty was the Associate Dean for Technical Services at Utah State University Libraries for 14 years.
Christine Kollen, Data Curation Librarian, Office of Digital Innovation and Stewardship (ODIS), University of Arizona
Chris Kollen is the Data Curation Librarian at the University of Arizona (UA) Libraries in the Office of Digital Innovation and Stewardship. She leads the University of Arizona’s efforts in providing data management support for researchers and graduate students. She is chair of the Campus Data Management and Curation Committee and the project manager for the Data Management and Data Curation Pilot project. She also leads the Library’s GIS and geospatial data services and is the project manager for the Spatial Data Explorer, the Library’s geospatial data portal.
David Minor, Program Director for Research Data Curation, UC San Diego
David Minor is the Director of the Research Data Curation Program in the UC San Diego Library. In this role he helps define and lead work needed for the contemporary and long-term management digital resources. His position includes significant interaction with stakeholders on the UC San Diego campus, throughout the UC System, and national initiatives. His program also includes management of Chronopolis, a national-scale digital preservation network that originated with funds from the Library of Congress’ NDIIPP Program. Chronopolis is also a founding partner in the Digital Preservation Network (DPN), helping set a new national digital preservation agenda. David received his BA in philosophy from Carleton College and MLS from the University of Wisconsin-Madison.
Part 1: Building bridges and planting seeds
Tuesday, December 6th 2016, 12 EST | 90 minutes
- Focus on early-stage efforts to build partnerships and bring key stakeholders to the table
- Acquire practical tips for addressing the institutional challenges involved in developing a campus-wide data committee
- Identify key individuals who could be instrumental in establishing a campus-wide data committee
Part 2: Launching your collaboration
Wednesday, December 7th 2016, 12EST | 90 minutes
- Identify best practices and lessons learned for working with established committees who are working on broad-scale projects and programs
- Evaluate different institutional models to compare and customize for different academic environments
- Analyze best practices strategies for successful project management, collaboration, and program development for established committees
This webinar series is designed to be a two-part class.
Please register for both sessions!
Getting Started with Statistics for Librarians
Join us at the Lamar Soutter Library at the University of Massachusetts Medical School in Worcester, MA to view the webinar: Thursday, September 22nd 2016 10:30am-12:00pm EST
Does the term “p value” make you anxious when you read research articles? How high is your confidence level in interpreting research data? As librarians interact with more research through efforts with evidence-based practice and systematic reviews, they have a growing need for understanding statistics. This confidence-building course will help you be more comfortable with statistical concept for reading research findings and interpreting data.
- Understand basic statistical terms such as standard deviation, t test, p value, and so on
- Identify test measures used in published research studies
- Interpret the data, formulas, and graphs used in the research findings
Participants will earn 1.5 MLA continuing education (CE) contact hours.
- Examine sample research papers and their data analysis findings. Highlight statistical terms that will be discussed.
- Define the statistical concepts.
- Use interactive exercises to verify understanding. For example, to examine different research studies and decide which test measure was used.
- Reexamine the research papers and data to further reinforce the concept.
Jin Wu is an emerging technologies librarian at the University of Southern California (USC) Norris Medical Library. In this role, she tracks, investigates, and uses data proactively to address emerging trends that improve library services and promote library resources. Wu has extensive experience using various tools to gather and apply user information to make data-driven strategy and planning decisions. She has taught MLA continuing education courses on statistics and data analysis a few times in the past years. The classes were well attended and scored very well on evaluations. Wu is active in professional organizations and has given multiple presentations at national and local conferences. She is also a master’s of business administration candidate at USC Marshall School of Business.
Reminder: participants are registering for a free viewing the webinar at UMass Medical School in Worcester, MA. This is not a traditional webinar accessible from anywhere!
Demystifying R: An Introduction for Librarians MLA Webinar
Join us at the Lamar Soutter Library at the University of Massachusetts Medical School in Worcester, MA to view the webinar: Wednesday, October 5th 2016 2:00-3:30pm EST
If you work with researchers or stay up-to-date on “big data” or data science, you have probably heard of R. But what exactly is it, and why should librarians learn it? This webinar will help demystify this popular programming language and provide some real-world examples of how it can help librarians in their daily work. Whether it’s for assisting patrons with their research data or working with your own library data, R can be a useful skill to add to your librarian toolboxes. This webinar will provide an introduction to R, including how it can be used for data processing, visualization, and analysis of a variety of different types of data. We will also discuss some key terminology and concepts to get you started and provide you with resources for learning more about R.
- What the R programming language is and some of its key features
- Some key terminology and a basic understanding of how R works
- Some uses for that R may be a good solution for your data needs, including data processing and management, visualization, and statistical analysis
- How R can be useful for working with research data, as well as with library data, including bibliometric data, library statistics, or budget data
- Where to find free resources for learning R
Participants will earn 1.5 MLA continuing education (CE) contact hours.
Lisa Federer, AHIP, currently serves as research data informationist at the National Institutes of Health Library, where she provides training and support in the management, organization, sharing, and reuse of biomedical research data. She is the author of several peer-reviewed articles and the editor of the forthcoming Medical Library Association Guide to Data Management for Librarians. An active member of MLA, she currently chairs the Medical Informatics Section and the Lucretia W. McClure Excellence in Education Award Jury. She holds a master’s of library and information science from the University of California–Los Angeles and graduate certificates in data science (Georgetown University) and data visualization (New York University), and she is a doctoral student at the University of Maryland.
Reminder: participants are registering for a free viewing the webinar at UMass Medical School in Worcester, MA. This is not a traditional webinar accessible from anywhere!
Posted on behalf of Laura Pavlech, DVM, MSLS, Research & Instruction Librarian, Hirsh Health Sciences Library, Tufts University
Librarian for Research Data – Tisch Library at Tufts University
The Tisch Library at Tufts University in Medford, Massachusetts is seeking a Librarian for Research Data.
Tisch Library supports Tufts’ School of Arts & Sciences and School of Engineering (AS&E), providing services and resources to assist members of the Tufts community in their teaching, learning and research. Tisch Library provides collections, expertise, and technology-rich environments to support the creation and dissemination of scholarship.
The Scholarly Communications & Collections department of Tisch Library supports the mission of the university by engaging with the long history of scholarly communication from rare books to research data, and by recognizing how different formats, different modes of communicating information, and different disciplinary practices impact the transfer and development of knowledge over time. The department makes strategic decisions about how best to provide access and, where appropriate, long-term preservation, to the collections needed to support the teaching and research excellence of AS&E.
Tisch Library seeks an enthusiastic and knowledgeable Librarian for Research Data (Assistant Librarian/Associate Librarian) to lead Tisch outreach around research data and related data management services for the AS&E community. This individual will work with other library and campus partners (e.g. research administration) to adapt, design, and develop services that enable faculty and students to preserve, share, and curate their research data.
Please see the posting for a complete description of the position: http://tufts.taleo.net/careersection/ext/jobdetail.ftl?job=16001555&lang=en
Posted on behalf of Kevin Reed, Knowledge Management Librarian and Alisa Surkis, Head/Translational Science Librarian, NYU Health Sciences Library, NYU School of Medicine
We would like to request your participation in piloting research data management education materials for medical librarians. We are currently funded by a grant from the Big Data to Knowledge Initiative at NIH to develop a curriculum for medical librarians to facilitate their teaching research data management at their own institutions. There are two components to the training materials:
Part 1: Seven online modules (approximately three hours of content) designed to teach medical librarians about the practice and culture of research and best practices in research data management.
- To take the modules, you must first register into our Compass Learning System: http://compass.iime.cloud/accounts/register/ (you will receive an email requesting approval after you’ve registered)
- Once you have registered, the modules can be found here (you will need to be signed in to take the modules): http://compass.iime.cloud/mix/G3X5E/
Part 2: A teaching toolkit including slides, scripts, and evaluation materials to teach an in-person introductory research data management class for researchers at your institution.
We are currently seeking participants to pilot part 1. Following that, we will seek out a subset of participants with whom to pilot part 2, which will involve structured observations of classes taught by the librarians at their institutions. All participants in piloting part 1 will be given access to the materials in part 2, regardless of whether or not they are part of the piloting of those materials.
My colleague, Alisa Surkis, and I have been teaching research data management to our fellow medical librarians at the past three MLA annual meetings, based on our own experiences in providing research data management services at NYU School of Medicine. We hope that the materials we have created here will make the core elements of that class more broadly available to facilitate the teaching of research data management at medical libraries across the United States.
If you intend to take these modules, please contact Kevin Read at firstname.lastname@example.org or Alisa Surkis at email@example.com to confirm your participation. You do not need to await a reply from us to begin taking the modules. We are also available to answer your questions at any time.
An announcement from the BD2K Training Coordinating Center
John Darrell Van Horn, M.Eng., Ph.D.
Associate Professor of Neurology
USC Mark and Mary Stevens Neuroimaging and Informatics Institute
2001 North Soto Street, SSB1-102
Los Angeles, California 90032
Email: firstname.lastname@example.org, email@example.com
Please look forward to a weekly webinar-based lecture series entitled the . This is a series of high-level didactic overviews across the range of topics important for data science, intended to provide a general biomedical audience with an appreciation of the elemental issues related to data science research and applications.
The series will be held each Friday, 12pm EST (9am PST) beginning September 9th, 2016.
Please join from your computer, tablet or smartphone: https://global.gotomeeting.com/join/786506213
You may also dial in using your phone.
United States : +1 (872) 240-3311
Access Code: 786-506-213
Registration is not required. Bookmark the webinar link for easy access to our weekly event!
Our initial set of confirmed data science lecturers includes: Mark Musen (Stanford), William Hersh (Oregon Health Sciences), Lucila Ohno-Machado (UCSD), Michel Dumontier (Stanford), Zachary Ives (Penn), Suzanne Sansone (Oxford), Chaitan Baru (NSF), Brian Caffo (Johns Hopkins), and Naomi Elhadad (Columbia).
This series is sponsored by the NIH Office of the Associate Director for Data Science, the Big Data to Knowledge (BD2K) Training Coordination Center, and the BD2K Centers Coordination Center.
A dedicated webpage with additional information, the complete schedule of speakers, and a collection of all the recorded lectures is forthcoming and will be available shortly.
Introduction to big data and the data lifecycle
Section 1: Data Management Overview
- Finding and accessing data sets, Indexing and Identifiers
- Data curation and Version control
- Metadata standards
Section 2: Data Representation Overview
- Databases and data warehouses, Data: structures, types, integrations
- Social networking data
- Data wrangling, normalization, pre-processing
- Exploratory Data Analysis
- Natural Language Processing
Section 3: Computing Overview
- Programming and software engineering; API; optimization
- Cloud, Parallel, Distributed Computing, and High Performance Computing
- Commons: lessons learned, current state
Section 4: Data Modeling and Inference Overview
- Smoothing, Unsupervised Learning/Clustering/Density Estimation
- Supervised Learning/prediction/Machine Learning, dimensionality reduction
- Algorithms and their Optimization
- Multiple hypothesis testing, False Discovery Rate
- Data issues: Bias, Confounding, and Missing data
- Causal inference
- Data Visualization tools and communication
- Modeling Synthesis
Section 5: Additional topics
- Open science
- Data sharing (including social obstacles)
- Ethical Issues
- Extra considerations/limitations for clinical data
- Summary and NIH context
Section 6: Specific examples
Share this announcement with others: students, staff, and colleagues are all invited!