The Tisch Library at Tufts University in Medford/Somerville, Massachusetts is seeking a Librarian for Research Data. Please see the posting for a complete description of the position: http://tufts.taleo.net/careersection/ext/jobdetail.ftl?job=15001602&lang=en.
Contributed by Donna Kafel, Project Coordinator for the New England e-Science Program, Donna.Kafel@umassmed.edu
Andrew Johnson is the Research Data Librarian at the University of Colorado, Boulder, and PI for DataQ, “a collaborative platform for answering research data questions in academic libraries,” that was recently launched in August. DataQ is a unique resource in that it provides a platform where librarians can submit research data management and curation questions which in turn are fielded by the Editorial Team and answered by a DataQ Editor. DataQ is meant to be interactive–community members who have created a DataQ log-in account, are welcome to add to the answers or post comments.
DataQ is funded by an IMLS Sparks! Ignition Grant for Libraries and co-sponsored by University of Colorado Boulder, the Greater Western Library Alliance, and the Great Plains Network.
I spoke with Andrew over the phone recently to learn more about the DataQ project that he and co-PI Megan Bresnahan have led since they were awarded the IMLS Sparks! Ignition grant in November 2014. Much of our discussion revolved around project management aspects of the DataQ grant. Here is an outline of our conversation:
Donna: How did you come up with the idea for DataQ?
Andrew: Megan actually came up with the idea while we were attending RDAP in Baltimore a few years ago. We were trying to think of ways that we could extend local support for librarians engaging in RDM services to the wider community, so she thought that a service like DataQ could be one way to do that. Prior to DataQ, I’d been active in the DataFOUR project (http://imls.gwla.org/), which was sponsored by the GWLA and GPN. The idea for the DataQ grant snowballed from DataFOUR and its aim to provide regional support for developing RDM services. Megan and I applied for the IMLS funding for DataQ with the support of GWLA and GPN, and of course our library administration at UC Boulder. In September we were awarded an IMLS Sparks! one year grant, from Nov. 1, 2014-October 31, 2015, to develop DataQ.
Donna: Can you explain the GWLA and GPN groups? Are they consortia?
Andrew: GWLA is a consortium of research libraries in the central and western United States. GPN was founded by researchers and is a consortium of Midwestern universities focused on cyberinfrastructure initiatives. The two groups collaborate on different projects, and host their annual meetings in conjunction.
Donna: I’m really impressed that in the course of a one year planning grant, you’ve pulled together such a large working group of Editors and launched DataQ –all well within the 12 month timeframe. Can you describe a bit about the project timeline and your working model?
Andrew: Yes, there were a lot of pieces to put together to make DataQ happen. We had a $25,000 budget to work with and a relatively short time to get the project up and running. In the first months we contracted with Drupal developers to create the site. In December we put out a Call for Editors. We were surprised by the overwhelming response to the Call. We had budgeted for eight Editors. It was really hard to limit ourselves to eight when so many highly qualified librarians with experience and expertise in data services responded that they were interested in participating in DataQ. Ultimately we were lucky in that we were able to expand the number of Editors from what we had budgeted as a few Editors received support from their institutions to attend our orientation meeting. We were also very lucky to have a separate group of librarians and other information professionals eager to participate in the project. Many of them accepted our invitation to be virtual project volunteers. They helped the project tremendously. In June, DataQ wasn’t ready for prime time, but we wanted to do a pre-launch of it by putting up a sample web form to collect questions from anonymous users. The pre-launch was a way for us to collect questions and populate DataQ with these initial questions prior to the actual launch. The DataQ volunteers helped us to gather many of these questions as well.
Donna: With the Editors being from all different geographic areas, how did you orient them to the project and develop a system for their workflows?
Andrew: We had an in-person training meeting in June that all the Editors attended that was held alongside the GWLA/GPN meeting. The meeting was very productive with all the Editors fully engaged in discussions as we planned the logistics for developing and implementing DataQ. We were able to develop Editorial workflows, establish a system for communication, brainstorm new ideas that went beyond what Megan and I had initially envisioned, and plan the project in the course of the short time we met.
Donna: What is the internal process that takes place when someone submits a question to the site?
Andrew: We have a listserv that includes all the Editors and the PIs. When a question is submitted, it gets sent to the listserv. Editors can then review the question. Any of them can opt to answer it on a first come first serve basis. The Editor who first responds composes an answer on an internal Google doc. We then have two Editors review the answer. Once an answer is approved, the Editor who authored the response posts it on the site.
Donna: Regarding the users who submit questions, are they anonymous?
Andrew: They can opt to be. We offer three options: users can choose to be anonymous, they can send along their e-mail in case the Editors need to get further information from them to answer their questions (and to let them know when an answer has been posted), or users can opt to sign in to get a DataQ user account. Accounts enable users to post comments on the DataQ site. DataQ is intended to be an interactive site. We hope that users will create user accounts and contribute their ideas and comments.
Donna: Are you seeing trends in the types of questions that users are submitting to DataQ?
Andrew: Yes, we’re seeing quite a few questions related to data citation, data documentation, and data sharing.
Donna: What is the sustainability plan for DataQ?
Andrew: That’s what we’re working on now, planning on how the project will move forward after the funding period. We may be applying for further funding to continue the project.
Donna: Will you be presenting DataQ at any national or regional conferences?
Andrew: We’ve been asked to present a few webinars on it which we’re really glad to do. Also a couple of the Editors will be presenting a poster on it at DLF. We hope to also present it at some other conferences in the coming months.
Donna: DataQ has filled a niche—providing expert answers to librarians’ specific RDM questions. Congratulations to you, Megan, and the entire team in getting DataQ up and running—in an amazingly short time!
Submitted by Donna Kafel, Project Coordinator for the New England e-Science Program, email@example.com
The inaugural New England Research Data Management Roundtable was held last Tuesday, August 18th at the Du Bois Library at the University of Massachusetts Amherst campus. This roundtable is the first in a planned series of roundtable discussions targeted for New England librarians who are engaged in research data management services or who want to learn more about data librarianship. Sponsored by the National Network of Libraries of Medicine, New England Region, the NE RDM Roundtables will provide opportunities for New England librarians to compare notes, ask questions, share lessons learned, explore new working models, acquire fresh ideas for their workplaces and develop new partnerships.
This particular Roundtable event was specifically intended for librarians in the RDM Community of Practice, i.e. librarians who are currently actively engaged in planning and/or delivering RDM services. (Note: future NE Roundtables will also be planned for an RDM Community of Interest). It was also preceded by a tour of the Massachusetts Green High Performance Computing Center in Holyoke, MA. Twenty four librarians from multiple institutions, including four of the five University of Massachusetts campuses, University of Connecticut, Boston University, Boston College, Harvard, MIT, University of New Hampshire, Brandeis, Northeastern, Mt. Holyoke, and Drexel University discussed the topic “Organizational structures for research data management services at our institutions.” Attendees were divided into five tables with four to five other attendees per table. At each table a member of the NE Roundtable planning team served as moderator for the discussion. The program was divided into two 45 minute sessions. During the first session, the discussion topic was structures within the library for delivering RDM services. The second session focused on partners on campus that support RDM services. The discussions revolved around specific questions. Time was given between the two sessions and at the end of the second session for each roundtable to report out.
Feedback on the Roundtable event has been quite positive. Attendees have noted that they like the opportunity to hear what their colleagues are doing and to discuss RDM issues, challenges, strengths, and their libraries’ service models. The New England e-Science Program plans to coordinate future Roundtables three times a year. Topics for these roundtables will be based on attendee recommendations.
The following is a summary of questions and bulleted attendee responses and comments from the Roundtable Discussion tables. For Topic 1 questions 3-6 and Topic 2 questions 1-5, the bulleted responses are grouped by theme.
Topic 1: Library Structures for Delivering RDM Services
1. What is the current structure for data management services at your library? What staff is involved and what are their relationships to each other and the work?
- Library director appointed a non-librarian project manager to be DM liaison between Office of Research and the Library. The library is not providing DM services but is incorporating “digital measures”—digitizing faculty CVs for all time (historical)
- Small undergraduate science library just starting out in RDM, no organizational structure yet. Science librarian has been assigned the DM role recently and is learning. He gave presentation about RDM with 2 other librarians to faculty.
- Engineering and Data Services librarian started a year ago. He oversees all aspects of data services in library. Other librarians are involved –science librarian as liaison to science faculty and NE e-Science program, metadata librarian for help with metadata and ontologies, and Systems dept for software support (such as DMP Tool).
- An eScience team made up of three librarians from the Science Library led by one of these three.
- Working group made up of librarians from different disciplines and systems librarian. Most librarians involved are science/engineering, and IT is involved.
- Large research university library has had a DM task force for “way too long.” This includes librarian representatives from special collections, science library, social sciences, library systems, scholarly communications that is coordinated by the director of the science library. Having a DH librarian on team has helped the team not to focus exclusively on STEM fields.
- Private academic health sciences library has DM working group with reps from the library/archives/research labs, postdocs, IT. The group meets 6x/year. The working group would like to hire a data expert to focus on archiving large longitudinal study
- Has Library Data Services Advisory Group, which started 1 ½ years ago. The group is made up of scholarly communications librarian, IR librarian, Associate Library Director, two outreach librarians, Head of Office of Sponsored Research, and representative from Research Computing.
2. How did this service begin and how has it changed over time?
- Service did not begin at small public university until data services librarian started. It has changed with tweaks to the library’s Data Workshop series for faculty, PhD students and some staff, has RDM libguide based on NSF requirements, slightly customized DMP Tool
- Started in 2012 with E-Science Institute, an RDM services working group began last year (includes ~ 12 people—IR librarian, desktop services, 3 dept liaisons (science, soc science, and gov docs), and academic technology, analysis expert
- Started somewhat informally several years ago by three science & engineering librarians who co-created an RDM libguide. Always been more of a collegial staff than a hierarchical one. Some team members are specifically part of the Data and Specialized Services Dept.
- Got started by teaching workshops a while ago, were more successful with grad students than faculty.
- Started since the ARL “Future of Science Librarianship” conference, the library formed a team of subject specialists, scholarly comm. librarian sits in.
3. What strengths does your library have related to data management and how did you fill them?
- Has a dedicated RDM fellow
- Has a dedicated RDM librarian
- University is small enough so that small library team can manage all requests, enough background among library staff to serve most of population. Research population tilts more toward the natural/physical sciences so there are fewer disciplines to keep abreast of.
- Library has expertise in metadata services, building collections, describing information, enabling access. Staff is dedicated to helping faculty/students/staff. The library has a vision to create RDM jobs among the library staff.
- Some capacity for more in-depth consultations
- Focused team approach, specialized knowledge plus shared responsibility
- Library invested in infrastructure to support researchers—e.g. repository, research computing
- Has a Data & Specialized Services department
- Lots of varied expertise in large research university
- Strong IR
- A merged department with IT is very useful as IT people have good ideas about implementing DMPs
- New library administrator has strong RDM background and is committed to growing library RDM services
Perception of Library
- Library has established a good reputation through IR
- Library has existing working relationships with campus constituencies
- Good working relationship with Office of Research
- Developed short “quick bites” RDM introductions instead of long workshops
- Broader committee brings in stakeholders across campus (IT, sponsored programs)
- Getting the word out to the community, having services that resonate with users, built relationships with researchers
- New library administrator has strong RDM background and is committed to growing library RDM services
4. What weaknesses does your library have related to data management and how do you address them?
- Not sure if there are needed RDM services that the library is not aware of
- Struggling with a campus wide lack of cohesive outlook on RDM that makes for confusion
- Defining data management—it means different things to different people
- Haven’t been able to get researchers and students to enroll in library’s RDM courses
- No courage to stop doing what we’ve been doing for 20 yrs—e.g. reference shifts, low level
- Not clear how to avail expertise from the librarians who are outside of the data services team
- Short staffing limits what library can do
- Many liaisons are more focused on collections, don’t see relevance of RDM services or are fearful of change
- No central focused person to head library’s RDM team
- Members of RDM working group can’t dedicate time to work with liaisons
- Lack of RDM policies (common among many institutions)
- Lack an institutional repository or a holding center for data in progress
- Lack of funding
- Trying to initiate new library services as a lower level staff person—need support of library administrators and their involvement in securing campus buy-in
- Difficult to bring researchers together on a Balkanized campus
- Isolation from researcher community that library serves
5. What are your main program elements for data management services in the library and how do you conduct them?
RDM Working Groups (see descriptions of working groups in question 1)
- Developed LibGuide (noted by multiple individuals)
- Data Management Workshop series-an overview of RDM theory and applications—hour long sessions that are held 1-2x/week; LibGuide, DPM tool, Consultation Services by appt.
- DMP Consultations
- RDM team is made up of several librarians who consult on DMPs
- Library offers consultations, workshops, conducts training during Responsible Conduct of Research sessions
- Archiving older data sets, got a CLIR grant for collection of data, archiving a large longitudinal study of child health and clinical data, trying to hire a data person to focus on this.
- Work with Office of Sponsored Research to find out new grants and reach out to PIs
6. What would you like to be doing (as a library and as an individual) related to data management that you are not doing now?
- Have a seat at the table—a place in the formal campus structure where decisions on infrastructure and services are made
- Collaborate with Digital Humanities
- Have contact with research team throughout grant and project cycle
- Create a data IR (one library noted goal to use Dataverse for its data IR)
- Create an infrastructure similar to Purdue where library is the portal and telling the story and IT provides the infrastructure and the Library works closely with Office of Research on compliance
- Track where data is going
- Would like to get more liaison librarians involved with data management
- Have a dedicated librarian who is a focal point for RDM
- Have RDM training incorporated into 1st year grad student requirements
- Conduct RDM training in conjunction with Responsible Conduct of Research training
- Get an RDM course into the curriculum
Topic 2: Collaborations on Campus
1. Who are your current partners on campus?
Uncertainty and problems
- In early stages, trying to learn as much as possible
- Not sure where this is going
- We’re trying to figure out what to do next
- Very do-it-yourself and there are pockets everywhere
- How do services connect when there is no commitment to collaborate?
- Until there is a policy behind it, they will not fund/go further
- Recognize a need for campus-wide “thing” but getting it moving – what do researchers what?
- Always library initiated
- When personnel change, connections change; developing relationships takes time
- Keep liaisons in the loop when working with faculty
- Sometimes faculty don’t come to the library or know of services, many are doing it on their own
- We’re making headway
- With IT and sponsored research, it can be one-sided and difficult at times
- Some campus admins are on board, some are not
Planning and ideas
- Library is the one thinking about this, talking with potential partners
- Services being offered: consultations, education, websites
- Library has back-channel communication with IT staff
- Repository available for some but not all institutions
- Going to create a team
- Putting together meeting of stakeholders on campus
- Partnerships are in their infancy. We want to reach the full community
- Survey on data needs
- We are doing the DMP Tool
- Considering a campus data summit
- Connections from open access policy are useful for data management policy/ practice discussion
- Ongoing discussion about campus infrastructure
- Finding ways to get to the faculty
- Relationships can lead to partnerships
- Partnerships stem out of just talking to other people
- Helps to have culture of open doors, availability to at least discuss
- Academic computing relationship is informal to semi-formal
- Referrals from Office of Research, this is a collaboration “waiting to happen”
- Research computing
- Office of research site–links to library data services
- Policies for data ownership and management
- Working on DMP Tool single sign-on
- co-host meetings for faculty
- host ELN jointly
- workshop participation, such as on data security & active storage topics
- Post-Doc office
- Office of sponsored programs
- info for libguides
- funding policies
- info about grants currently funded on campus
- access to dmps already written
- instruction for DMP Tool
- Labs & offices
- small scale instruction
- data to ingest into repository
- workshop on how to write proposal, including DMP training
- training in specific areas
- Grad student office
- instruction and orientation, for example on cloud storage
- eScience institute
- building training modules
- Scholarly communications office
- open access
- public access policies
- Office of general council
2. On what programs do you collaborate with campus partners?
- Customizing the DMP Tool
- DMP consultations
- Co-presentations with sponsored programs
- Co-presentations with scholarly publications
- Outreach – visit seminars and institutes
- Tech fair – library repository
- Three services: webpage, consultations, education.
- Three data services: consultations for DMP’s or general data management topics; education and training; and data archiving either in data repositories or in our institutional data repository
- Have a data management libguide
- Data Services webpage
- Webpage for services – spells out what we mean by RDM; points people to the different contacts on campus for data lifecycles; everything in one place
3. Who would you like to collaborate with?
- Building collaborations with faculty
- Faculty are interested in library supporting them and being involved; some elements are there
- We’d love to hear more from OSP; there are often time constraints
- Sponsored projects workshops
Office of Research
- “Research day” – compliance
- Workshops / outreach
- DM workshop series
- Copyright classes to graduate students
- Data management for active research, for example ELN
- It would be great to have a university level strategy
- Get a partnership with preservation
- A data board that could help with developing services
- Webpage to point people to certain areas
- Stakeholders – would like broader outreach, a unified group across campus
- New faculty institute – IRB, funding/grants/DMP’s
4. What are the roadblocks?
Perception of Library
- Being seen as being credible and useful. Libraries are seen as having a certain skill set. Need to have conversations and advocates in higher places – i.e. Provost. There’s a disconnect sometimes in terms of what people know of services in library.
- Perception of library as rare books room
- People do their own thing and don’t depend on library
- Research data policy/ lack thereof
- No policies & policies that do exist people do not know about
- Pass a policy but can it be implemented, is it realistic in what the library/institution can do?
- No buy-in due to “high up” (policy driver)
- IT layoffs
- Turf wars, territoriality (we can do this ourselves)
- Staffing stability
- People saying something will happen by a date and it not
- Turnover of staff / loss of staff positions can put a hold on things
- Personalities can be a problem
- Campus IT can be hard to communicate with
- Limited capacity for new services
- Lack of consistency of funding
- Lack of structure/organization
- Other departments have other agendas, similar issues but different priorities. Timing can be an issue to work with people on different schedules
- Getting PI’s on board, they all do things differently
- PI’s may train lab really well in DMP, others do not
- Different needs for different researchers
- Decentralized means different parts don’t always communicate
- What does language mean? Ex archiving, DM services. Have to define terms and how you are using them, controlled vocabulary
5. What support is needed from the library or the institution?
- High level support / promotion
- Infrastructure – e.g. ELN, repository
- People network
- What are other stakeholder desires & interests? Know enough about campus to make solutions
- Institutional view of issues
- Quality metadata requirements need repository librarian –
- Need more support for archiving & storage
- Problem is librarians want to take on tasks / have to take on tasks but cannot give something up.
- Communicating with peers – learning what else is happening
- What about Social Sciences & Humanities?
- Library management has been helpful
- Professional development from the library
- Help from the library to make connections
6. What external support is needed?
- Professional development to broaden knowledge (like this!)
- Listserv of this group
- STS listserv is helpful but don’t brand as discipline specific
- How to host NE region listserv?
- Sharing experiences & training with other librarians
- Short videos on technical subjects, like bit rot, preservation of videos, subject repository vs. local storage
Submitted by guest contributor Amanda Rust, Digital Humanities Librarian, Assistant Director, Digital Scholarship Group, Northeastern University Libraries, firstname.lastname@example.org
About six months ago I began a new position as a Digital Humanities Librarian, and I am now lucky enough to work with humanities data from the nitty-gritty (helping researchers contact publishers to acquire historical newspaper data) to the broadly conceptual (how is historical cultural data made?)
So in composing this short post, I thought I’d start with some recent big-picture discussions, and then apply some of those concepts to cultural data, which is often library-produced data.
Lisa Gitelman’s edited volume Raw Data is an Oxymoron and Johanna Drucker’s reformulation of data and capta (first in this 2011 Digital Humanities Quarterly article) are two excellent places to start, and well-known in the digital humanities field. To give a likely overly broad summary: these works suggest that the very meaning of “data” has changed over time, and even what we think of as the most natural, obvious, “given” data is designed in some way. The experimenter chose to observe it, created instruments encoding choices on how to measure it, perhaps disregarded outliers, imposed categorization and storage once it was captured, and so on.
Not to say that other disciplines have never considered these ideas! On the Humanist email list – one of the oldest online spaces for digital humanities work – there was recently a thread where long-time moderator Willard McCarthy prompted discussion of a resonant quote from Barry Lopez’ 1986 Arctic Dreams. In that book Lopez, a field biologist, discusses his work in the Arctic and “wonders” at the process of naming, the process of reducing what takes place “out there” to patterns that are statistically important, concluding that for the species under study: “No matter how long you watch, you will not see all it can do.”
So how do these theoretical considerations come into play when working with digital humanities projects? We are always confronting what’s been left out of the data. Researchers may start with open access data because it’s there, not because it’s the most relevant, immediately prompting us to notice that some core historical collections are only available via subscription. Why were some resources scanned and made open access, and others not? The vagaries of grant funding? The gaps between wealthy institutions that can afford to scan their collections, and those that cannot? The pressure on institutions to see special collections as a revenue stream?
Beyond the question of open or paid access, researchers are now asking detailed questions on libraries’ selection processes behind both preservation and digitization. Is the data representative of what was culturally significant in the past? What the library later determined to be significant? Who defines “significant”? Or was the original selection based on what was in good condition, or with clear copyright, or had multiple copies, or lacked multiple copies, or had a thematic focus that was easily grant fund-able? Libraries are often the producers of humanities data – or, capta — so it is both thrilling and frightening when digital humanities scholars ask these uncomfortable questions.
A request from Myrna Morales, Data Curation Graduate Student at University of Illinois at Urbana-Champaign, email@example.com
What: Request for identification of a data set
Why: Offer of assistance with data set by a data curation student
Course: Foundations in Data Curation
When: September-December 2015
We work with a Data Curation Specialization certification program team at the University of Illinois Urbana Champaign Graduate School of Library and Information Science (UIUC, GSLIS). Taught since 2007 as part of the MSLIS program, this one-semester Foundations of Data Curation (DC) course integrates as much exposure to data issues and direct experience with data as possible.
In recent semesters we have found that hands-on experience with real data sets noticeably improves student class engagement and understanding. Students are able to work effectively upgrading, ingesting, and/or rescuing a dataset. For instance, students improve their skills by enriching documentation, structuring for ingestion, and reformatting to accessible formats.
Students select a dataset at the start of the course and continue working on it in phases: 1) investigating & selecting a dataset; 2) developing a data management plan for improving the dataset; and 3) implementing the plan given available time and resources. Each dataset has an associated contact but communication with the dataset contact is restricted until the student has demonstrated to the instructors that they have mastered an understanding of the data and related available resources including papers or reports in the peer reviewed literature.
There is an expectation that if a student substantially improves the metadata documentation or the state of the data that the repository would consider using the results of their work. For instance, the National Snow and Ice Data Center and the National Space Science Data Center have publishing datasets worked on by students, datasets that would not otherwise be publically available. In addition to contributing to data availability, this approach represents an opportunity for a) students to provide a pointer to an example of data curation work on their vitae and b) repositories to enhance visibility of some data as well as to highlight their contributions to education and training of a much needed workforce in data curation.
If you have data that require attention and are interested in having a data curation student work with your data sets as a class project, please contact us. We would need to know the name of the data set, the type of data, a summary of what work you feel is needed, the name and contact information for a point of contact for the student, as well as a pointer to the data or a mechanism to access it. The first day of class is August 26, so we would need this information by the beginning of that week.
Ruth Duerr, firstname.lastname@example.org, Ronin Institute for Independent Scholarship and Adjunct Professor Graduate School of Library and Information Science, UIUC
Myrna Morales, email@example.com, UIUC Graduate Student in Data Curation
By Jen Ferguson, Co-Chair of the e-Science Portal Editorial Board. Jen can be contacted at firstname.lastname@example.org
After many rounds of user feedback, testing, and revision, we are very pleased to unveil the revamped e-Science portal today. In addition to the aesthetic redesign, here are a few of the more significant changes we’ve made to the portal based on your comments and suggestions:
- Added a new ‘getting started with e-Science’ quick guide
- Moved the events calendar to the front page
- Added the Twitter feed to the front page of the portal
- Reorganized the content headings significantly. Data Management, in particular, received a major overhaul – it now includes separate sections on research data lifecycles, data management planning, data curation, reasons to cite data, etc.
- Links to data tools have been posted directly to relevant pages such as the data curation page
- Tidied up site navigation in general, and pared down the size of the footer
- Clarified the relationship between the eScience portal and its partner projects
Our editors have been hard at work too – they’ve weeded older content in favor of focusing on smaller selections of newer material. The portal depends heavily upon our crack team of editors, and we’re happy to let them shine a bit more in the revamped portal. You told us you wanted to know more about the people behind the pages, and we heard you! The bios and smiling faces of our editors are now featured more prominently, and we’ve also made it easier for you to contact them directly. Please indulge me in a quick Academy Awards-style shoutout list to recognize those without whom this launch would not have been possible. Thanks to:
- My partner in editorial board co-chair crime, Katie – for knowing what works, what doesn’t, and not being afraid to call ‘em like you see ‘em.
- Our editors Amanda, Andrew, Daina, Jake, Julie, Margaret, and Stacy – for your fresh ideas, your patience as we messed about with your content areas, and your willingness to jump into your content headfirst.
- Usability consultant Bethany – for adding your voice and guidance to our revamp efforts.
- Portal staffers Bob, Donna, and Elaine – for keeping things on track and the project moving forward. We put poor Bob through his paces with this launch! Luckily he’s still talking to us.
Last but not least – thank you, readers, for lending us your time, expertise, and energies with everything from card sort exercises to a couple of rounds of beta testing. By my estimation around 50 of you participated in this revamp in some way. We couldn’t have done it without you! The portal is not done – is a website ever ‘done’? – but we’ve reached a point where we feel ready to release. What do you think of it so far? We welcome your feedback – comment on this post, Tweet to @NERescience, or shoot one of the editors an email.
Submitted by Jake Carlson, Research Data Services Manager, University of Michigan, email@example.com
I recently came across three opinion pieces that got me thinking on the current state of data librarians.
The first one, “Stacking the Deck” by Professor Michael Stephens, was published in the Library Journal. He describes “the full stack employee,” as first articulated by tech writer Chris Messina, and then re-imagines this description into the library workplace. A full stack employee is someone who is always on, deeply invested and goes the extra mile. They continually seek out new ways of producing and innovating through the application of technology and best practices. They are deeply connected to their peers through social media and share what they are doing, not to purposefully make a name for themselves, but to give back and add value to their communities. It’s not that they know everything; it’s that they are driven to discover possibilities and to bring people together.
Sounds like a model employee type that every library would want to hire, right? But what about the librarian him or herself? Are they “full stacked” because they want to be, or because they feel they have to be just to do their job?
Which brings me to the second piece, “Hiring Data Librarians” written by Alexis Johnson and published on Scribd. Alex is a self-described new data librarian and writes on adjusting to the position. Data librarians are often asked to perform a great many tasks and to possess or acquire a great many skills to perform their functions. Alex’s experience was coming in to the position with one set of expectations and then having more and more responsibilities piled on because “you’re a bright young fellow.” These creeping additions that Alex describes led to feelings of inadequacy for never being able to do enough as a data librarian and an anxiety that comes with feeling that you have to devote nights and weekends to learning and skill development. Alex closes the piece describing an actual job ad for a data librarian that includes 5 areas of responsibility, each of which could be considered a full time job in and of itself.
Finally, Rick Anderson writes of a “Quiet culture war in research libraries, and what it means for librarians, researchers and publishers” in UKSG Insights earlier this month. He is not writing on data librarianship directly but instead describes two competing conceptions of the role of the modern research library. On one side there are those who believe that the mission of the research library is to support the needs of its host institution. On the other side are those who would argue that libraries ought to focus on addressing larger issues of scholarly communication irrespective of institution. It is a lengthy piece and I cannot do it justice by trying to summarize it here. What drew my attention were his observations that disagreements in conceptions of the fundamental mission and how these disagreements play out in the operating culture of the library create tensions in the library’s allocation of scarce resources to its programs and projects.
Taken together these pieces present a potential problem for data librarians. I find Professor Stephen’s articulation of a full stack librarian interesting (though I do find the implicit equating of high performance in librarians to tech savviness and youth rather troubling). However, I am concerned that libraries as organizations will come to expect or demand such a complete commitment from hired data librarians without recognizing or providing the level of support needed for him or her to be successful. There are many, many ways that librarians could incorporate working with research data into their positions, but all too often I see job ads like the one described by Alex that over reach and ask for more than one person could possibly accomplish. What this type of job ad implies is that the hiring institution does not know what it wants to do in providing research data support, and in all likelihood will expect the person hired to figure it out for them. In this scenario, the hired librarian may not receive the resources or support needed to be successful. As Dorothea Salo has noted, the practice of hiring smart and talented librarians into ill-defined positions without providing them a solid base of support runs a high risk of burning out and driving away the very people libraries want to attract.
Developing data services is more than just hiring a librarian. It needs to be about the library as an organization making a commitment and investment of time, money and other resources to understand the needs of the communities (within or outside of the institution) and then to respond in ways that add value. This is not to say that libraries must have everything worked out beforehand, rather it is to recognize that getting into data will affect library organization and culture, and that a willingness to consider and openly support change will be needed to succeed. In other words, to support the full stacked librarian, we ought to consider how to build a full stacked library.
If you didn’t get to attend the 2015 New England Science Boot Camp that was held June 17-19th at Bowdoin College, no worries. And if you did attend boot camp, but would like the opportunity to review the interesting presentations, you can do that too!
All the presentations from the NE SBC 2015 from the Science Sessions, special Wednesday evening presentation, and the Capstone are now available on the Science Boot Camp for Librarians YouTube playlist at https://www.youtube.com/playlist?list=PLNtON4mU3aIdSsDOcOSGYcHjtlPJRLgDF
Check them out! And if you’d like to view what this year’s SBC topics were, check out the 2015 NE Science Boot Camp LibGuide.
The following job opportunities may be of interest to the e-Science Community:
George Washington University, Washington, DC: Research Services Coordinators (3 positions): https://www.gwu.jobs/postings/27542
Northeastern University, Boston, MA: Data Analytics/Visualization Specialist https://neu.peopleadmin.com/postings/35539
Reed College, Portland, OR: Data Services Librarian http://library.reed.edu/about/data-services-librarian
San Jose State University, Moss Landing Marine Laboratories : Senior Assistant Librarian, Tenure Track https://www.mlml.calstate.edu/sites/default/files/Tenure-track%20Assistant%20Librarian.pdf
University of California, Los Angeles: Scholarly Communications Librarian http://joblist.ala.org/modules/jobseeker/Scholarly-Communication-Librarian/30343.cfm
University of Missouri, Kansas City: Dental Scholarly Communications Relations and Outreach Librarian: https://myhr.umsystem.edu/psp/tamext/KCITY/HRMS/c/HRS_HRAM.HRS_CE.GBL?SiteId=8
Check them out! The following two articles have just been posted in the Journal of eScience Librarianship:
Assessment of Data Management Services at New England Region Resource Libraries
Julie Goldman, Donna Kafel, and Elaine R. Martin
Submitted by guest contributor, Katie Houk, Health & Life Sciences Librarian at San Diego State University. Katie’s e-mail address is firstname.lastname@example.org
I made the move from a smaller, private university setting to a large, public teaching university across the country eight months ago. One of my priorities was to bring data management education and awareness to the campus. I’ve been fortunate enough to work with our graduate and research affairs office to send out a environmental scan and to get approximately 120 responses. Compared to the number of faculty on campus it’s a rather limited response, but if you’ve ever tried to survey faculty you know how excited my team was to get over 100 responses to something sent out at the end of the spring semester.
It didn’t come as a surprise that when asked what they needed help with most, faculty thought of the most pressing and immediate need – writing Data Management Plans. It also wasn’t too surprising that the next issue on the list was data storage and backup, followed by sharing data, and lastly, preservation issues. What is disappointing to myself – and probably many of you – is the lack of infrastructure and campus centrality needed to deal with these last three issues.
Almost as soon as I arrived on campus I was asked to put together a proposal for an institutional repository solution. Our white paper was thorough and we asked for a robust solution as well as the minimal faculty and staff power it would take to run it. Sadly, the library faces an uphill battle with legitimizing our place on campus and finds it hard to get funding for large projects that require more manpower. I have since learned, however, that there is a group on campus looking into a “data storage solution” but it has no librarian involvement, and possibly no faculty researcher involvement, either.
The disconnect that happens between administration and faculty, and even faculty & administration and the library is a major impediment to creating the infrastructure required to help manage electronic data. If librarians and research faculty are not on the group looking at a campus-wide solution, will the implementation of such a thing actually provide anything actually usable? Not likely, as we consistently request a more robust IR platform than what is provided to our university system through the Chancellor’s office.
My current thoughts on the situation here area as follows:
- How does the library gain legitimacy as a unit to speak to when designing cross-campus solutions for electronic data storage and backup, etc?
- Does a university already strapped for funding want to enter the territory of trying to provide storage for research data when they don’t know or understand the amount or type of data being produced?
- Who will be in charge of this centralized solution and how will it be promoted and taught to a campus that is known for being very decentralized?
- How does the library not over-involve ourselves – since we are under-staffed as it is – in a situation where we are leading the charge and bringing to light these issues?
Have you struggled with these issues at your institution? How have you approached solving them (or have you ignored them out of necessity)? Do libraries need to have a collective plan or toolkit for helping solve these issues?
Contributed by Donna Kafel, Coordinator for the New England e-Science Program and Member of the NE Science Boot Camp Planning Group.
Along with my fellow New England Science Boot Campers, I headed Downeast this year for the seventh annual New England Science Boot Camp (SBC) at Bowdoin College in Brunswick, Maine, June 17-19th. Having heard many rave reviews about both Bowdoin and Brunswick, I was excited to have the opportunity to savor campus life there for a few short boot camp days. I instantly loved the Bowdoin campus and the town of Brunswick and soon found myself longing to be a Bowdoin student!
The science session topics for this year’s SBC were Cognitive Neuroscience, Marine Science, and Ornithology. Each of the boot camp science sessions feature two faculty from selected New England colleges and universities. Generally the sessions are structured so that for the first part one faculty member provides an overview of the science, followed by a second faculty member discussing the research he/she is conducting in the field. This year’s first science session was Cognitive Neuroscience. Dr. Erika Nyhus of Bowdoin College discussed key concepts and the types of classic experiments (remember Pavlov’s dogs and BF Skinner?) that laid the foundation for the field. Dr. Ann Maloney of UMass Medical School presented her research on altering neurocognition through videogames, specifically with children and teens with bipolar depression. Unfortunately many young people with bipolar depression require heavy doses of multiple medications to treat depression, and these medications have many undesireable side effects, such as rapid weight gain. The focus of Dr. Maloney’s research is studying the effects of video gaming on weight gain, and mood. Some preliminary findings from her research are that a significant number of her research participants with bipolar depression were able to stabilize their weight and required lower dosages of their psychiatric meds when they regularly engaged in active video games.
Wednesday evening featured a Literature and the History of Medicine themed talk by Dr. Ann Kibbie, of Bowdoin, “For the Blood is the Life: Dracula and the Early History of Blood Transfusion.” Dr. Kibbie discussed the perception of blood over the years in early medicine, the theories behind bloodletting as a way to restore wellness, and the early practice of blood transfusions—some of which were humans receiving blood from animals. I found myself astounded at how anyone could survive these early transfusions, from animals and other humans—without today’s technology of typing and crossing blood to ensure blood recipients are transfused with compatible blood.
The Marine Science session featured Dr. Barry Costa-Pierce of the University of New England discussing aquatic fisheries and the dire need to develop aquaculture in an environmentally sound way in order to feed the planet. Dr. Costa-Pierce noted that aquaculture and marine fisheries are often perceived negatively, as the popular press has done extensive coverage on antibiotic-laden fish farms, and recommended that consumers find out where their fish is from, as the types of fisheries vary dramatically from one country to another. Dr. Whitney King, of Colby College, presented his research on Maine lakes, the impact phosphorus pollutants have had on increased growth of algae and decreased oxygen in the lakes and approaches to alleviating the destruction of Maine lakes.
Ornithology was the last of the SBC science sessions. Dr. Michael Reed, of Tufts University, was the overview speaker. While he did discuss bird basics, and the interdisciplinary nature of ornithology research, what was striking about Dr. Reed’s talk is that he really covered his use of library resources—a topic that for years, our NE Science Boot Campers have wanted faculty speakers to address! Ornithology is a field in which print and digital resources are used extensively. Dr. Reed discussed popular ornithology journals, the relation of scholarly societies and journal publishing, increased availability of open source materials, his consults with Tufts Tisch Library staff searching for obscure documents, his frequent use of interlibrary loan, students’ database search practices and heavily-used ornithology data repositories such as ORNIS, NA Breeding Bird Survey, Xeno-canto (a bird song sharing repository), and Global Population Dynamics Database. These databases are heavily used and invaluable resources for ornithologists around the world. Dr. Nat Wheelwright of Bowdoin followed Dr. Reed’s presentation, starting off with a recording of a male Savannah sparrow. Dr. Wheelwright studies Savannah sparrows on the very remote Kent Island, where Bowdoin has a multidisciplinary field research station. In his presentation Incest avoidance in an island bird population, Dr. Wheelwright discussed the extraordinarily rare instances of accidental incest in the diminishing Savannah sparrow population on Kent Island. It was interesting to hear his data management practices. He collects data in the field on “Write in the Rain pads.” These are used frequently in field studies because they are water repellent. Every night he and other members of his team enter the data from the field studies into a database that has numerical limits enabling auto-correct for specific metrics.
The SBC Capstone session featured a presentation by Thea Atwood, Engineering Librarian at UMass Amherst, and Cara Martin-Tetreault, Director of Sponsored Research at Bowdoin, on the OSTP directive for enabling public access to federally funded research output. Thea discussed the policy and federal agencies’ responses regarding data management plan requirements, and the OSTP’s impact on library data services. Cara discussed funders’ requirements for data management plans in grant proposals from an institutional perspective. In her discussion, Cara noted that if Bowdoin Science Librarian Sue O’Dell hadn’t initiated a discussion with her about library interest in research data management, she would never have thought of the library as a partner in supporting research data management at Bowdoin. Data management plans are one component among many other grant proposal requirements that sponsored research offices have to address, and Cara welcomed this working partnership with the library in supporting researchers’ data management plans.
The second half of the session was a breakout activity. The week before SBC every Capstone attendee had been sent one research case from the New England Collaborative Data Management Curriculum (NECDMC), to read ahead of the Capstone, to prepare for the activity: writing and reviewing a data management plan. The Capstone attendees were divided into assigned groups of four or five and all group members had been assigned the same research case. Five cases from NECDMC were featured in the Capstone, and every two groups had the same case. The groups were tasked with writing a data management plan based on the case. After 40 minutes, each group swapped their data management plan with the other group who had been assigned the same case, and the groups reviewed each others’ data management plans. After the data management plans were all reviewed, scribes for each group gave their data management plans and reviews to one of the Capstone organizers, and returned to the auditorium for a whole group discussion. The group was asked several questions about their experience writing and reviewing data management plans. When asked what worked well in writing the data management plans, attendees noted having someone in their group with subject expertise, breaking down and mapping the data components of the case, and labeling the data as qualitative or quantitative. Challenges in writing the data management plans were not knowing the requirements of specific funding agencies, not having an institutional policy, being unfamiliar with terminologies or instrumentation. For some cases Capstone attendees noted that it would have been helpful to have disciplinary knowledge. Attendees noted that they liked being able to review another group’s data management plan on the same case, as it gave them an opportunity to see the data management components of the research case from different eyes.
This writing and reviewing data management plan activity was designed to give attendees the experience of reviewing a research project in a discipline that they may be unfamiliar with, and identifying the key data management components that would need to be addressed in a data management plan—a process that a librarian consulting on a data management plan would do in actual practice.
The cases, the group’s data management plans, and the reviews can be viewed on the the Capstone page of the 2015 Science Boot Camp LibGuide under the “Capstoners Data Management Plan and Critiques” section.
So that’s a not so brief recap of this year’s New England Science Boot Camp. Many thanks to this year’s gracious Science Boot Camp host, Sue O’Dell and the members of the New England Science Boot Camp Planning Committee for all their hard work over the year putting together this rich and unique learning and networking event. I’ll be announcing when the boot camp videos are available for viewing–stay tuned!
In the MORNING, tour a world-class computational center in Holyoke, MA. The Massachusetts Green High Performance Computing Center (MGHPCC) serves the growing research computing needs of five of the most research-intensive universities in Massachusetts: BU, Harvard, MIT, Northeaster, and the University of Massachusetts. The computers in the MGHPCC run millions of virtual experiments per month, supporting thousands of researchers in Massachusetts and around the world.
In the AFTERNOON, attend the first Research Data Management Roundtable discussion at the University of Massachusetts, Amherst. This roundtable is an informal gathering of librarians actively engaging in data services (e.g. planning data services, serving on library data service advisory group, consulting on dmps, data curation, teaching rdm, data curation.) and the first in a series of discussions focusing on practical details and learning from our colleagues about research data management. The discussion topics for this session are our organizational structures, both within the library and across campus.
Later Roundtable events will reach out to librarians just beginning to work in research data management.
Further information on these events is available at the libguide August 2015 eScience Events in MA – Tour & Roundtable, http://classguides.lib.uconn.edu/nerdmtable
Registration for both events is here: http://goo.gl/forms/3a1DJKocFF
Registration opens on July 1 and closes on August 7. Space is limited to 25 participants. The Roundtable event is sponsored by the New England Regional Medical Library’s eScience Advisory Board. For details, contact Donna Kafel at Donna.Kafel@umassmed.edu.
Posted on behalf of Kristen Burgess, Assistant Director for Research and Informatics, Donald C. Harrison Health Sciences Library, University of Cincinnati Libraries
The University of Cincinnati (UC) Libraries seek an Assistant Director for Health Sciences Library (HSL) and Henry R. Winkler Center for the History of the Health Professions (Winkler Center) Operations.
The Assistant Director for HSL and Winkler Center Operations provides leadership and coordination for the daily operations of the Health Sciences Library and Winkler Center. In collaboration with other members of the HSL leadership team, the Assistant Director assists with development of policies and procedures, implementation of the UC Libraries strategic plan, facilities management and scheduling, and financial and human resources allocation. The Assistant Director plays a central role in developing new programs and coordinating collection development and management.
For the full position description and information about how to apply, see http://bit.ly/1IxeKfp . UC is an EE/AA employer.
It’s hard to believe that the e-science portal is nearly 6 years old – a kindergartener, in human terms! The world of data librarianship has evolved a lot in that time, and the portal should continue adapting along with that changing landscape.
To that end, we’ve spent a good portion of the last year doing several rounds of user testing on the portal, scrutinizing the results, and having a lot of conversations about how best to organize and design a new version of the portal to make it as useful to you as possible.
Many thanks to those readers who’ve aided this effort by being our ‘guinea pigs’ as we’ve tested navigation, layouts, and overall user experience. Your feedback has truly been invaluable, and has greatly shaped our redesign. Some of you may be hearing from us again soon, as we reach out with one last (brief, we promise!) questionnaire about the changes we’ve made.
It’s been a long haul, but I’m pleased to report that the end is in sight. We’re putting the finishing touches on the portal redesign now, and plan to launch the new version this summer. We hope you’ll like what you see!
Submitted by Amanda Whitmire, Data Management Specialist, Oregon State University Libraries
As the mandates included in the 2013 OSTP memo are nearly coming to pass, I’ve started to wonder how we can take advantage of the new opportunities they may provide. I’m pretty excited about some work I’ve been collaborating on over the past year and a half, and the requirements included in The Memo point toward much bigger opportunities in the same area. What am I talking about? Data management plans (DMPs). The Memo requires that nearly every Federal agency that provides funding to our university researchers now compel their applicants to include a DMP with every proposal. This will vastly expand the number of DMPs that flow through our universities, and will also broaden the disciplinary scope that they cover. Why am I so excited about the prospect of more DMPs? This gets back to the work I’ve been doing lately: DMPs can be a very rich source of information for academic librarians, and the researcher in me simply can’t resist an opportunity to collect more data.
Let me explain. I’m just back from the IASSIST 2015 conference, where I lead a panel focused on sharing the methods and early results of the DART Project. DART (Data management plans as A Research Tool) is an IMLS-funded project aimed at facilitating systematic, robust review of National Science Foundation (NSF) data management plans as a means to inform the development or expansion of research data services. DMPs can tell you a wide range of things about what researchers are up to, the kinds of data they are generating, and how they plan to document, share and archive those data. If you can gain access to a large enough sample of plans and then review them in a consistent way, you will be able glean information on where researchers aren’t using best practices, or are perhaps not taking advantage of a service that exists on campus. There is also the side benefit of becoming more familiar with the research being done on campus, which is always so fascinating.
For example, in a recent review of 50 DMPs from Georgia Tech, DART Project co-PI Lizzy Rolando discovered that while researchers did a good job of planning for how to share their data, they had not fully described how they would archive or share them in the DMPs. Based on these results, Lizzy plans to improve data preservation outreach and training materials for researchers, as well as make improvements on the “boilerplate” DMP language she provides surrounding data reuse, redistribution and the creation of derivatives.
As more Federal agencies begin to roll out a DMP requirement, the pool of potential data for this type of library-based research will vastly expand. We are already learning so much from reading NSF DMPs; what secrets will NOAA, NASA, USDA, and Forest Service plans reveal? I am so excited to find out!
 Check with your Office of Sponsored Programs. You may have to remind them that the content of funded proposals are part of the public record, and there is no reason not to provide you with the DMPs associated with them.
Submitted by guest contributor Daina Bouquin, Data & Metadata Services Librarian, Weill Cornell Medical College of Cornell University, email@example.com
In the time since he joined the John G. Wolbach Library and Information Resource Center at the Harvard-Smithsonian Center for Astrophysics in 2010, Chris Erdmann has been heading a team dedicated to meeting the needs of the Astrophysics Community, as well as advancing professional development opportunities for librarians and students alike. Initiatives like the Unified Astronomy Thesaurus and Data Science Training for Librarians (DST4L) are just a few of the many fascinating and forward-thinking projects that Chris and his team have undertaken in recent years.
The success of Chris’s projects, his attention to the evolving needs of scientists and librarians, and his acknowledgement of the necessity to iterate and adapt in quickly changing settings drove my decision to reach out to him for this interview. I asked Chris to speak about his views on the role of the library in facilitating research and learning in hopes of contributing to the larger conversation among librarians about the future of librarianship. Our conversation is outlined here:
Daina: What do you think libraries can do to foster collaboration with members of their communities?
Chris: That’s a question I’ve been working on for a while. I mean, there are unnecessary divisions being made– for example, Open Humanities vs Open Sciences, but what about the connections between the two? I remember Massimo Riva, an expert in Digital Humanities from Brown, saying at Designing Libraries in Calgary, that what the Digital Humanities needs is more Data Science. Successful libraries will be able to avoid these divisions and embrace Open Research on a wider, collaborative scale.
Another thing making collaboration more difficult is a sort of hesitancy from libraries to really delve deeply into the technologies that scientists are using. I think one way to foster collaborations is to remove that technology barrier and start using the same tools our researchers use. Researchers seemed to be forming a perception of the library, that we did a particular thing, and that was it. My hope is that we can break down that barrier, that researchers can see us differently and see us as knowledgeable collaborators that might be able to help them with their programming, database, etc needs. It’s something you have to work on every day, to get to the point where faculty and scientists truly do believe that the library has a role to play, but the opportunity is there. Holding open conversations with your community helps as well. These conversations allow you to share ideas, goals and developments with each other.
Daina: Aside from supporting DMP writing, what do you think are some of the roles that librarians can have in data-intensive settings?
Chris: You never know what to expect. You want to be a kind of Swiss Army Knife. Develop your skills across the board so you can adapt quickly to new roles. One space that I am particularly interested in developing new services for my community is in supporting their professional development and technical needs. Graduate students and post-docs spend a great deal of time trying to learn programming methodologies and techniques that can help them work better. They often struggle to learn on their own. There’s no handholding. They go through a really rough process, and so there’s this gap– the library has a definite role to play there. Many people, across disciplines, are acknowledging now that early career researchers may not continue in academia, and they will need these skills to be successful on the outside, in industry (see the recent report on the postdoc experience by the NAP). They want to be able to choose meaningful work and they will need these skills do that. The library can help.
I also think librarians have a natural role as consultants. You don’t want to feel like a used car salesman advertising specific tools and solutions to the community, that you’re not beholden to any one solution. Libraries can benefit by being more flexible. To me, librarians seem to fit well into the role of consultants, offering different options that the student or researcher can choose for themselves. The librarian can then step in to guide that process, making lightweight connections and being a bit more agnostic about tools. I think people appreciate it when you’re not selling them something, and instead you’re collaborating with them. Helping them make the decision rather then making decisions for them.
I think librarians can also play a role in working with publishers and societies, providing community feedback and helping to integrate new information and tools to improve scholarly workflows. Our goals are the same, to make working with and curating research objects easier for our patrons, and I think librarians’ perspective can help remove some of that friction. You can also say that libraries can ill afford to not be in these conversations as we need to understand how to best integrate our own services in author and publisher workflows.
Daina: What do you think libraries can do to market their value and their role in scholarship?
Chris: One very unique thing the library offers is continuity. The library is concerned about capturing and maintaining institutional memory, and we can inform our faculty, students and other stakeholders about current and past initiatives. The library can help them with their projects, offer stability, so that they can continue and be adopted by the next group of people. But you also need to find the library’s champions. People who can really advocate from outside the library and foster collaborations and build out collaborative projects. Here we have been grateful to our partnerships with people like Alyssa Goodman and the Seamless Astronomy Group to help us have conversations with astronomers, computer scientists, information scientists, and others, to really be actively involved in the development of tools and systems that enable digital scholarship in astronomy. Another champion of ours is Nancy Brickhouse, our Senior Science Advisor; she advises the Director on organizational programs, scientific research policy, and strategic planning. In general, you never know who will be an advocate for your library within your organization. Be welcoming to patrons with questions who are open to working with you and developing projects. Don’t push those opportunities away.
Daina: Could you tell me about the Data Scientist Training for Librarians course and its goals?
Chris: You can’t learn new skills in two days. It takes time. Maybe three or four months of really applying new skills in real settings, and so that’s part of what we have been aiming for with Data Scientist Training for Librarians. We wanted librarians to get out from behind their desks, experience the full research lifecycle and hopefully get them talking to each other about new services they could provide. The hope is to build a community, and while we’re not quite there yet, we’re making connections. It’s a very difficult thing to scale– four months is not as easy a sell as a few days here and there, so we’re still figuring out where to go moving forward. We know there is value in building out a community in librarianship for data science skills, but it’s very difficult to build a community with only very short-term exposure to tools. We may end up partnering with groups like Mozilla and adapting more scalable approaches like marketing the training to students and community members more broadly, but we’d still like to get a spark going from within the libraries themselves. We have to find the time to do this, which may mean having to cut some other things we do to make time. It’s challenging; everyone is overworked. To learn to be a data scientist on top of all the other things librarians are expected to do is difficult. But allowing ourselves to re-tool and make time for innovating, and even failing, will help us learn more about being practical and meeting our community’s needs.
Submitted by e-Science Portal Editor, Margaret Henderson, Director of Research Data Management , Virginia Commonwealth University Libraries. Margaret’s e-mail is firstname.lastname@example.org
For those who couldn’t attend MLA last week, and those who attended but couldn’t fit the Informationist Award session led by Dr. Alan VanBiervliet into their schedule, I thought I would share some of the helpful suggestions that were discussed. I was also part of a focus group for current award holders earlier in the afternoon, so there will be a few suggestions based on the experiences of this group as well.
In case you haven’t already seen it, the program overview is here: http://www.nlm.nih.gov/ep/AdminSupp.html and the most recent Funding Opportunity Announcement (FOA) is here: http://grants.nih.gov/grants/guide/pa-files/PA-15-249.html Please note that the deadline for applying for an NLM Administrative Supplement Award for Informationist Services in NIH-Funded Research Projects is July 17, 2015.
How do I find a PI?
The FOA spells out exactly who can apply for the award, but the largest eligible group are researchers with R01 grants. A few librarians have been approached by grant holders who saw the FOA and are interested in collaborating, but most of the time, librarians find out who has R01s (grants office or RePorter) and send an invitation. If you institution has some sort of translational science center, you might approach the organizers there as well.
Remember, the Principle Investigator (PI) must submit the award paperwork, and deal with all the communication with the program administrators and grants office staff. This will involve some extra work for them, so make sure you take care of as much of the award application as you can, and help out whenever they ask.
The PI should also alert their own program officer that they will be applying for this award. Some institutes contribute funds to the award pool, so it is often possible to award more Informationist awards.
Note: Some grants have more than 1 PI. And you can apply for a supplement for more than one grant, although only one application per grant.
Will I need a biosketch?
Yes, as key personnel on the grant, you will need to have a biosketch – in the new format. Think of this as a chance to practice with myNCBI if you haven’t created a biosketch yet. Even if you have done a biosketch before, remember, you will need to update to the new format, so you will find out what all the fuss is about.
You will also need to include a separate Informationist Statement of Commitment indicating that you are prepared to take on this project.
What can Librarians do to help?
The award gives a list of things that will be considered, so you should try to focus on what you or your group can do best.
“The purposes of this administrative supplement program are (1) to enhance collaborative, multi-disciplinary basic and clinical research by integrating an information specialist into the research team in order to improve the capture, storage, organization, management, integration, presentation and dissemination of biomedical research data; and (2) to assess and document the value and impact of the informationist’s participation.”
The work of the librarian(s) must be in the scope of the current project, not start something new. Helping with data management, analysis, visualization, and/or publication are all possible tasks.
Note: awards have gone to a single librarian,as well as librarian teams.
Evaluate your work.
The other purpose of these awards is to show the value of the Informationist(s) to research, so include evaluation methods in your research plan. This is only a two year award so be realistic about what and how you will evaluate your impact. Decide on how you would define success based on the goals and aims of the project.
Working with data in some way is a big part of the stated purpose of the grant so make sure you, or somebody you can team up with on the award, have some sort of data experience.
Researchers who are established and have had many renewals of their R01 grant(s) probably have a pretty good idea of how to manage their data. Early career researchers might be more open to help with data and information management.
You don’t have to be a medical librarian. The team I am part of has a medical liaison librarian, a liaison librarian who supports biomedical engineering, and other life sciences, and a liaison librarian who supports math, physics, engineering, and other hard sciences. I am on the team as the data librarian. The four of us reside in two different libraries.
Because the librarians won’t necessarily come from the health sciences library on campus, the HSL director support letter is not mandatory.
Because the NLM program staff can only communicate with the grant PI and the signing grant administrator at your institution, you will not receive notice if you receive the award. Communication with the PI, and with your grants office, will be important.
The Journal of eScience Librarianship has an issue from 2013 with articles from the informationists who had the first round of awards http://escholarship.umassmed.edu/jeslib/vol2/iss1/
AAHSL and MAR collaborated on a series of webinars that included some of the informationists and their directors. http://nnlm.gov/mar/training/aahsl
I think one of the most challenging aspects to being a data librarian is figuring out how to talk to people about what you do and the services that you provide. Metadata, curation, archiving all mean different things to different people, assuming that they are familiar with these terms at all. Even using the word “data” is a dangerous proposition. Researches in the Arts & Humanities may not see themselves as working with data. Librarians may also have questions about what constitutes “data”, given that definitions are often fairly broad.
As a result I find myself doing a lot of translating between different groups of people. In talking with faculty and librarians I will try to get a sense of how they think of research data and how they describe the issues that are relevant to them. In attempting to make connections with the people I interact with I tend to use full definitions first and then introduce particular terms later on. For example, I’ll talk about how critical it is for people to have access to contextual information about research data so that they will be able to understand the work that was done and trust the data, rather than bring up metadata.
Which brings me to the conundrum of the term “data information literacy”.
In writing up the 2011 article on DIL for portal, Michael Fosmire, C.C. Miller, Megan Sapp Nelson and I employed the term “data information literacy” to deliberately distinguish our work from “data literacy” for two reasons.
First, data literacy generally refers to how the data are used or manipulated to produce research outputs as opposed to how the data are managed, shared or curated. These things are certainly related to each other and a number of the DIL Competencies we came up with venture into this territory, but the perspectives and approaches are distinct from each other. We thought that this distinction was an important one to make.
Second, we really wanted to make connections between data librarians and information literacy librarians and to affirm that each had important contributions to make in this area. We saw this as a “big tent” area for librarians where expertise and skill sets from multiple types of librarians would be needed to be successful. Data was (and perhaps still is) a foreign area to many librarians and so couching the work we were doing in something that was much more familiar and accepted made sense.
However, “data information literacy” as a term does not mean anything to people outside of academic libraries. This is not really surprising. “Information literacy” doesn’t really mean much outside of the library community either (with the possible exception of education) and that community has struggled a bit with how to present itself to faculty, students and others. From what little I know about this community there have been and continue to be discussions about changing “information literacy” to other labels such as “information fluency” to describe their work. These discussions highlight the difficulty of finding a term that succinctly encapsulates the work that librarians do in ways that are both meaningful to ourselves and to others.
I recently published an article with Marianne Stowell Bracke about a semester long course we taught to graduate students in the College of Agriculture at Purdue. We used the term “data literacy” to describe our work in the article because the venue we published in reaches beyond the library community, but perhaps more importantly this is how she and I connected with our students and our sponsors. We wrestled with the decision of what term to use in the article, but in the end choose to be authentic to how we discussed our work with our constituencies.
I don’t know that “data information literacy” as a term has really caught on yet with librarians, or if it ever will. And really that’s okay. I still see value in making the distinction between “data literacy” and “data information literacy”, but it’s more important that we connect with our communities in ways that they can understand and relate to. For now, I’m willing to trade shared terminology for forward progress.
For another take on “translating” as a component of Data Librarianship, check out this article by Kirsten Partlo published in IASSIST Quarterly