<?xml version="1.0" encoding="UTF-8" ?>
<!DOCTYPE TEI.2 PUBLIC "-//TEI P4//DTD Main Document Type//EN"
"http://www.infomotions.com/alex/dtd/tei2.dtd" [
<!ENTITY % TEI.XML         'INCLUDE' >
<!ENTITY % TEI.prose       'INCLUDE' >
<!ENTITY % TEI.linking     'INCLUDE' >
<!ENTITY % TEI.figures     'INCLUDE' >
<!ENTITY % TEI.names.dates 'INCLUDE' >
<!ATTLIST xptr   url CDATA #IMPLIED >
<!ATTLIST xref   url CDATA #IMPLIED >
<!ATTLIST figure url CDATA #IMPLIED >
]> 
<TEI.2>
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title>What are libraries?</title> 
        <author>Eric Lease Morgan</author>
        <respStmt>
          <resp>converted into TEI-conformant markup by</resp>
          <name>Eric Lease Morgan</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Eric Lease Morgan, &#169; University of Notre Dame</publisher>
        <address>
        	<addrLine>emorgan@nd.edu</addrLine>
        </address>
        <distributor>Available through the Distant Reader at <xptr url='https://distantreader.org/blog/what-are-libraries/' />.</distributor>
        <idno type='reader'>22</idno>
        <availability status='free'>
          <p>This document is distributed under a GNU Public License.</p>
        </availability>
      </publicationStmt>
      <notesStmt>
       <note type='abstract'>Through the use of concordancing, I address the question, "What are libraries?"</note>
      </notesStmt>
      <sourceDesc>
        <p>This was originally shared on the Code4Lib Slack channel (October 28, 2022).</p>
      </sourceDesc>
    </fileDesc>
    <profileDesc>
      <creation>
        <date>2022-11-14</date>
      </creation>
      <textClass>
        <keywords>
          <list><item>libraries and librarianship</item></list>
        </keywords>
      </textClass>
    </profileDesc>
    <revisionDesc>
      <change>
<date>2022-11-14</date>
<respStmt>
<name>Eric Lease Morgan</name>
</respStmt>
<item>initial TEI encoding</item>
</change>
    </revisionDesc>
  </teiHeader>
  <text>
    <front>
    </front>
    <body>
      <div>
  <p>I wanted to address the question "How are libraries and librarianship described in computer science literature?" To accomplish this goal I first downloaded about 4,000 pre-prints from arxiv which had been self-described by their authors as being about digital libraries. I then extracted the underlying plain text from the PDF documents and concordanced for the following phrases:</p>
  <list type='bulleted'>
    <item>librarians are</item>
    <item>librarians were</item>
    <item>librarianship is</item>
    <item>librarianship was</item>
    <item>libraries are</item>
    <item>libraries were</item>
    <item>library is</item>
    <item>library was</item>
  </list>
  <p>I then removed all the text up to and including the given phrases, removed duplicates, and sorted the results. In the end I had a quick-and-dirty list of pseudo-definitions. Some of the more interesting are listed here:</p>
  <list type='bulleted'>
    <item>a center of information, science, technology, art and culture [5]. an informati</item>
    <item>a collection of information sources in a place." * a collection consists of obj</item>
    <item>an external stored procedure wrapped in a table -valued stored procedure sphtm_</item>
    <item>becoming less important to universities, while publishers have roughly maintain</item>
    <item>constantly evolving in response to scholars' needs. * convenience transformatio</item>
    <item>interested in building special event-centric web collections that humans can co</item>
    <item>taking the lead in offering courses in using, managing, and disseminating data</item>
    <item>today on the front lines of efforts to preserve data products and link them to</item>
  </list>
  <p>(The complete list is available at 
  <xref url='./libraries-are.txt'>./libraries-are.txt</xref>.)</p>
  <p>On one hand, the results are not surprising. On the other hand, I would not be able to articulate all of these simple definitions, and I believe the enumeration can be the beginning of an interesting discussion.</p>
  <p>For extra credit, one could acquire a different set of content -- say, bunches o' articles from the traditional library literature -- apply the same concordancing process, and compare the results.</p>
</div>

    </body>
    <back>
    </back>
  </text>
</TEI.2>
