Planet Code4Lib
https://planet.code4lib.org/
2024-03-19T10:42:41+00:00
http://intertwingly.net/code/venus/
Open Knowledge Foundation: #ODDStories 2024 @ Kwara, Nigeria 🇳🇬
https://blog.okfn.org/?p=29363
2024-03-19T08:50:05+00:00
<p>The <a href="https://meta.wikimedia.org/wiki/Learnovation_Network_Foundation" rel="noreferrer noopener" target="_blank">Learnovation Foundation Network</a> organized the <a href="https://meta.wikimedia.org/wiki/Event:Wikidata_Loves_SDGs_Nigeria" rel="noreferrer noopener" target="_blank">Wikidata Loves SDGs 2024</a> event on 7 March 2024 at the Mustapha Akanbi Library and Resource Centre in Kwara, Nigeria, to celebrate <a href="https://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day</a> with <a href="https://blog.okfn.org/2024/02/28/and-the-winners-of-the-open-data-day-2024-mini-grants-are/" rel="noreferrer noopener" target="_blank">mini-grant support</a>. The event focused on enhancing and updating Wikidata items related to Sustainable Development Goals (SDGs) in Nigeria, fostering collaboration and awareness among Wikidatans, SDG advocates, and data enthusiasts.</p>
<div class="wp-block-spacer" style="height: 30px;"></div>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29365" height="690" src="https://blog.okfn.org/wp-content/files/2024/03/1603px-Group_photo_of_participants_at_the_WikiData_Loves_SDGs-ODD_2024_42-1024x690.jpg" width="1024" /></figure>
<figure class="wp-block-gallery has-nested-images columns-4 is-cropped">
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29366" height="720" src="https://blog.okfn.org/wp-content/files/2024/03/540px-WLS-OKF_02.jpg" width="540" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29371" height="720" src="https://blog.okfn.org/wp-content/files/2024/03/540px-WLS-OKF_05.jpg" width="540" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29369" height="720" src="https://blog.okfn.org/wp-content/files/2024/03/791px-Particiapnts_at_the_WikiData_Loves_SDGs-ODD_2024_24.jpg" width="791" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29368" height="720" src="https://blog.okfn.org/wp-content/files/2024/03/809px-Hafisat_Ige_5.jpg" width="809" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29373" height="720" src="https://blog.okfn.org/wp-content/files/2024/03/914px-Participant_at_the_WikiData_Loves_SDGs-ODD_2024_17.jpg" width="914" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29367" height="720" src="https://blog.okfn.org/wp-content/files/2024/03/923px-Blessing_Linason_09.jpg" width="923" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29372" height="683" src="https://blog.okfn.org/wp-content/files/2024/03/1080px-BARAkat_Adegboye_067-1024x683.jpg" width="1024" /></figure>
<figure class="wp-block-image size-large"><img alt="" class="wp-image-29370" height="619" src="https://blog.okfn.org/wp-content/files/2024/03/1192px-WikiData_Loves_SDGs-ODD_2024_41-1024x619.jpg" width="1024" /></figure>
</figure>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p>The event boasted the presence of experienced facilitators and Wikimedia project organizers, including <a href="https://meta.wikimedia.org/wiki/User:Ridzaina" rel="noreferrer noopener" target="_blank">Barakat Adegboye</a>, <a href="https://meta.wikimedia.org/wiki/User:Linason_Blessing" rel="noreferrer noopener" target="_blank">Blessing Linason,</a> and <a href="https://meta.wikimedia.org/wiki/User:Mijesty" rel="noreferrer noopener" target="_blank">Miracle James</a>. The theme, <strong>“Open data for advancing sustainable development goals,</strong>” set the stage for a day of insightful presentations and hands-on activities.</p>
<p><a href="https://www.linkedin.com/in/kehinde-akinsola-46825b1a4/" rel="noreferrer noopener" target="_blank">Kehinde Akinsola</a>, the Programs Lead from The Wellbeing Foundation Africa, represented by Miss Jimoh Zainab, delivered an engaging talk on the intersection of <a href="https://docs.google.com/presentation/d/1BKdK4R-l4Zg8aOqe1_VPVuHQaF-xq7TT/edit#slide=id.p7" rel="noreferrer noopener" target="_blank">open data and SDGs</a>, emphasizing the role of accurate data in achieving sustainable development. <a href="https://www.linkedin.com/in/hafisat-ige-871097142/" rel="noreferrer noopener" target="_blank">Hafisat Ige</a>, a renowned Data Scientist and Women Techstar Fellow ’23, provided a deep dive into <a href="https://docs.google.com/presentation/d/1H2eaO0lShe9K4IvgrOitjY_qDAsEAyA7/edit#slide=id.p4" rel="noreferrer noopener" target="_blank">data-driven strategies for SDG advancement.</a></p>
<p><a href="https://commons.wikimedia.org/wiki/Category:Learnovation_Foundation_Network" rel="noreferrer noopener" target="_blank">Participants</a>, including Undergraduate Students, Educators, Librarians, Mass media and Medical professionals, engaged actively in the sessions. Despite the limitation of resources restricting invitations to only 20 out of 39 registered attendees, the event was a resounding success, with contributions spanning across various SDG-related topics on Wikidata.</p>
<p>The event utilized the Outreach Dashboard to track participant contributions, which included the creation of 2 new items, the editing of 12 items, and a total of 87 edits by 36 editors. The efforts of participants led to the addition of 28 new references, enhancing the reliability and depth of Wikidata’s SDG-related content.</p>
<p>In conclusion, Wikidata Loves SDGs 2024 not only highlighted the critical role of open data in sustainable development but also demonstrated the power of community collaboration in enriching the global data repository for the greater good. The event set a precedent for future initiatives aimed at leveraging open data for societal progress.</p>
Bukola James
https://blog.okfn.org
HangingTogether: Breaking down succession planning challenges with Metadata Managers
https://hangingtogether.org/?p=14245
2024-03-18T14:49:05+00:00
<p>The challenges of transitioning to new metadata workflows have long been a concern to OCLC RLP Metadata Managers Focus Group members (<a href="https://hangingtogether.org/what-should-metadata-managers-be-learning/">What should metadata managers be learning?</a>, <a href="https://hangingtogether.org/filling-the-bench/">Filling the bench</a>, <a href="https://hangingtogether.org/new-skill-sets-for-metadata-management/">New skill sets for metadata managers</a>). Recently, the group has asked me to facilitate deeper conversations about how to address these challenges. For the January 2024 session, I contacted Crystal Goldman, General Instruction Coordinator for the UC San Diego Library. <a href="https://orcid.org/0000-0002-9828-6005">Crystal’s research</a> examines how staff in research libraries understand and apply succession planning. She notes that although there is some literature about the potential benefits of succession planning (and a call for more among library leaders/HR professionals), no comprehensive studies have been conducted across different libraries. In both her interviews and surveys, she has focused on three areas of activities (based on a framework from the Society of Human Resource Managers (SHRM)):</p>
<ul>
<li>training and development</li>
<li>career planning and management</li>
<li>replacement planning or formal succession planning</li>
</ul>
<p>To help us understand where Metadata Managers stand, we asked for responses to an informal survey using some of the questions from a previous instrument used in Crystal’s study of succession planning in ARL libraries.</p>
<p>Among both ARL libraries and Metadata Managers, formal succession planning (i.e. planning/preparing multiple individuals to potentially step into leadership roles) happens (if it happens at all) mostly at senior leadership levels. Like other ARL respondents, Metadata Managers were more likely to know about formal succession planning in their organizations if they were already managers in a leadership role. Metadata Managers identified that they engaged in replacement planning, often around key life events like expected temporary parental/medical leave and/or retirements. Even in these cases, identifying staff to fill gaps may happen in informal discussions with other managers while not directly engaging with staff who might see themselves in new roles. In the worst-case scenarios, Metadata Managers found themselves with unexpected vacancies, forcing them to promote “accidental managers” into leadership roles.</p>
<p>Metadata Managers reported slightly higher activity than most ARL respondents around training and development. Participants in our session felt this was unsurprising given the nature of metadata work and the changing landscape of technical developments that have been occurring. Similarly, Metadata Managers participate in some career planning and management, especially thinking about what kinds of competencies will be needed in the next five years. Forecasting those skills can inform decisions about hiring new staff members and/or providing opportunities for staff willing to seek new challenges.</p>
<figure class="wp-block-pullquote has-text-align-left has-pale-cyan-blue-background-color has-background"><blockquote><p>During our discussion, I learned that a new revision of the <a href="http://hdl.handle.net/11213/20799"><em>Core Competencies for Cataloging and Metadata Librarians</em></a> was just published. <a href="https://ala-events.zoom.us/rec/play/H9rn3ByywUZhZMzLEVftjWK7UEpW6G2cbTfFMkwB-wYLPJuxneGVTu8NnChNu9UoOo4hdNo6qCA6fXIM.nA9JOMcxFEw2_ARt?canPlayFromShare=true&from=my_recording&startTime=1708714933000&componentName=rec-play&originRequestUrl=https%3A%2F%2Fala-events.zoom.us%2Frec%2Fshare%2FqIJU26o5JybqWUXJhdoxMaHOnUdQlE9BSBgU1sJJV4FqfEsAnONwa6Qs0z2Mo_99.B6GYLc3PfTi9eTnE%3FstartTime%3D1708714933000">A recording of the authors speaking about the development of the revision</a> is available from the ALA CORE interest group. </p></blockquote></figure>
<p>When the topic of succession planning has come up in the past, I sensed that Metadata Managers were responding to broad calls to do better in this area – and perhaps felt guilty that they hadn’t made more progress. One of the most valuable things I walked away from the sessions with was a better way to tease apart the challenges we are all facing into structural, cultural, and agentive issues. </p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><a href="https://hangingtogether.org/wp-content/uploads/2024/03/MetadataManagers_Goldman-1.png"><img alt="Graphic illustrating the concept of structure (a house icon), culture (a group of three people icon), and agency (a thumbs-up icon)." class="wp-image-14247" height="443" src="https://hangingtogether.org/wp-content/uploads/2024/03/MetadataManagers_Goldman-1.png" title="illustration" width="408" /></a></figure></div>
<h2 class="wp-block-heading">Structure</h2>
<p>In both our sessions, Metadata Managers acknowledged the challenges of working within organizational contracts, collective bargaining agreements, or other job classification criteria. At a time when metadata is changing, these structures can require additional effort to redefine a position’s required skills and experience. This may not be feasible due to time limitations and/or limited availability from human resources staff that are trying to fill multiple open positions. In these scenarios, it can help to focus energies toward longer-range thinking about competencies.</p>
<p>Several Metadata Managers noted that these structures can be especially frustrating in places where metadata is transitioning. Moving away from cataloging to other kinds of next-generation metadata work can be inhibited by structural agreements that classify staff differently. As hiring managers are already struggling against economic forces to attract people into libraries with the needed computer/data science expertise, this can require additional effort to navigate. Structures also limited Metadata Managers’ agency to provide professional development opportunities to staff with aptitude/attitude for new challenges because they fall outside narrowly defined positions.</p>
<p>Institutional policies requiring searches to be conducted in a specific way (e.g. external national searches) can also make it hard to elevate staff with an aptitude for leadership within the organization. In Crystal’s research and in our discussions, examples surfaced of promising leaders needing to leave their organizations to advance their careers. For other types of libraries, transitioning into a management role may come with risks due to the loss of contract protections.</p>
<h2 class="wp-block-heading">Culture</h2>
<p>In many ways, succession planning in academic libraries reflects the culture of academic institutions more broadly. In principle, these are organized around merit-based systems of advancement (i.e. tenure) that find corporate-style succession planning distasteful. In these contexts, seeking external candidates holds more value than advancing staff internally. These aspects of culture are often reified into structural policies that are difficult to change (either through practice or contractual obligations).</p>
<p>While there is value in adding new views and voices to an organization, this practice of preferring external hires can inhibit investments in developing staff leadership skills that are key to succession planning. This approach can also create self-fulfilling feedback loops, i.e. current leadership is reluctant to invest in leadership training for non-management staff because they will not be able to advance within the organization. This is reinforced by a fear that when staff do get this training, they are likely to find it easier to leave with their new skills to another organization. These kinds of cultural attitudes are also in operation around technical skills that create a Catch-22 for both managers and staff.</p>
<h2 class="wp-block-heading">Agency</h2>
<p>Within these kinds of structures and cultures, Metadata Managers have some opportunities to exercise their agency:</p>
<ul>
<li>How can you embed future staffing needs into other strategic planning? Rather than focusing on the advancement of an individual (i.e. traditional succession planning), how can you have transparent conversations about how to advance as a group? In the process, you may find individuals who also want to advance their leadership/technical skills. This longer-range planning can also provide the time needed to navigate structural barriers and provide opportunities to redefine job descriptions that allow for growth with the right attitude.</li>
<li>As a Metadata Manager, you can cultivate a climate that supports discussions about career planning beyond immediate skills development. Even having a basic discussion with your team about planning can be a good way to start the ball rolling.</li>
<li>It may also be helpful to have a conversation within your organization about what it means to be successful regarding the different activities that make up succession planning. Is developing staff who leave to be successful elsewhere a win or a loss? If this is not the outcome you’re hoping for, how can you change the structural/cultural roadblocks to success?</li>
</ul>
<p>An area that would be worth additional follow-up discussion is the relationship between diversity, equity, and inclusion (DEI) efforts in libraries and succession planning activities. This intersection was outside the scope of Crystal’s work and only briefly discussed during our sessions. On one hand, formal succession planning has been viewed as a detriment to DEI because it can reinforce systemic bias about who can advance in an organization. On the other hand, conscientious use of succession planning activities can help clear away these same obstacles. In our discussion, it was noted that the culture of external searches has been tied to DEI recruitment goals. As noted, this already creates tension when successful leaders need to change institutions to advance, potentially having a detrimental effect on the retention of diverse staff. If this is a topic that you’re currently working on in your library, please reach out about how we could facilitate a future conversation among the Metadata Managers Focus Group.</p>
<p>The post <a href="https://hangingtogether.org/breaking-down-succession-planning-challenges-with-metadata-managers/">Breaking down succession planning challenges with Metadata Managers</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Richard Urban
https://hangingtogether.org/
Information Technology and Libraries: Letter from the Editors: March 2024
https://ital.corejournals.org/index.php/ital/article/view/17080
2024-03-18T07:00:00+00:00
<p>The editors of Information<em> Technology and Libraries</em> provide an update on Editorial Board activities and summarize the content of the March 2024 issue.</p>
Kenneth J. Varnum; Marisha C. Kelly
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: How Libraries Can Foster a Vibrant Local Music Community
https://ital.corejournals.org/index.php/ital/article/view/17063
2024-03-18T07:00:00+00:00
<p>This column outlines how libraries can add value to their both their digital offerings and programming while providing local music artists with a curated, low-barrier entrance into streaming media. Library-hosted digital music collections give up-and-coming artists increased exposure and credibility to listeners and open a wealth of opportunities to engage with their communities.</p>
Joshua Smith
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: Supporting Information Visualization Research in an Academic Library
https://ital.corejournals.org/index.php/ital/article/view/16867
2024-03-18T07:00:00+00:00
<div> <p class="AbstractText"><span lang="EN-CA">This paper summarizes librarian research on information visualization as well as general trends in the broader field, highlighting the most recent trends, important journals, and which subject disciplines are most involved with information visualization. By comparing librarian research to the broader field, the paper identifies opportunities for libraries to improve their information visualization support services. </span></p> </div>
Michael Groenendyk, Tomasz Neugebauer
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: Knowledge Graph Visualization Interface for Digital Heritage Collections
https://ital.corejournals.org/index.php/ital/article/view/16719
2024-03-18T07:00:00+00:00
<p>Digital heritage portal interfaces are generally similar to digital library and search engine interfaces in displaying search results as a list of brief metadata records. The knowledge organization and search result display of these systems are item-centric, with little support for identifying relationships between items. This paper proposes a knowledge graph system and visualization interface as a promising solution for digital heritage systems to support users in browsing related items, understanding the relationships between items, and synthesizing a narrative on an issue. The paper discusses design issues for the knowledge graph, graph database, and graph visualization, and offers recommendations based on the authors’ experience in developing three knowledge graph systems for archive and digital humanities resources: the Zubir Said personal archive collection at the Nanyang Academy of Fine Arts, Singapore; Singapore Pioneers social network; and Polyglot Medicine knowledge graph of Asian traditional and herbal medicine. Lessons learned from a small user study are incorporated in the discussion.</p>
Christopher S.G. Khoo, Eleanor A.L. Tan, Siam-Gek Ng, Chwee-Fong Chan, Michael Stanley-Baker, Wei-Ning Cheng
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: Recommended by Librarians
https://ital.corejournals.org/index.php/ital/article/view/16687
2024-03-18T07:00:00+00:00
<div> <p class="AbstractText">To study library guides, as published on Springshare’s LibGuides platform, new approaches are needed to expand the scope of the research, ensure comprehensiveness of data collection, and reduce bias for content analysis. Computational methods can be utilized to conduct a nuanced and thorough evaluation that critically assesses the resources promoted in library guides. Web-based library guides are curated by librarians to provide easy access to high-quality information and resources in a variety of formats to support the research needs of their users. Recent scholarship considers library guides as valuable resources and as de facto publications, highlighting the need for critical study. In this article, the authors present a novel model for comprehensively gathering data about a specific genre of books from individual LibGuide pages and applying computational methods to explore the resultant data. Beginning with a pre-selected list of 159 books, we programmatically queried the titles using the LibGuides Community search engine. After cleaning and filtering the resultant data, we compiled a list of 20,484 book references (of which 6,212 are unique) on 1,529 LibGuide pages. By testing against inclusion and exclusion criteria to ensure relevancy, we identified a total of 281 titles relevant to our topic. To gain insights for future study, citation analysis metrics are presented to reveal patterns of frequency, co-occurrence, and bibliographic coupling of books promoted in LibGuides. This proof-of-concept could be adopted for a variety of applications, including assessment of collections, public services, critical librarianship, and other complex questions to enable a richer and more thorough understanding of the information landscape of LibGuides.</p> </div>
Carmen Orth-Alfie, Erin Wolfe
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: Hidden Inequities of Access
https://ital.corejournals.org/index.php/ital/article/view/16661
2024-03-18T07:00:00+00:00
<div> <p class="AbstractText"><a name="_Hlk148083708"></a>Despite ongoing efforts to improve database accessibility, aggregated database vendors concede that they do not have complete control over document accessibility. Instead, they point to the responsibility of journal publishers to deliver articles in an accessible format. This may increase the likelihood that users with disabilities will encounter articles that are not compatible with a screen reader. To better understand the extent of the problem, a document accessibility audit was conducted of randomly selected articles from EBSCO’s Library & Information Source database. Full-text articles from 12 library science journals were evaluated against two measures of screen reader compatibility: HTML format (the optimal format for screen readers) and PDF accessibility conformance. Findings showed inconsistencies in HTML format availability for articles in the selected journals. Additionally, the entire sample of PDF articles failed to meet the minimum standard of PDF Universal Accessibility of containing a tagged structure. However, all PDF articles passed accessibility permissions tests, so could be made accessible retroactively by a third party.</p> </div>
Amanda Hovious, Congwen Wang
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: Exploring the Impact of the Gamified Metaverse on Knowledge Acquisition and Library Anxiety in Academic Libraries
https://ital.corejournals.org/index.php/ital/article/view/16651
2024-03-18T07:00:00+00:00
<div> <p class="AbstractText"><span lang="EN">This paper investigates the potential of the Gamified Metaverse as a platform for promoting library services. The study compares the effectiveness of a traditional library program with a Metaverse-based library program in terms of knowledge acquisition and library anxiety. The research also examines students’ perceptions of implementing gamification within the context of the Gamified Metaverse platform. A mixed-methods approach was adopted, including pre- and post-test analysis, statistical analysis, and qualitative data collection. The results indicate that both the traditional and Metaverse-based library programs effectively increased the participants’ knowledge, with no significant difference between the two approaches. However, the Metaverse-based program was found to be less effective in facilitating interaction with librarians and reducing library anxiety. Additionally, students expressed positive perceptions of implementing gamification in the Gamified Metaverse platform, finding it engaging and motivating. These findings contribute to the understanding of the effect of the Metaverse as a tool for promoting library services and enhancing knowledge acquisition. However, it is not as effective in reducing library anxiety, particularly in terms of interaction with librarians and staff. It should be noted that the platform may have limitations such as high costs and potential side effects of virtual reality, making it more suitable as an additional tool for promoting library services, taking into account its feasibility and potential benefits for specific student populations and larger libraries.</span></p> </div>
Pradorn Sureephong, Suepphong Chernbumroong, Supicha Niemsup, Pipitton Homla, Kannikar Intawong, Kitti Puritat
https://ital.corejournals.org/index.php/ital
Information Technology and Libraries: Overview of the Library Automation System in South Sulawesi Libraries
https://ital.corejournals.org/index.php/ital/article/view/15853
2024-03-18T07:00:00+00:00
<div> <p class="AbstractText">Technology in libraries has played an essential role in serving today’s communities. This study provides an overview of the integrated library systems/software (ILSs) used in libraries in South Sulawesi, Indonesia. It aims to highlight the strengths and possibilities of ILSs and briefly explain their advantages and disadvantages along with the cost of implementation. The data was gathered from questionnaires sent via an online survey and from direct interviews with certain academic libraries over the period of 2019 to 2020. Fifty-three of 67 libraries that fulfilled the study have implemented an ILS. To deeply understand the application, a direct interview with some libraries was conducted to learn the advantages and disadvantages. The result of the study showed that the most used ILSs are SLiMS and INLISlite and other programs like Apollo, Athenium Light, Simpus, Spektra, Jibas, KOHA, and Openlibrary. The budget spent is an average of 300 USD. While the ILSs have helped these libraries improve services, IT expertise and adequate resources are needed, especially when the systems present problems. An easy-to-use system that costs less will potentially be used in this area of research. This study will be particularly helpful for any library in Indonesia. These findings may also be generalized to libraries in other countries facing economic and technological similarities.</p> </div>
Taufiq Mathar, Ismaya
https://ital.corejournals.org/index.php/ital
Ed Summers: Blobs
https://inkdroid.org/2024/03/17/blobs/
2024-03-17T04:00:00+00:00
<p>
On the one hand, all the <a href="https://huggingface.co/models">models</a> that are available for
download on Hugging Face seem pretty much like programming language
compilers and interpreters that we download and use to write software.
You don’t try to open and read <code>/usr/bin/python3</code> in your
text editor. You trust that it works. Simon Willison <a href="https://simonwillison.net/2023/Aug/3/weird-world-of-llms/">says</a>
these models are like an “opaque blob that can do weird and interesting
things”, and the same analogy seems to hold for the binary executables
we run too.
</p>
<p>
But the big difference is that, once you get various dependencies
assembled correctly, you can <a href="https://devguide.python.org/getting-started/setup-building/">build</a>
the Python binary. The build depends on other opaque blobs being set up,
like <a href="https://gcc.gnu.org/">gcc</a>, which in turn can be <a href="https://gcc.gnu.org/install/build.html">built</a> by bootstrapping
using a lower level language. There are layers of abstraction at work
that can be tested, and reasoned about, which lead us to having some
confidence that things are working correctly. It might get complicated
but we can debug them when they don’t work correctly.
</p>
<p>
This is not true of the opaque blob Large Language Model (LLM). We don’t
have access to the source code that was used to create it. Compiling it
can require a huge investment in time and resources. There’s no way to
debug its logic. If you’re lucky there may be a paper about how it was
built, but some don’t because it is deemed too dangerous.
</p>
<p>
So while it feels the same, it’s really not. I just don’t understand why
people would like to integrate LLMs into applications, for generating
database queries, or API calls. It seems to me like we would want to be
able to reason about these things, and that we lose the ability to do
that when using an LLM. Why does anyone think this is a good idea?
</p>
<p>
And if this style of programming were really to catch on with a new
generation of programmers, would we lose our ability to understand SQL
or REST? Are these really useless abstractions like <a href="https://en.wikipedia.org/wiki/Assembly_language">Assembler</a>
that we want to forget? Won’t our ability to reason about our
applications atrophy? The state of software is already kind of bad, and
it seems like some people are dreaming up ways of making it even worse.
</p>
Ed Summers
https://inkdroid.org/
Lucidworks: Tired of Tech Hype? Get Back to These 7 Customer Experience Basics
https://lucidworks.com/?p=28569
2024-03-14T23:11:58+00:00
<p>These customer experience basics will actually make a meaningful impact on your bottom line. AR and VR, blockchain, and 3D models won’t help you get there.</p>
<p>The post <a href="https://lucidworks.com/post/customer-experience-basics/">Tired of Tech Hype? Get Back to These 7 Customer Experience Basics</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Lila Schoenfield
https://lucidworks.com/
David Rosenthal: Petabit Optical Media?
tag:blogger.com,1999:blog-4503292949532760618.post-3848145161257524872
2024-03-13T16:59:28+00:00
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi3o4iVw3a7KlNcltB3flBGEvE7aybYm5G6liubjeBThYpeFQh7gAwCAy1M5MD9Tm6pECxVVJxc35y6hkknOa4YkEiWliSpcJgVMDNxdFjB7DnubN4hsnH3cwDguYF1TGajAownfksnJUvcPUknMRMVUL3mpTNkl8djjvtMBvJxbdW94PDVD3aa1AgYKXKp/s218/Sabine.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="218" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi3o4iVw3a7KlNcltB3flBGEvE7aybYm5G6liubjeBThYpeFQh7gAwCAy1M5MD9Tm6pECxVVJxc35y6hkknOa4YkEiWliSpcJgVMDNxdFjB7DnubN4hsnH3cwDguYF1TGajAownfksnJUvcPUknMRMVUL3mpTNkl8djjvtMBvJxbdW94PDVD3aa1AgYKXKp/s1600/Sabine.png" width="214" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.youtube.com/watch?v=6a_yxsJuOMY">Source</a></td></tr></tbody></table>
<a href="https://www.youtube.com/watch?v=6a_yxsJuOMY"></a><a>Sabine Hossenfelder</a> does the <a href="https://blog.dshr.org/search?q=Pangloss&max-results=20&by-date=true">good Dr. Pangloss</a> proud in her report on <a href="https://doi.org/10.1038/s41586-023-06980-y"><i>A 3D nanoscale optical disk memory with petabit capacity</i></a> by Miao Zhao <i>et al</i>. Their abstract claims that:<br />
<blockquote>
we increase the capacity of [optical data storage] to the petabit level by extending the planar recording architecture to three dimensions with hundreds of layers, meanwhile breaking the optical diffraction limit barrier of the recorded spots. We develop an optical recording medium based on a photoresist film doped with aggregation-induced emission dye, which can be optically stimulated by femtosecond laser beams. This film is highly transparent and uniform, and the aggregation-induced emission phenomenon provides the storage mechanism. It can also be inhibited by another deactivating beam, resulting in a recording spot with a super-resolution scale. This technology makes it possible to achieve exabit-level storage by stacking nanoscale disks into arrays, which is essential in big data centres with limited space.
</blockquote>
Below the fold I discuss this technology.<br />
<span><a name="more"></a></span>
What the authors mean by "petabit level" <a href="https://doi.org/10.1038/s41586-023-06980-y">is</a>:<br />
<blockquote>
The ODS has a capacity of up to 1.6 Pb for a DVD-sized disk area through the recording of 100 layers on both sides of our ultrathin single disk.
</blockquote>
1.6 petabit is 200TB per disk, which is 2,000 times the capacity of <a href="https://en.wikipedia.org/wiki/Blu-ray">triple-level Blu-ray media</a>. So this is a big increase. But weirdly, the caption to their Figure 1 <a href="https://doi.org/10.1038/s41586-023-06980-y">claims that</a>:<br />
<blockquote>
The capacity of a single 3D nanoscale disk is approximately equivalent to that of a petabit-level Blu-ray library (15.2 Pb, DA-BH7010, Hualu, China) or an HDD data array (12.64 Pb, EMC PowerVault ME5084, Dell, USA).
</blockquote>
A decade ago, <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html">Facebook's Blu-ray library</a> put 10,000 100GB disks in a single rack for 1 Peta<i>byte</i> or 8 Peta<i>bit</i> capacity. This is 5 times as much as the authors' claim for a single disk. The caption's claim of 15.2Pb for the DA-BH7010 is 9.5 times their claim of the capacity of a single disk. Note also that they compare the volume of a single disk to the volume of complete read-write systems, which is comparing apples to oranges. I guess if your meaning of "approximately" is "within an order of magnitude" that makes sense.<br />
<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiOgGwOj5l65aHujHZwqZQLXZL9Xga03LgxVCKNk-ZJo0DAjuIBqKq-xQdHRZmqmy0Gmio2x_52R9nasszMtoARyRfhsVC0OJYmQxdnywxwmzlVZqT1gYEc5-5XGWo7XBm5NKc-ZepisikV_fOOqPgZceKmRc9b0ibPGIGxZsQIx5A-vMrCUwRs_Iu_evoS/s902/TriStateTransitions.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="66" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiOgGwOj5l65aHujHZwqZQLXZL9Xga03LgxVCKNk-ZJo0DAjuIBqKq-xQdHRZmqmy0Gmio2x_52R9nasszMtoARyRfhsVC0OJYmQxdnywxwmzlVZqT1gYEc5-5XGWo7XBm5NKc-ZepisikV_fOOqPgZceKmRc9b0ibPGIGxZsQIx5A-vMrCUwRs_Iu_evoS/w200-h66/TriStateTransitions.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://doi.org/10.1038/s41586-023-06980-y">Figure 3a</a></td></tr></tbody></table>
The recording material on the disk has three states, as shown in the <a href="https://doi.org/10.1038/s41586-023-06980-y">schematic Figure 3a</a>:<br />
<blockquote>
The transition from the second to the third state is initiated by the 515-nm femtosecond Gaussian-shaped laser beam and deactivated by the 639-nm CW doughnut-shaped laser beam.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi-Pmz-P1BpAGdZJbm6t00W-evELOTYIC-Rg7hhMZ3UjnjJt5lJPNaGGpuHUjs6bvre7W1nfyiBw7gVakV94FEXZ_MGs8EBIOh3mfoZjmfIvfyWxfelj9_5qNxgLB9JM43EGZgEVJnm4NkJ9Eu72uE331UgImZirla6m6-CJ_BRevysZXLYr1Ue0yuZgd0x/s500/EmissionSpectra.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="122" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi-Pmz-P1BpAGdZJbm6t00W-evELOTYIC-Rg7hhMZ3UjnjJt5lJPNaGGpuHUjs6bvre7W1nfyiBw7gVakV94FEXZ_MGs8EBIOh3mfoZjmfIvfyWxfelj9_5qNxgLB9JM43EGZgEVJnm4NkJ9Eu72uE331UgImZirla6m6-CJ_BRevysZXLYr1Ue0yuZgd0x/w200-h122/EmissionSpectra.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://doi.org/10.1038/s41586-023-06980-y">Figure 3c</a></td></tr></tbody></table>
I assume that because this transition involves polymerization it is irreversible, making the media write-once. Comparing the dark blue line (second state) with the yellow and pink lines (third state) in Figure 3c shows that the second and third states are readily distinguishable by their <a href="https://doi.org/10.1038/s41586-023-06980-y">emissions when illuminated by >1mw 480nm</a>.<br />
<br />
There are a number of reasons to be less enthusiastic about the potential of this technology than Hossenfelder. It is true that they have demonstrated the ability to read and write petabit-scale data on a CD-sized medium. To do the reading they use two lasers, a 480nm pulsed and a 592nm continuous laser. To do the writing they used two lasers, a 515nm <a href="https://en.wikipedia.org/wiki/Mode_locking">femtosecond laser</a> and a 639nm continuous-wave laser. I haven't been able to find a price for a 515nm femtosecond laser, but <a href="https://www.thorlabs.com/newgrouppage9.cfm?objectgroup_id=14348">here</a> is a 1550nm femtosecond laser for $48,880. The femtosecond laser they actually (<a href="https://www.acculasers.com/en/product/fs_lasers/">Acculasers ACL-AFS-515-CUS</a>) used is a substantial box with fans and an AC power input.<br />
<br />
The authors make claims of the density of the <i>medium</i> but not of the <i>system</i>. Clearly, current femtosecond lasers are too expensive and too large to use in equivalents of the decade-old <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html">Facebook Blu-Ray technology</a>. Something like Microsoft Research's system that uses femtosecond lasers to write in Silica allows the cost of the lasers to be amortized over an entire data-center aisle of media. If you are going to build something like this, there is no reason to use the CD form factor.<br />
<br />
The repetition rate of the femtosecond laser was 42MHz. I believe it writes one bit per pulse, so the write bandwidth is limited to around 5MB/sec, meaning that writing an entire disk would take around <strike>10.5</strike> 10,000 hours. A system using this technology would be write-once, and have a long read latency while the robot fetched the needed disk. It would thus only be suitable for the niche archival market, and in this market the slow write rate would require many drives writing in parallel. This all makes this claim by the authors <a href="https://doi.org/10.1038/s41586-023-06980-y">somewhat hyperbolic</a>:<br />
<blockquote>
the development of next-generation industry-oriented nanoscale ODS that is much less expensive than state-of-the-art optical disk libraries and HDD data arrays will fulfil the vast data storage requirements of the big-data era.
</blockquote>
It would have similar product issues to those I outlined in <a href="https://blog.dshr.org/2024/03/microsofts-archival-storage-research.html"><i> Microsoft's Archival Storage Research</i></a>:<br />
<blockquote>
Six years ago <a href="https://blog.dshr.org/2018/02/dnas-niche-in-storage-market.html">I wrote</a>:<br />
<blockquote>
<a href="http://blog.dshr.org/2016/12/the-medium-term-prospects-for-long-term.html">time-scales in the storage industry are long</a>. Disk is a <a href="https://en.wikipedia.org/wiki/History_of_IBM_magnetic_disk_drives#Early_IBM_HDDs">60-year-old technology</a>, tape is at least <a href="https://en.wikipedia.org/wiki/IBM_7_track">65 years old</a>, CDs are <a href="https://en.wikipedia.org/wiki/Compact_disc">35 years old</a>, flash is <a href="https://www.google.com/patents/US5095344">30 years old</a> and has yet to impact bulk data storage.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj-ZpPqvkf3GXj_kN5rc__s2L5rvYJFx7FAmoP_wx6B-74cGM0d6zq6Y1HZdiuKwCRMx91LIhSAOod9lRwnBjlKQh8mwXLNSbQqy3nRhk6fYJRY4W8jniBiE4E8Bp5dgQZF_H6ghOlIIkdYl1JQK8wkgW3TKLT0Z92Pb2BNOgRiRyHlYVCeR-yk3CZ4CA/s800/BitShipments.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="83" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj-ZpPqvkf3GXj_kN5rc__s2L5rvYJFx7FAmoP_wx6B-74cGM0d6zq6Y1HZdiuKwCRMx91LIhSAOod9lRwnBjlKQh8mwXLNSbQqy3nRhk6fYJRY4W8jniBiE4E8Bp5dgQZF_H6ghOlIIkdYl1JQK8wkgW3TKLT0Z92Pb2BNOgRiRyHlYVCeR-yk3CZ4CA/w200-h83/BitShipments.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://digitalpreservation.gov/meetings/DSA2023/loc_dsa2023_website_0104_lauhoff_Storage%20Landscape%20__0326.pdf">Source</a></td></tr></tbody></table>
Six years on flash has finally impacted the bulk storage market, but it isn't predicted to ship as many bits as hard disks for another four years, when it will be a 40-year-old technology. Actual demonstrations of DNA storage are only 12 years old, and similar demonstrations of silica media are 15 years old. History suggests it will be decades before these technologies impact the storage market.<br />
</blockquote>
Hossenfelder makes several mistakes in her report:<br />
<ul>
<li>"new disk memory that could bring disk memory into the Petabyte range" - no, that is the Peta<i>bit</i> range.</li>
<li>Optical disks "were outcompeted by hard disks". - no, write-once removable media and on-line storage are two completely different markets. Optical disks lost out to the cloud and to a lesser extent by flash.</li>
<li>"the information density on compact disks or any optical storage is ultimately limited by the frequency of the laser light" - well yes, but she is talking about a paper describing a 2000-times increase in capacity <i>using laser light</i>.</li>
<li>"in modern flash drives the information is stored in little magnetizable cells that are a few atoms in size" - no, flash isn't a magnetic technology. She also misses that modern flash is a volumetric not a planar technology, just like the technology in the paper.</li>
<li>"figured out how to write data in multiple layers" - no, Blu-ray is a multi-layer technology more than a decade old. They figured out how to write a lot more layers of much smaller bits.</li>
<li>"this could work up to hundreds of layers" - well, they only demonstrated 100 layers, so hundreds plural is speculation. To get to the petabyte range needs at least 500 layers or much smaller bits. Note that modern flash has over 100 layers.</li>
</ul>
David. (noreply@blogger.com)
https://blog.dshr.org/
Meredith Farkas: Time: It doesn’t have to be this way
https://meredith.wolfwater.com/wordpress/?p=4588
2024-03-13T01:59:02+00:00
<img alt="Three pocket watches" class="attachment-thumbnail size-thumbnail wp-post-image" height="150" src="https://meredith.wolfwater.com/wordpress/wp-content/uploads/2024/03/Linearer_Zeitfluss-e1710282415495-150x150.jpg" style="float: left; margin: 0 15px 15px 0;" width="150" /><blockquote class="wp-block-quote">
<p>“What we think time is, how we think it is shaped, affects how we are able to move through it.” </p>
<p><cite>-Jenny Odell <em>Saving Time</em>, p. 270</cite></p></blockquote>
<p>What I love about reading Jenny Odell’s work is that I often end up with a list of about a dozen other authors I want to look into after I finish her book. She brings such diverse thinkers beautifully into conversation in her work along with her own keen insights and observations. One mention that particularly interested me in Odell’s book <a href="https://bookshop.org/p/books/saving-time-discovering-a-life-beyond-the-clock-jenny-odell/18556369?ean=9780593242704" rel="noreferrer noopener" target="_blank"><em>Saving Time</em></a> (2023) was <a href="https://bookshop.org/p/books/what-can-a-body-do-how-we-meet-the-built-world-sara-hendren/13591508?ean=9780735220003" rel="noreferrer noopener" target="_blank"><em>What Can a Body Do</em></a><em> </em>(2020) by Sara Hendren. Her book is about how the design of the world around us impacts us, particularly those of us who don’t fit into the narrow band of what is considered “normal,” and how we can build a better world that goes beyond accommodation. Her book begins with the question “Who is the built world built for?” and with a quote from Albert Camus: “But one day the ‘why’ arises, and everything begins in that weariness tinged with amazement” (1).</p>
<p>“Why” is such a simple world, but asking it can completely alter the way we see the world. There’s so much in our world that we simply take for granted or assume is the only way because some ideology (like neoliberalism) has so deeply limited the scope of our imagination. Most of what exists in our world is based on some sort of ideological bias and when we ask “why” we crack the world open and allow in other possibilities. Before I read the book <a href="https://bookshop.org/p/books/invisible-women-data-bias-in-a-world-designed-for-men-caroline-criado-perez/15136602?ean=9781419735219" rel="noreferrer noopener" target="_blank"><em>Invisible Women</em></a> (2021) by Caroline Criado Perez, I already knew that there was a bias towards men in research and data collection as in most things, but I didn’t realize the extent to which the world was designed as if men were the only people who inhabited it and how dangerous and harmful it makes the world for women. <em>What Can a Body Do</em> similarly begins with an exploration of the construction of “normal” and how design based on that imagined normal person can exclude and harm people who aren’t considered normal, particularly those with disabilities. The book is a wonderful companion to <em>Invisible Women</em> in looking at why the world is designed the way it is and how it impacts those who it clearly was not built for. I’ll explore that more in a later essay in this series. </p>
<p>One thing I took for granted for a very long time was time itself. I thought of time in terms of clocks and calendars, not the rhythms of my body nor the seasons (unless you count the start and end of each academic term as a season). I believed that time was scarce, that we were meant to use it to do valuable things, and that anything less was a waste of our precious time. I would beat myself up when, over Spring Break, I didn’t get enough practical home or scholarship projects done or if I didn’t knock everything off my to-do list at the end of a work week. I would feel angry and frustrated with myself when my bodily needs got in the way of getting things done (I’m writing this with ice on both knees due to a totally random flare of tendinitis when I’d planned to do a major house cleaning today so I’m really glad I don’t fall into that <a href="https://mindfulnessmeditation.net.au/arrow/" rel="noreferrer noopener" target="_blank">shooting myself with the second arrow trap</a> as much as I used to). I looked for ways to use my time more efficiently. I am embarrassed to admit that I owned a copy of David Allen’s <em>Getting Things Done</em> and tried a variety of different time management methods over the years that colleagues and friends recommended (though nothing ever stuck besides a boring, traditional running to-do list). I’d often let work bleed into home time so I could wrap up a project because not finishing it would weigh on my mind. I was always dogged by the idea that I wasn’t getting enough done and that I could be doing things more efficiently. It felt like there was never enough time all the time. </p>
<div class="wp-block-image">
<figure class="aligncenter size-full is-resized"><a href="https://meredith.wolfwater.com/wordpress/wp-content/uploads/2024/03/lloyd.jpeg"><img alt="Black and white photo of a man hanging from a clock atop a building" class="wp-image-4592" height="234" src="https://meredith.wolfwater.com/wordpress/wp-content/uploads/2024/03/lloyd.jpeg" width="317" /></a>From Harold Lloyd’s <em>Safety Last</em> (1923)</figure>
</div>
<p>I didn’t start asking questions about time until I was 40 and the first one I asked was a big one “what is the point of our lives?” Thinking about that opened a whole world of other questions about how we conceive of time, what kinds of time we value, to what end are we constantly trying to optimize ourselves, what is considered productive vs. unproductive time, why we often value work time over personal time (if not in word then in deed), why time often requires disembodiment, etc. The questions tumbled out of me like dominoes falling. And with each question, I could see more and more that the possibility exists to have a different, a better, relationship with time. I feel Camus’ “weariness, tinged with amazement.”</p>
<p>This is an introduction to a series of essays about time: how we conceive of it, how it drives our actions, perceptions, and feelings, and how we might approach time differently. I’ll be pulling ideas for alternative views of time from a few different areas, particularly queer theory, disability studies, and the slow movement. I’m not an expert in all these areas, but I’ll be sure to point you to people more knowledgeable than me if you want to explore these ideas in more depth.</p>
<p>How many of you feel overloaded with work? Like you’re not getting enough done? How many of you are experiencing time poverty: where your to-do list is longer than the time you have to do your work? How many of you feel constantly distracted and/or forced to frequently task-switch in order to be seen as a good employee? How many of you feel like you’re expected to do or be expert in more than ever in your role? How many of you feel like it’s your fault when you struggle to keep up? More of us are experiencing burnout than ever before and yet we keep going down this road of time acceleration, constant growth, and continuous availability that is causing us real harm. People on the whole are not working that many more hours than they used to, but we are experiencing time poverty and time compression like never before, and that feeling bleeds into every other area of our lives. If you want to read more about how this is impacting library workers, I’ll have a few article recommendations at the end of this essay.</p>
<p>My exploration is driven largely by this statement from sociologist Judy Wajcman’s (2014) excellent book <a href="https://bookshop.org/p/books/pressed-for-time-the-acceleration-of-life-in-digital-capitalism-judy-wajcman/6800274?ean=9780226380841" rel="noreferrer noopener" target="_blank"><em>Pressed for Time</em></a>: “How we use our time is fundamentally affected by the temporal parameters of work. Yet there is nothing natural or inevitable about the way we work” (166). We have fallen into the trap of believing that the way we work now is the only way we can work. We have fallen into the trap of centering work temporality in our lives. And we help cement this as the only possible reality every time we choose to go along with temporal norms that are causing us harm. In my next essay, I’m going to explore how time became centered around work and how problematic it is that we never have a definition of what it would look like to be doing enough. From there, I’m going to look at alternative views of time that might open up possibilities for changing what time is centered around and seeing our time as more embodied and more interdependent. My ideas are not the be-all end-all and I’m sure there are thinkers and theories I’ve not yet encountered that would open up even more the possibilities for new relationships with time. To that end, I’d love to get your thoughts on these topics, your reading recommendations, and your ideas for possible alternative futures in how we conceive of and use time. </p>
<p><strong>Works on Time in Libraries</strong></p>
<p>Bossaller, Jenny, Christopher Sean Burns, and Amy VanScoy. “Re-conceiving time in reference and information services work: a qualitative secondary analysis.” <em>Journal of Documentation</em> 73, no. 1 (2017): 2-17.</p>
<p>Brons, Adena, Chloe Riley, Ean Henninger, and Crystal Yin. “Precarity Doesn’t Care: Precarious Employment as a Dysfunctional Practice in Libraries.” (2022).</p>
<p>Drabinski, Emily. “A kairos of the critical: Teaching critically in a time of compliance.” <em>Communications in Information Literacy</em> 11, no. 1 (2017): 2.</p>
<p>Kendrick, Kaetrena Davis. “The public librarian low-morale experience: A qualitative study.” Partnership 15, no. 2 (2020): 1-32.</p>
<p>Kendrick, Kaetrena Davis and Ione T. Damasco. “Low morale in ethnic and racial minority academic librarians: An experiential study.” <em>Library Trends</em> 68, no. 2 (2019): 174-212.</p>
<p>Lennertz, Lora L. and Phillip J. Jones. “A question of time: Sociotemporality in academic libraries.” <em>College & Research Libraries</em> 81, no. 4 (2020): 701.</p>
<p>McKenzie, Pamela J., and Elisabeth Davies. “Documenting multiple temporalities.” <em>Journal of Documentation</em> 78, no. 1 (2022): 38-59.</p>
<p>Mitchell, Carmen, Lauren Magnuson, and Holly Hampton. “Please Scream Inside Your Heart: How a Global Pandemic Affected Burnout in an Academic Library.” <em>Journal of Radical Librarianship</em> 9 (2023): 159-179.</p>
<p>Nicholson, Karen P. “Being in Time”: New Public Management, Academic Librarians, and the Temporal Labor of Pink-Collar Public Service Work.” <em>Library Trends</em> 68, no. 2 (2019): 130-152.</p>
<p>Nicholson, Karen. “On the space/time of information literacy, higher education, and the global knowledge economy.” <em>Journal of Critical Library and Information Studies </em>2, no. 1 (2019).</p>
<p>Nicholson, Karen P. ““Taking back” information literacy: Time and the one-shot in the neoliberal university.” In <em>Critical library pedagogy handbook</em> (vol. 1), ed. Nicole Pagowsky and Kelly McElroy (Chicago: ACRL, 2016), 25-39.</p>
<p><strong>Awesome Works on Time Cited Here</strong></p>
<p>Hendren, Sara. <em>What Can a Body Do?: How We Meet the Built World.</em> Penguin, 2020.</p>
<p>Odell, Jenny. <em>Saving Time: Discovering a Life Beyond Productivity Culture</em>. Random House, 2023.</p>
<p>Wajcman, Judy. <em>Pressed for time: The acceleration of life in digital capitalism. </em>University of Chicago Press, 2020.</p>
Meredith Farkas
https://meredith.wolfwater.com/wordpress
Open Knowledge Foundation: Open Data Day 2024 – Global Statistics & Activity Report
https://blog.okfn.org/?p=29300
2024-03-12T11:03:00+00:00
<p>This year’s <a href="https://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day</a> (ODD) was a huge success. Almost 300 events registered worldwide, with 60 countries participating in 15+ different languages. </p>
<p>Before starting the <a href="https://blog.okfn.org/category/open-data-day/odd-stories/" rel="noreferrer noopener" target="_blank">#ODDStories</a> 2024 series, with reports from events around the world, we’ve just finalised a report with the main figures and data on the 2024 edition and we can’t say it any other way: <img alt="💜" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f49c.png" style="height: 1em;" /> A HEARTFELT THANKS to everyone in the open data community! <img alt="💙" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f499.png" style="height: 1em;" /></p>
<div class="wp-block-spacer" style="height: 30px;"></div>
<div class="ml-slider-3-62-0 metaslider metaslider-flex metaslider-29322 ml-slider ms-theme-default nav-hidden" id="metaslider-id-29322" style="width: 100%;">
<div id="metaslider_container_29322">
<div id="metaslider_29322">
<ul class="slides">
<li class="slide-29325 ms-image " style="display: block; width: 100%;"><img alt="" class="slider-29322 slide-29325" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-6.png" title="Slide-4_3-6" width="1024" /></li>
<li class="slide-29326 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29326" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-9.png" title="Slide-4_3-9" width="1024" /></li>
<li class="slide-29327 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29327" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-2.png" title="Slide-4_3-2" width="1024" /></li>
<li class="slide-29328 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29328" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-1.png" title="Slide-4_3-1" width="1024" /></li>
<li class="slide-29329 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29329" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-3.png" title="Slide-4_3-3" width="1024" /></li>
<li class="slide-29330 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29330" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-4.png" title="Slide-4_3-4" width="1024" /></li>
<li class="slide-29331 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29331" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-5.png" title="Slide-4_3-5" width="1024" /></li>
<li class="slide-29332 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29332" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-7.png" title="Slide-4_3-7" width="1024" /></li>
<li class="slide-29323 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29323" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-8.png" title="Slide-4_3-8" width="1024" /></li>
<li class="slide-29324 ms-image " style="display: none; width: 100%;"><img alt="" class="slider-29322 slide-29324" height="768" src="https://blog.okfn.org/wp-content/files/2024/03/Slide-4_3-10.png" title="Slide-4_3-10" width="1024" /></li>
</ul>
</div>
</div>
</div>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p>Some lessons learned from this year’s data:</p>
<ul><li>Open Data Day goes far beyond the days of the event. Our community continues to promote open data <strong>beyond the official dates</strong>.</li></ul>
<ul><li>Communities and countries in the <strong>Global South</strong> have shown a great appetite for open data and a growing mobilisation for open data for development.<br /></li><li>Our community members prioritise <strong>real interactions</strong> at face-to-face and both hyperlocal and global events.<br /></li><li>The global open data community is <strong>growing</strong>: +55.9% events in 2024 and +9.4% members compared to last year.<br /></li><li>Open Data Day is a <strong>truly diverse initiative</strong> in terms of gender, power, levels of knowledge and geography.<br /></li></ul>
<p>We at the Open Knowledge Foundation want to thank the co-organisers from the <a href="https://okfn.org/en/network/" rel="noreferrer noopener" target="_blank">Open Knowledge Network</a> – <a href="https://okfn.org/en/gambia/" rel="noreferrer noopener" target="_blank">Jokkolabs Banjul</a> (Gambia), <a href="https://okfn.de/" rel="noreferrer noopener" target="_blank">Open Knowledge Germany</a>, <a href="https://oknp.org/" rel="noreferrer noopener" target="_blank">Open Knowledge Nepal</a>, and <a href="https://okfn.org/en/ghana/" rel="noreferrer noopener" target="_blank">Open Knowledge Ghana</a> – and the sponsors of the <a href="https://blog.okfn.org/2024/02/28/and-the-winners-of-the-open-data-day-2024-mini-grants-are/" rel="noreferrer noopener" target="_blank">mini-grants</a> – <a href="https://www.hotosm.org/" rel="noreferrer noopener" target="_blank">Humanitarian OpenStreetMap Team</a> (HOT), <a href="https://www.datopian.com/" rel="noreferrer noopener" target="_blank">Datopian</a>, and <a href="https://linkdigital.com.au/" rel="noreferrer noopener" target="_blank">Link Digital</a>.</p>
<p>Let’s move on to 2025 with an even bigger, more diverse and impactful event!</p>
<div class="wp-block-spacer" style="height: 30px;"></div>
<hr class="wp-block-separator" />
<div class="wp-block-spacer" style="height: 30px;"></div>
<h3>About Open Data Day</h3>
<p><a href="https://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day</a> (ODD) is an annual celebration of open data all over the world. Groups from many countries create local events on the day where they will use open data in their communities. </p>
<p>As a way to increase the representation of different cultures, since 2023 we offer the opportunity for organisations to host an Open Data Day event on the best date within a one-week period. In 2024, a total of 287 events happened all over the world between March 2nd-8th, in 60+ countries using 15 different languages.</p>
<p>All outputs are open for everyone to use and re-use.</p>
<p>In 2024, Open Data Day was also a part of the <a href="https://www.hotosm.org/opensummit23-24" rel="noreferrer noopener" target="_blank">HOT OpenSummit ’23-24 initiative</a>, a creative programme of global event collaborations that leverages experience, passion and connection to drive strong networks and collective action across the humanitarian open mapping movement</p>
<p>For more information, you can reach out to the Open Knowledge Foundation team by emailing <a href="mailto:opendataday@okfn.org" rel="noreferrer noopener" target="_blank">opendataday@okfn.org</a>. You can also join the <a href="https://groups.google.com/forum/#!forum/open-data-day" rel="noreferrer noopener" target="_blank">Open Data Day Google Group</a> to ask for advice or share tips and get connected with others.</p>
Lucas Pretti
https://blog.okfn.org
Ed Summers: Some things to consider when deciding whether to start building with "AI" in libraries and archives.
https://inkdroid.org/2024/03/12/ai/
2024-03-12T04:00:00+00:00
<p>
I was asked to participate in a panel at work about AI. I initially
declined, but once it became clear that I would be allowed to get on my
soapbox and rant for 15 minutes I agreed. Below are my notes and some <a href="https://docs.google.com/presentation/d/1RyqenjG8PdIf9RH_evKI9HVl4PmQochmQMQXy2rm4_8/edit?usp=sharing">slides</a>.
This was not a fun post to write or present. I’m sure it rubbed some
people the wrong way, and I am genuinely sorry for that.
</p>
<hr />
<p>
I’ve done a little bit of work with AI, like downloading some models
from Hugging Face as part of named entity recognition <a href="https://github.com/sul-dlss-labs/ksr-notebooks/blob/main/FE_Supporting_Links_NER_edsu.ipynb">experiments</a>,
running Whisper on some interviews that I wanted a transcript for, <a href="https://inkdroid.org/2024/02/21/magic/">testing</a> out Google’s
new file identification tool, and writing a bot to use the OpenAI API to
generate some <a href="https://github.com/edsu/diary#readme">fake diary
entries</a> from some random words a friend of mine was publishing.
</p>
<p>
But as I listened to my CPU fan spinning, all this experience has really
done is reinforce some concerns I have as a software developer about the
AI industry, and the application of these technologies in libraries and
archives.
</p>
<p>
It’s not that I don’t think these tools and methods have some use in the
cultural heritage sector, but I do think we need to think carefully and
critically about them. I’m sure you will be familiar with at least a few
of these topics, but I thought it could be useful to bring them
together, with links to learn more, and also close each one out with
some tactics for addressing them.
</p>
<p>
If you take nothing else from this presentation I’d like it to be that
despite what the “boomers” and “doomers” would like you to believe, the
ascendency of AI is not inevitable, and we have decisions to make. What
we decide to do will have a big impact on how these technologies get
deployed.
</p>
<p>
Much of this perspective is informed by my own interest in Science and
Technology Studies, which encourages an understanding of technology in
its social and historical context, and to remember that “it could be
otherwise” <span class="citation">(<a href="https://inkdroid.org/feed.xml#ref-Woolgar:2014">Woolgar, 2014</a>)</span>.
It was also informed by reading Dan McQuillan’s book <a href="https://bristoluniversitypress.co.uk/resisting-ai">Resisting
AI</a> (highly recommended).
</p>
<hr />
<p>
Despite the recent surge of interest in Large Language Models and
Generative AI tools (ChatGPT, DALL-E, etc), AI is part of long history
of computer automation, which is continuing to transform our work, and
our lives. I have tended to prefer the term <em>Machine Learning</em>
(ML) to <em>AI</em>, because it has more specificity when discussing the
recent application of statistical algorithms to increasingly large
datasets, using increasingly large computing environments. But I’ve also
come to appreciate that the term AI is actually useful for talking about
this longer trajectory of automation, stretching back to the beginnings
of modern computing. Looking at this technology as part of a very long
project, involving a shifting set of actors is important.
</p>
<p>
However the areas that I’m going to touch on here refer mostly to recent
developments with Large Language Models, although some of them are
relevant for more specialized forms machine learning as well. There are
five points of concern, and for each area I’ll include a <em>tactic</em>
for addressing it in libraries and archives.
</p>
<h2 id="bias">
Bias
</h2>
<p>
ML models are built using data. Recent advances in Deep Learning have
largely been the result of applying decades old algorithms to
increasingly large amounts of data collected from the web. The data that
is used to train these models is significant because the models
<em>necessarily</em> reflect the data that was used to create them.
Unfortunately corporations are increasingly tight lipped about the data
that has been used to train these models (more on that next).
</p>
<p>
Some commonly used datasets like CommonCrawl represent significantly
large collections of web data, but the web is a big place, and decisions
have gone into <a href="https://foundation.mozilla.org/en/research/library/generative-ai-training-data/common-crawl/#executive-summary">what
websites were collected</a>. CommonCrawl is not representative of the
web as a whole. Furthermore LLMs encode biases that are present in
today’s society. Blindly using and becoming dependent on LLMs risks
further encrusting these biases and participating in systemic racism.
</p>
<p>
As LLMs are used to generate more and more web content there is also a
risk that this data is again collected and used to train future models.
This process has been called <a href="https://www.theregister.com/2024/01/26/what_is_model_collapse/">Model
Collapse</a> and has been shown to lead to a <a href="https://arxiv.org/pdf/2305.17493.pdf">process of forgetting</a>.
OpenAI launched a tool for identifying content generated with an LLM and
had to shut it down 6 months later because it didn’t work, and its not
clear that it can even be done with reliability. What would it mean to
only train these models with pre-2023 data?
</p>
<hr />
<p>
<strong>Tactic: When evaluating an AI tool always see if you can
identify what data has been used to train the model(s). How has it been
“cleaned” or shaped? How is it updated?</strong>
</p>
<hr />
<h2 id="intellectual-property">
Intellectual Property
</h2>
<p>
Since LLMs have been built with data collected from the web this
includes many types of content, from openly licensed datasets designed
to be shared, to copyrighted books like those found in the
<em>books1,2,3</em> datasets, which are rumored to have been assembled
from shadow libraries like Library Genesis and SciHub. Over the last
year we’ve seen several lawsuits including from the Authors Guild
challenging OpenAI’s use of copyrighted materials in building their GPT
models.
</p>
<p>
In some ways these types of lawsuits are not new to the web. Napster was
challenged by the Recording Industry Association of American; Google
Books was sued by the Authors Guild in the mid 2000s; the Internet
Archive has been recently sued over its Open Library platform. But what
makes LLMs a bit different is the way they transform the content they’ve
collected, rather than making it available verbatim. The US Copyright
Office published a <a href="https://www.copyright.gov/ai/">notice of
inquiry</a> last year to gather information about the use of copyrighted
materials in AI tools, which we can expect to hear more about this year.
</p>
<p>
But this is not just an issue for blatantly pirated material.
</p>
<p>
<a href="https://www.nytimes.com/2023/12/27/business/media/new-york-times-open-ai-microsoft-lawsuit.html">The
New York Times is also suing</a> because of how millions of their openly
published news stories were used by OpenAI to train their models,
without a license. OpenAI is in the midst of <a href="https://www.theverge.com/2024/1/4/24025409/openai-training-data-lowball-nyt-ai-copyright">trying
to negotiate</a> licensing contracts after the fact with many big
players.
</p>
<p>
The way LLMs function represents a big shift in how the web ecosystem
has evolved. Web search engines like Google crawl web pages to index
them, and provide users with search results that link back to the
original website. Similarly, social media platforms have provided a
place to discuss web content by sharing links to it, driving other users
to the web publisher.
</p>
<p>
In the LLM paradigm users never leave the ChatGPT interface, and the
original publisher is completely cut out of the virtuous circle. LLMs
are enclosing the web commons, and threaten to choke off the very
sources of content that they used. Web publishers will lose the ability
to understand how their content is being used.
</p>
<p>
Some web publishers have chosen to tell LLM bots to stop using
robots.txt. Not all the bots collecting data from the web for LLMs will
respect robots.txt files. In one experiment <a href="https://palewi.re/docs/news-homepages/openai-gptbot-robotstxt.html">Ben
Welsh found</a> that 54% of news publishers (628 out of 1156) have
decided to block OpenAI, Google AI, or CommonCrawl.
</p>
<hr />
<p>
<strong>Tactic: What content should we make available to Generative AI
tools. What would our donors want?</strong>
</p>
<hr />
<h2 id="verifiability">
Verifiability
</h2>
<p>
One of the reasons why ChatGPT doesn’t link to websites as citations is
that <em>it doesn’t know what to link to</em>. In LLMs the neural
network doesn’t record information about where a particular piece of
data came from. As LLMs get integrated into more traditional search
tools the challenge is to <a href="https://ethanzuckerman.com/2023/10/10/heather-ford-is-the-web-eating-itself-llms-versus-verifiability/A">make
generated text verifiable</a> in the sense that the results include
in-line citations, which should support the statement that they are used
in.
</p>
<p>
Verifiability is important for understanding when generated content is
out of alignment with the world, a so called “hallucination”. It’s also
important for explaining why the model generated the response it did,
when trying to debug why some interaction went wrong. Explainability is
an <a href="https://arxiv.org/abs/2309.01029">active research area</a>
in the ML/AI community, and it’s not clear that given the model size and
the size of the training data, whether the models can be made
explainable, because at a fundamental level <a href="https://www.scientificamerican.com/article/how-ai-knows-things-no-one-told-it/">we
don’t understand</a> why they work. Generative AI applications that
include citations have been <a href="https://arxiv.org/abs/2304.09848">shown to be unreliable</a>, and
provide a false sense of security.
</p>
<p>
The lack of explainability in LLMs presents real problems for libraries
and archives whose raison d’être is to provide users with documents,
whether they are books, maps, photographs, sound recordings, films,
letters, etc. We describe these documents, and preserve these documents,
in order to provide access to them, so that users can derive meaning
from them. If we use an LLM to generate a response to a query or prompt,
and we can’t back up the response with citations to these documents,
this a problem.
</p>
<p>
This lack of verifiability is starting to be a problem for <a href="https://futurism.com/wikipedia-cnet-unreliable-ai">Wikipedia</a>
too.
</p>
<hr />
<p>
<strong>Tactic: Library and archives professionals have a role in
evaluating how AI tools cite documents as evidence.</strong>
</p>
<hr />
<h2 id="work">
Work
</h2>
<p>
Part of the value proposition behind recent AI tools like GitHub’s
Copilot, ChatGPT or DALL-E is that they <em>democratize</em> access to
some skill whether it be writing code, authoring news stories, or
creating illustrations. But is it democratic to systematically undermine
creative workers, by stealing their content without having asked to use
it in the first place?
</p>
<p>
When you make a decision to use these tools you are potentially <a href="https://www.latimes.com/opinion/story/2022-12-21/artificial-intelligence-artists-stability-ai-digital-images">replacing</a>
a person’s skill with a service. Furthermore you are binding your own
organization to the whims of a corporation which would like nothing
better than for you to <a href="https://www.gitclear.com/coding_on_copilot_data_shows_ais_downward_pressure_on_code_quality">divest</a>
of your organization’s expertise and become completely dependent on
their service. It’s a trap.
</p>
<p>
If the past is any guide, we can also expect that skilled creative jobs
will be replaced with lower paid jobs that involve mundane <a href="https://www.vice.com/en/article/pkap3m/gpt-4-cant-replace-striking-tv-writers-but-studios-are-going-to-try">cleaning</a>
of the messes that have been made by automation. Or in the words of
screenwriter C. Robert Cargill (quoted in the previous link):
</p>
<blockquote>
<p>
The immediate fear of AI isn’t that us writers will have our work
replaced by artificially generated content. It’s that we will be
underpaid to rewrite that trash into something we could have done better
from the start. This is what the WGA is opposing and the studios want.
</p>
</blockquote>
<p>
LLMs like ChatGPT are built using a technique called Reinforcement
Learning with Human Feedback (RLHF). The important part here is the
human feedback. <a href="https://yalebooks.yale.edu/book/9780300261479/behind-the-screen/">Who
is providing this feedback?</a> Are they users of the system? What types
of systematic biases does this training introduce? Are they lower paid
“<a href="https://ghostwork.info/">ghost workers</a>”?
</p>
<hr />
<p>
<strong>Tactic: When evaluating the use of AI tools involve the people
whose work is impacted in the decision making and
implementation.</strong>
</p>
<hr />
<h2 id="sustainability">
Sustainability
</h2>
<p>
Probably the most <a href="https://www.reuters.com/sustainability/climate-energy/power-mad-ais-massive-energy-demand-risks-causing-major-environmental-headaches-2023-12-04/">troubling</a>
aspect to the latest wave of AI technology is <a href="https://www.nature.com/articles/d41586-024-00478-x">their
environmental impact</a>. Recent advances in LLMs were not achieved
through a better understanding of how neural networks work, but by using
existing algorithms with massive amounts of data and compute resources.
This training can takes months of time, and needs to be repeated to keep
models up to date.
</p>
<p>
Apparently the initial training of GPT-4 took $100 million. The training
relies on Graphical Processor Units (GPU) which are faster than CPUs for
the types of computation that LLMs demand, but require up to four times
as much energy to run. Data centers require <a href="https://www.theatlantic.com/technology/archive/2024/03/ai-water-climate-microsoft/677602/">water</a>
<a href="https://www.theguardian.com/commentisfree/2024/mar/02/ais-craving-for-data-is-matched-only-by-a-runaway-thirst-for-water-and-energy">to
cool</a>, sometimes in environments where it is scarce. This isn’t just
a problem for training models, it’s a bigger problem for <a href="https://wimvanderbauwhede.codeberg.page/articles/climate-cost-of-ai-revolution/">querying
them</a> which has been estimated to be 60-100 times more in terms of
energy utilization. Another problem lurking here is the lack of data
from data centers that provides transparency about what is going on.
</p>
<p>
Is this the really the right direction for us to be headed as we are
trying to reduce energy costs globally to limit global warming?
</p>
<p>
The tech industry is incentivized to try to make AI infrastructures more
efficient. But <a href="https://en.wikipedia.org/wiki/Jevons_paradox">Jevons Paradox</a>
will likely hold: technological progress increases the efficiency with
which a resource is used, but the falling cost of use induces increases
in demand enough that the resource use is increased.
</p>
<p>
My links runneth over:
</p>
<ul>
<li>
<a href="https://aclanthology.org/P19-1355/">Energy and Policy
Considerations for Deep Learning in NLP</a>
</li>
<li>
<a href="https://www.sciencedirect.com/science/article/pii/S2542435123003653">The
growing energy footprint of artificial intelligence</a>
</li>
<li>
<a href="https://wires.onlinelibrary.wiley.com/doi/10.1002/widm.1507">A
systematic review of Green AI</a>
</li>
<li>
<a href="https://www.sciencedirect.com/science/article/pii/S2542435123003653">The
growing energy footprint of artificial intelligence</a>
</li>
<li>
<a href="https://www.markey.senate.gov/news/press-releases/markey-heinrich-eshoo-beyer-introduce-legislation-to-investigate-measure-environmental-impacts-of-artificial-intelligence">Markey,
Heinrich, Eshoo, Beyer Introduce Legislation to Investigate, Measure
Environmental Impacts of Artificial Intelligence</a>
</li>
</ul>
<hr />
<p>
<strong>Tactic: Libraries and archives should be looking for ways to
<a href="https://www.science.org/doi/10.1126/science.aam9744">reduce
energy consumption</a> not increase it.</strong>
</p>
<hr />
<h2 id="security-and-privacy">
Security and Privacy
</h2>
<p>
Generative AI is a dual use technology. Experts are increasingly worried
that it will be used to create disinformation as well as fake
interactions online. We’ve had court cases where filings made by lawyers
contained <a href="https://www.nytimes.com/2023/05/27/nyregion/avianca-airline-lawsuit-chatgpt.html">citations
to cases</a> that didn’t exist. AI generated voice <a href="https://www.pbs.org/newshour/show/how-ai-generated-misinformation-threatens-election-integrity">robo-calls</a>
have been made illegal because of how AI tools were used to impersonate
Biden’s voice. Bad actors can manipulate images and video to target
specific groups because the tools are more powerful and accessible.
There are possible ways to mitigate this by using trusted sources of
information and provable ways of sharing the provenance of media.
</p>
<p>
Since the mechanics of how LLMs generate content are not explainable
they are susceptible to attacks like what Simon Willison calls <a href="https://simonwillison.net/2023/Apr/14/worst-that-can-happen/">prompt
injection</a>. This is where a prompt is crafted to subvert the original
design of the system to generate an intended response. This has serious
ramifications for the use of LLM technology as glue between other
automated systems. Indeed this was recently <a href="https://arxiv.org/abs/2403.02817">demonstrated</a> by researchers
using OpenAI and Google APIs to execute arbitrary code, and exfiltrate
personal information.
</p>
<p>
While its not great to conflate privacy with security, I’m running out
of time, and it’s important to note that privacy is also a problem. As
LLM APIs are deeply integrated into applications, data will flow from
one context into another. For example <a href="https://www.pcmag.com/news/docusign-tapping-user-data-to-train-ai-models-offers-vague-privacy-promises">Docusign</a>
and Dropbox recently announced that they were integrated OpenAI into
their products. When enabled your data will flow to OpenAI who may or
may not use it to further train their models.
</p>
<hr />
<p>
<strong>Tactic: support <a href="https://www.techpolicy.press/imagining-the-possibilities-for-an-online-civil-rights-act/">legislation</a>
that gives users agency over their data and practices that help ensure
authenticity and <a href="https://www.bbc.co.uk/mediacentre/2024/content-credentials-bbc-verify">provenance</a>.</strong>
</p>
<hr />
<h3 class="unnumbered" id="references">
References
</h3>
<div class="references csl-bib-body hanging-indent" id="refs">
<div class="csl-entry" id="ref-Woolgar:2014">
Woolgar, S. (2014). Struggles with representation: Could it be
otherwise? In <em>Representation in scientific practice revisited</em>.
The MIT Press. <a href="https://doi.org/10.7551/mitpress/9780262525381.003.0018">https://doi.org/10.7551/mitpress/9780262525381.003.0018</a>
</div>
</div>
Ed Summers
https://inkdroid.org/
David Rosenthal: Good News For Tether
tag:blogger.com,1999:blog-4503292949532760618.post-2185426314207502200
2024-03-11T21:36:47+00:00
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjESzRWPZCJ03RH3zQeeDLVrIil0qmNBGMvjVvfRS-BCPyzGpcBuVxxJ4uUHt-V-ptRTw7QahDOsLzoqwEpj24vDghxJ_yDvoIAyB-NuQgzEn4NOcgmiLmmCxMYI2MrnT6TGTiG7KMj51a96AeYAL7a7quvT_S1uvLfVZjYn4BFXpA8J-B1mLU8ucODgvsl/s630/USDT_1Y_graph_coinmarketcap.jpeg" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="121" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjESzRWPZCJ03RH3zQeeDLVrIil0qmNBGMvjVvfRS-BCPyzGpcBuVxxJ4uUHt-V-ptRTw7QahDOsLzoqwEpj24vDghxJ_yDvoIAyB-NuQgzEn4NOcgmiLmmCxMYI2MrnT6TGTiG7KMj51a96AeYAL7a7quvT_S1uvLfVZjYn4BFXpA8J-B1mLU8ucODgvsl/w200-h121/USDT_1Y_graph_coinmarketcap.jpeg" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://coinmarketcap.com/currencies/tether/">USDT "market cap"</a></td></tr></tbody></table>
The good news for Tether is shown in this graph, with two huge surges in "market cap" this year. One of about $15B early in the year, and another of about $6B recently. It looks like the euphoria over the prospect of <a href="https://blog.dshr.org/2023/11/desperately-seeking-retail.html">spot Bitcoin ETFs</a> has solved the <a href="https://blog.dshr.org/2022/11/greater-fool-supply-chain-crisis.html"><i>Greater Fool Supply-Chain Crisis</i></a> with the cryptosphere experiencing a massive inflow of around $20B actual dollars. As one might expect from injecting $20B whose only uses are to HODL or to buy cryptocurrency into the market, the result has been a massive bubble in cryptocurrency "prices".<br />
<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiUYsQqauaUBYn_uS9gUXrGenOZbltiSOf2DFwDyeBuflKklG5Tr9Jp9NU4I1w4liMFDiAyl9Uogft5I0XQvHoHU0Y6wccRcTttJaOb9_l9xXz6beSGfZcq6GkMT4mo4YN1WOASTQYawqFeC_XaxXxXpO0TNVgagJlLXaaxVuu2vpIaMoPtkyBFzLoKWZT8/s630/BTC_1Y_graph_coinmarketcap.jpeg" style="clear: right; display: block; margin-left: auto; margin-right: auto; padding: 1em 0px; text-align: center;"><img alt="" border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiUYsQqauaUBYn_uS9gUXrGenOZbltiSOf2DFwDyeBuflKklG5Tr9Jp9NU4I1w4liMFDiAyl9Uogft5I0XQvHoHU0Y6wccRcTttJaOb9_l9xXz6beSGfZcq6GkMT4mo4YN1WOASTQYawqFeC_XaxXxXpO0TNVgagJlLXaaxVuu2vpIaMoPtkyBFzLoKWZT8/s200/BTC_1Y_graph_coinmarketcap.jpeg" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://coinmarketcap.com/currencies/bitcoin/">BTC "price"</a></td></tr></tbody></table>
Bitcoin has gone from about $16K at the start of the year to around $42K recently. Ethereum has merely doubled, from about $1.2K to about $2.4K.<br />
<br />
So all is well with the world; Tether gets to keep the interest on another $20B, which at say 4% is an extra $800M/year on their bottom line, and the Bitcoin HODL-ers see their <strike>investment</strike> gamble return a 160% gain. Is all really well with the world? Follow me below the fold.<br />
<span><a name="more"></a></span>
<br />
Did hordes of retail <strike>investors</strike> gamblers really send $20B in cash to Tether? No, because Tether only deals with authorized institutions, and only in amounts over $100K, Did institutions send Tether $20B in cash? It seems unlikely. At various times over its history Tether has been caught minting USDT in return for things other than cash, such as loans to mysterious Chinese companies, or even thin air.
It has never been audited, and has been described as being "<a href="https://www.bloomberg.com/news/features/2021-10-07/crypto-mystery-where-s-the-69-billion-backing-the-stablecoin-tether">practically quilted out of red flags</a>". Matt Levine <a href="https://www.bloomberg.com/opinion/articles/2023-10-31/bad-passwords-are-securities-fraud">says</a> "I feel like eventually Tether is going to be an incredibly interesting story, but I still don’t know what it is."<br />
<br />
Part of the story was revealed by Dirty Bubble Media in <a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent"><i>Tether’s Secret Agent</i></a>:<br />
<blockquote>
When people talk about the leaders of Tether, a few names typically occupy the conversation. Paolo Ardoino, recently promoted from CTO to CEO of the company, is the public face of Tether on Twitter and in the media. Giancarlo Devasini, the failed plastic surgeon, then failed software pirate, now billionaire CFO (how’s that for a career trajectory?), is widely regarded as the de facto leader of the company. And people often joke about Tether’s absentee former CEO, JL van der Velde, asking whether he even exists (he does, and he also was a serial failure before joining Tether).<br />
...<br />
We have been puzzled by Tether’s leadership for some time. The massive success of the company and the complexity of its operations seem beyond the abilities of a small-time scam operator and repeat failure (Giancarlo), an inexperienced and sweaty front man (Paolo), and another failed businessman and absentee executive (JL). Notably, none of these guys had any significant prior experience in finance, and it is unlikely that any of them had the sorts of business and political ties that would be essential for keeping a controversial company like Tether afloat.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgvboHXiL5MuriIDGEQjmhXK9igJZnhEo3RM_GN_rbN9QLfZy3a1RunR-UWHPha-g03jzyKdKHOEXJCDNFIpGFMJfoNv3bWc9L3ikiQjrhKWBY-WjOTN6dSLoaNaJA6DnJoMoGEvGc-WwqaET6csgMQOJrapOejTXngdnx8AQxdw9wnz5pArn5jEvtsVAxZ/s639/Harborne.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="196" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgvboHXiL5MuriIDGEQjmhXK9igJZnhEo3RM_GN_rbN9QLfZy3a1RunR-UWHPha-g03jzyKdKHOEXJCDNFIpGFMJfoNv3bWc9L3ikiQjrhKWBY-WjOTN6dSLoaNaJA6DnJoMoGEvGc-WwqaET6csgMQOJrapOejTXngdnx8AQxdw9wnz5pArn5jEvtsVAxZ/w200-h196/Harborne.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent">Harborne?</a></td></tr></tbody></table>
But Dirty Bubble Media identifies <a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent">someone else involved</a>:<br />
<blockquote>
His name is Christopher Charles Sherriff Harborne, AKA Chakrit Sakunkrit. Styling himself a “digital nomad,” Mr. Harborne’s busy hands reach across continents, industries, and political movements. The scope of Mr. Harborne’s activities and the apparent wealth backing his activities is staggering. Among those many diverse interests, it appears that Tether has become one of his most important interests. And Mr. Harborne has been far more than a passive investor in the company. Indeed, the available evidence suggests that Mr. Harborne’s involvement in Tether is far more significant than generally recognized…
</blockquote>
The details of <a href="https://en.wikipedia.org/wiki/Christopher_Harborne">Harborne</a>'s career are <a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent">fascinating</a>:<br />
<blockquote>
Mr. Harborne has major ownership interests in other notable companies. Through a Delaware corporation he is the largest minority shareholder in QinetiQ, a major British defense contractor; this stake is worth around $200 million at present. He is also the sole owner of IFX Payments, a British fintech company specializing in moving large sums of money around the globe (hm). In total, over a dozen corporate entities spread across the globe have been linked to Mr. Harborne, with many more likely still hidden.
</blockquote>
He was a major funder of the <a href="https://www.theguardian.com/politics/2023/dec/30/britons-brexit-bad-uk-poll-eu-finances-nhs">disastrous Brexit campaign</a> and the UK's second <a href="https://www.theguardian.com/politics/2022/oct/20/iceberg-lettuce-in-blonde-wig-outlasts-liz-truss">worst recent Prime Minister</a>, Boris Johnson. He owns "<a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent">around 12% of Tether and a similar percentage of equity in Bitfinex</a>". After the <a href="https://ag.ny.gov/press-release/2021/attorney-general-james-ends-virtual-currency-trading-platform-bitfinexs-illegal">Crypto Capital Corp seizure</a> they lost access to the US banking system, and Harborne was apparently <a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent">critical to rescuing them both</a>. The details of this are the subject of a lawsuit <a href="https://amycastor.com/2024/03/09/tether-ftx-and-deltec-bank-money-time/">Harborne filed against the <i>WSJ</i></a>. <a href="https://www.dirtybubblemedia.com/p/tethers-secret-agent">Dirty Bubble Media</a> updated the post:<br />
<blockquote>
<i>Note: On February 28, 2024, Mr. Harborne sued the Wall Street Journal regarding alleged defamation. Mr Harborne alleges that the Journal misrepresented his opening of a bank account at Signature Bank as attempting to assist Bitfinex during a time of crisis. The Journal removed this portion of the article several days prior to the suit being filed. Mr. Harborne’s attorneys subsequently contacted Dirty Bubble Media requesting parts of the article referring to the Journal’s story be removed; while the case is pending we have acquiesced to their request. Additionally, we have added clarification regarding Mr. Harborne’s role as a “principal” at Bitfinex.</i>
</blockquote>
With a cast of characters this sketchy, skepticism is warranted, and in <a href="https://davidgerard.co.uk/blockchain/2023/12/06/bitcoin-goes-up-can-5-billion-unbacked-tethers-kickstart-a-fresh-crypto-bubble/"><i>Bitcoin goes up! Can 5 billion unbacked tethers kickstart a fresh crypto bubble?</i></a> Amy Castor and David Gerard supply some:<br />
<blockquote>
Bitcoin is over $44,000! In just the last week, the <a href="https://davidgerard.co.uk/blockchain/2022/04/10/the-national-posts-invisible-hand-society-nft-collection-the-facepalm-manifesto/">invisible hand of the market</a> suddenly decided that bitcoins are really good now!<br />
<br />
By complete coincidence, Tether has printed five billion USDT stablecoins in the past month out of thin air as “loans” — backed in the Tether reserve only by the “loans” themselves.<br />
<br />
How high can you pump a number with five billion fake dollars to deploy?<br />
...<br />
In just one month, from November 5 to December 5, Tether’s issuance climbed from 85 billion to 90 billion.<br />
</blockquote>
And the printer is still going. Molly White reports that <a href="https://web3isgoinggreat.com/?id=tether-christmas-2023-mint"><i>Tether mints itself a $1 billion Christmas present</i></a>:<br />
<blockquote>
On December 25, Tether minted 1 billion of its USDT dollar-pegged stablecoin. CEO Paolo Ardoino announced on Twitter that the mint was an "authorized but not issued transaction, meaning that this amount will be used as inventory for next period issuance requests and chain swaps". This seems to be a recent trend for Tether, as similar language was used for a $1 billion mint in September.<br />
<br />
The activity has raised more questions around where the real money backing Tether is coming from, and if it even exists at all. Some have argued that these recent Tether mints are being used to artificially inflate the price of Bitcoin, which has been on an upward trend since mid-October.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhJbZz4zTM1ZScRjtjTrv7Fec7vVLS1naQjx_u1sJgv9eHU4ghbCClZ1yCIUQEK_N8BHPsA3P-p3dclgjk6_EgrqVRAWMBVyojcbBjMe9iBBnd6WaTh8n8gcZd9hWPfgp9dP_0AGwwylNiazsvkSmm1-Yx9As52rKaWzfi1aJoKrYoRAu5C_AFpljluoHMw/s975/USDC_1Y_graph_coinmarketcap.jpeg" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="81" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhJbZz4zTM1ZScRjtjTrv7Fec7vVLS1naQjx_u1sJgv9eHU4ghbCClZ1yCIUQEK_N8BHPsA3P-p3dclgjk6_EgrqVRAWMBVyojcbBjMe9iBBnd6WaTh8n8gcZd9hWPfgp9dP_0AGwwylNiazsvkSmm1-Yx9As52rKaWzfi1aJoKrYoRAu5C_AFpljluoHMw/w200-h81/USDC_1Y_graph_coinmarketcap.jpeg" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://coinmarketcap.com/currencies/usd-coin/">USDC "market cap"</a></td></tr></tbody></table>
Amy Castor and David Gerard <a href="https://davidgerard.co.uk/blockchain/2023/12/06/bitcoin-goes-up-can-5-billion-unbacked-tethers-kickstart-a-fresh-crypto-bubble/">ask the right question</a>:<br />
<blockquote>
You would think, with that kind of totally genuine and organic market demand for stablecoins, USDC’s issuance would also be going up — but no. USDC’s issuance is 24.4 billion, having seen a steady decrease from 44 billion in March 2023.<br />
<br />
So where is Tether getting all the dollars to back these tethers?<br />
<br />
It isn’t. Tether’s printing press is not fueled by demand. This is Tether issuing loans to some of its biggest customers — printing pseudo-dollars out of thin air, with the only “backing” being the loan itself, counted as an asset. The loans are secured by cryptos held as collateral — not as reserves. No actual dollars flow into the system this way.<br />
</blockquote>
Why would crypto-bros prefer USDT to USDC if it is doubtful that it is fully backed? It might have something to do with USDT lacking "regulatory clarity". How do we know Tether is printing USDT out of thin air? <a href="https://davidgerard.co.uk/blockchain/2023/12/06/bitcoin-goes-up-can-5-billion-unbacked-tethers-kickstart-a-fresh-crypto-bubble/">Because</a>:<br />
<blockquote>
Tether spent years denying that they issued tethers from thin air as loans — then Alex Mashinsky of Celsius Network confirmed in October 2021 that Celsius had been taking out such loans from Tether. It came out in the <a href="https://davidgerard.co.uk/blockchain/2021/10/29/the-cftc-settlement-with-tether-and-bitfinex-42-5-million-dollars-in-fines/">CFTC settlement</a> later that month that they had been doing this precise thing for a while.<br />
<br />
Tether admitted in September that it was making “secured” loans again — after saying in December 2022 that it would reduce its secured loans to zero. [<a href="https://www.wsj.com/finance/currencies/tether-is-lending-its-stablecoins-again-b11705f2">WSJ</a>]
</blockquote>
Presumably, during the "crypto winter", there would have been a need for customers to exchange USDT for USD. But <a href="https://davidgerard.co.uk/blockchain/2023/12/06/bitcoin-goes-up-can-5-billion-unbacked-tethers-kickstart-a-fresh-crypto-bubble/">did they?</a>:<br />
<blockquote>
In mid-2022, after <a href="https://davidgerard.co.uk/blockchain/2022/05/10/terras-stablecoin-does-a-2008-crisis-ust-crashes-and-takes-bitcoin-with-it/">the Terra-Luna collapse</a>, Tether <a href="https://davidgerard.co.uk/blockchain/2022/06/13/celsius-goes-fahrenheit-451-and-number-goes-down/">bragged</a> that it had <a href="https://davidgerard.co.uk/blockchain/2022/06/22/crypto-collapse-latest-the-contagion-spreads/">“redeemed” 16 billion USDT</a>. We would assume most or all of that was loans being canceled and the tethers burned. We certainly don’t know of any independently verifiable evidence that a single actual dollar was transferred in return.<br />
<br />
For comparison, USDC reserves are held in short-term treasuries and cash in US bank accounts. A USDC appears to have an actual dollar backing it — and now that interest rates are up, Circle has been making a ton of money.<br />
<br />
If Tether had billions of real dollars backing its tethers — as it claims — then the folks running Tether could also make a ton of money simply by putting the reserve into Treasury bills. They do not need to be making loans.<br />
<br />
In late 2022, CZ from Binance was deeply upset that Sam Bankman-Fried from FTX might destabilize tethers by <a href="https://amycastor.com/2022/12/18/crypto-collapse-binance-is-not-so-fine-ftx-delaware-vs-ftx-bahamas-celsius-voyager-gemini-tether/">trying to cash out … $250,000 worth</a>. That’s out of a supposed reserve in the <i>billions</i>. This brings into serious question how many actual dollars are anywhere near Tether — clearly not enough.
</blockquote>
What is going on has similarities with Celsius' <a href="https://celsius.network/cel-token-explained">"flywheel"</a>. <a href="https://davidgerard.co.uk/blockchain/2023/12/06/bitcoin-goes-up-can-5-billion-unbacked-tethers-kickstart-a-fresh-crypto-bubble/">Castor and Gerard explain</a>:<br />
<blockquote>
Crypto institutions — exchanges, hedge funds — use the tethers to buy leverage and pump the price. They post their inflated crypto as collateral to borrow more USDT and keep pumping. [<a href="https://www.dirtybubblemedia.com/p/examining-tethers-secret-loan-portfolio">Dirty Bubble</a>]
</blockquote>
These crypto-backed loans are the fuel for the pump inflating the cryptocurrency bubble. <a href="https://www.dirtybubblemedia.com/p/examining-tethers-secret-loan-portfolio">Dirty Bubble Media concludes</a>:<br />
<blockquote>
Based on the current dataset, we can estimate that Tether issued many billions of USDT backed by crypto collateral. The impact is far larger than one might assume just from looking at the loan balances. For example, Tether lent Celsius Network just over $4 billion in total. Our data indicates that other parties like Amber and 3AC similarly received billions in loans, which round-tripped their way through the crypto-conomy without ever touching the real financial system or ever being backed by real money….<br />
<br />
Many questions remain unanswered:<br />
<ul>
<li>What percentage of Tether’s “redemptions” are actually loan repayments?</li>
<li>What is the impact of cycling billions of crypto-backed USDT through the crypto markets?</li>
<li>And, why did Tether’s reported secured loans massively diverge from this data starting in May/June 2022, around the same time as Terra, Celsius Network, and Three Arrows Capital collapsed?</li>
</ul>
</blockquote>
US regulators have been suspicious of Tether for a long time, and they <a href="https://amycastor.com/2023/12/28/crypto-collapse-mt-gox-payouts-tether-hooks-up-the-feds-sec-says-no-to-coinbase-crypto-media-mergers/">seem to have lost patience</a>:<br />
<blockquote>
The US government isn’t entirely happy with Tether’s financial shenanigans. But they’re really unhappy about sanctions violations, especially with what’s going on now in the Middle East. <br />
<br />
So Tether has announced that it will now be freezing OFAC-sanctioned blockchain addresses — and it’s onboarded the US Secret Service and FBI onto Tether! [<a href="https://tether.to/en/tether-introduces-new-policy-to-strengthen-ecosystem-security/">Tether</a>, <a href="https://archive.is/MZtxn">archive</a>; <a href="https://assets.ctfassets.net/vyse88cgwfbl/6KDtp7U4IcH03zPWnpG11n/1b052835c72f2c7be0bb5ec5bd5a89fc/Tether_Lummis_Hill_Follow_up_Letter.pdf">letter</a>, PDF, <a href="https://web.archive.org/web/20231216090755/https://assets.ctfassets.net/vyse88cgwfbl/6KDtp7U4IcH03zPWnpG11n/1b052835c72f2c7be0bb5ec5bd5a89fc/Tether_Lummis_Hill_Follow_up_Letter.pdf">archive</a>]<br />
<br />
Tether doesn’t do anything voluntarily. We expect they were told that they would allow this or an extremely large hammer would come down upon them.
</blockquote>
It looks like Tether has achieved some "regulatory clarity". It isn't just US authorities who can get Tether to freeze wallets. Patrick Tan asks <a href="https://medium.com/chainargos/what-happens-when-tether-freezes-your-tether-5a8ece2bd508"><i>What happens when Tether “freezes” your Tether?</i></a>, and recounts the tale of The Victim, whose wallet was frozen at the request of Indian law enforcement. Tan <a href="https://medium.com/chainargos/what-happens-when-tether-freezes-your-tether-5a8ece2bd508">concludes</a>:<br />
<blockquote>
The Victim’s transactions are at very most 3 hops away from known bad actors, so it’s not entirely unreasonable for Indian authorities to require more information and detailed documents, not to mention the backdrop of ongoing scams abusing already stretched Indian law enforcement agencies.<br />
<br />
For Tether’s part, it looks as though they received a request from Indian law enforcement and followed it.<br />
<br />
But perhaps, and somewhat more significantly, there is also the risk that Tether blacklists the USDT in your wallet, in response to government requests, regardless if those requests are lawful or not.<br />
<br />
It’s entirely possible for government officials or authorities with a personal vendetta, to target causes, or political opponents who receive donations or are known to transact in USDT, and for Tether to err on the side of caution and comply.<br />
<br />
It’s entirely possible that many “innocent” addresses are blacklisted in such opaque processes, collateral damage in purges.
</blockquote>
The Victim might well be collateral damage, but it was essentially impossible to supply the "more information and detailed documents" that law enforcement required to lift the freeze. Such are the risks of operating without "regulatory clarity". But the kind of "regulatory clarity" the US has imposed on Tether isn't likely to help The Victim or others who become collateral damage, it is likely to increase their numbers. An increasing number of users unable to access their funds is a double-edged sword for Tether; the good news is that Tether gets to keep the interest on frozen funds, the <i>good</i> news is that more and more people figure out how risky USDT is.<br />
<br />
David. (noreply@blogger.com)
https://blog.dshr.org/
Peter Murray: Learnings from the British Library Cybersecurity Report
https://dltj.org/article/british-library-cybersecurity-report
2024-03-09T21:08:28+00:00
<figure class="align-right" style="width: 400px;"> <a href="https://www.bl.uk/home/british-library-cyber-incident-review-8-march-2024.pdf"> <img alt="" src="https://dltj.org/assets/images/2024/2024-03-09-british-library-report-cover.png" width="400" /> </a> </figure>
<p>The British Library suffered a <a href="https://en.wikipedia.org/wiki/British_Library_cyberattack" title="British Library cyberattack | Wikipedia">major cyber attack in October 2023</a> that encrypted and destroyed servers, exfiltrated 600GB of data, and has had an ongoing disruption of library services after four months. Yesterday, the Library published <a href="https://www.bl.uk/home/british-library-cyber-incident-review-8-march-2024.pdf" title="Learning Lessons From the Cyber-Attack: British Library cyber incident review | British Library">an 18-page report</a> on the lessons they are learning. (There are also some <a href="https://via.hypothes.is/https://www.bl.uk/home/british-library-cyber-incident-review-8-march-2024.pdf">community annotations on the report</a> on Hypothes.is.)</p>
<p>Their investigation found the attackers likely gained access through compromised credentials on a remote access server and had been monitoring the network for days prior. The attack was a typical ransomware job: get in, search for personal data and other sensitive records to copy out, and encrypt the remainder while destroying your tracks. The Library did not pay the ransom and has started the long process of recovering its systems.</p>
<p>The report describes in some detail how the Library recognized that its conglomeration of disparate systems over the years left them vulnerable to service outages and even cybersecurity attacks. They had started a modernization effort to address these problems, but the attack dramatically exposed these vulnerabilities and accelerated their plans to replace infrastructure and strengthen processes and procedures.</p>
<p>The report concludes with lessons learned for the library and other institutions to enhance cyber defenses, response capabilities, and digital modernization efforts. The library profession should be grateful to the British Library for their openness in the report, and we should take their lessons to heart.</p>
<h2 id="the-attack">The Attack</h2>
<p>The report admits that some information needed to determine the attackers’ exact path is likely lost. Their best-effort estimate is that a set of compromised credentials was used on a Microsoft Terminal Services server (<a href="https://learn.microsoft.com/en-us/windows/win32/termserv/terminal-services-is-now-remote-desktop-services" title="Terminal Services has been renamed | Microsoft Learn">now called Remote Desktop Services</a>). Multi-factor authentication (MFA, sometimes called 2FA) was used in some areas of the network, but connections to this server were not covered. The attackers tripped at least one security alarm, but the sysadmin released the hold on the account after running malware scans.</p>
<p>Starting in the overnight hours from Friday to Saturday, the attackers copied 600GB of data off the network. This seems to be mostly personnel files and personal files that Library staff stored on the servers. The network provider could see this traffic looking back at network flows, but it is unclear whether this tripped any alarms itself. Although their Integrated Library System (an <a href="https://exlibrisgroup.com/products/aleph-integrated-library-system/" title="Aleph Integrated Library System | Ex Libris">Aleph 500</a> system <a href="https://librarytechnology.org/library/3413" title=" British Library | Library Technology Guides">according to Marshall Breeding’s Library Technology Guides site</a>) was affected, the report does not make clear whether patron demographic or circulation activity was taken.</p>
<h2 id="recoveryrebuild-and-renew">Recovery—Rebuild <em>and</em> Renew</h2>
<p>Reading between the lines a little bit, it sounds like the Library had a relatively flat network with few boundaries between systems: “our historically complex network topology … allowed the attackers wider access to our network than would have been possible in a more modern network design, allowing them to compromise more systems and services.” Elevated privileges on one system lead to elevated privileges on many systems, which allowed the attacker to move freely across the network. Systems are not structured like that today—now tending to follow the model of “least privileges”—and it seems like the Library is moving away from the flat structure towards a segmented structure.</p>
<p>As the report notes, recovery isn’t just a matter of restoring backups to new hardware. The system can’t go back to the vulnerable state it was in. It also seems like some software systems themselves are not recoverable due to age. The British Library’s program is one of “Rebuild and Renew” — rebuilding with fresh infrastructure and replacing older systems with modern equivalents. In the never-let-a-good-crisis-go-to-waste category, “the substantial disruption of the attack creates an opportunity to implement a significant number of changes to policy, processes, and technology that will address structural issues in ways that would previously have been too disruptive to countenance.”</p>
<p>The report notes “a risk that the desire to return to ‘business as usual’ as fast as possible will compromise the changes”, and this point is well taken. Somewhere I read that the definition of “personal character” is the ability to see an action through after the emotion of the commitment to action has passed. The British Library was a successful institution, and it will want to return to that position of being seen as a thriving institution as quickly as possible. This will need to be a continuous process. What is cutting edge today will become legacy tomorrow. As our layers of technology get stacked higher, the bottom layers get squeezed and compressed into thin slivers that we tend to assume will always exist. We must maintain visibility in those layers and invest in their maintenance and robustness.</p>
<h2 id="backups">Backups</h2>
<p>They also found “viable sources of backups … that were unaffected by the cyber-attack and from which the Library’s digital and digitised collections, collection metadata and other corporate data could be recovered.” That is fortunate—even if the older systems have to be replaced, they have the data to refill them.</p>
<p>They describe their new model as “a robust and resilient backup service, providing immutable and air-gapped copies, offsite copies, and hot copies of data with multiple restoration points on a 4/3/2/1 model.” I’m familiar with the 3/2/1 strategy for backups (three copies of your data on two distinct media with one stored off-site), but I hadn’t heard of the 4/3/2/1 strategy. Judging from <a href="https://www.backblaze.com/blog/whats-the-diff-3-2-1-vs-3-2-1-1-0-vs-4-3-2/" title="What’s the Diff: 3-2-1 vs. 3-2-1-1-0 vs. 4-3-2 | Backblaze blog">this article from Backblaze</a>, the additional layer accounts for a fully air-gapped or unavailable-online copy. An example is the <a href="https://aws.amazon.com/s3/features/object-lock/" title="Amazon S3 Object Lock | Amazon Web Services">AWS S3 “Object Lock” service</a>, a cloud version of <a href="https://en.wikipedia.org/wiki/Write_once_read_many" title="Write once read many | Wikipedia">Write-Once-Read-Many (WORM) storage</a>. Although the backed-up object is online and can be read (“Read-Many”), there are technical controls that prevent its modification until a set period of time elapses (“Write-Once”). Presumably, the time period is long enough to find and extricate anyone who has compromised the systems before the object lock expires.</p>
<h2 id="improved-processes">Improved Processes</h2>
<p>The lessons include the need for better network monitoring, external security expertise retention, multi-factor authentication, and intrusion response processes. The need for comprehensive <a href="https://en.wikipedia.org/wiki/Multi-factor_authentication" title="Multi-factor authentication | Wikipedia">multi-factor authentication</a> is clear. (Dear reader: if you don’t have a comprehensive plan to manage credentials—including enforcement of MFA—then this is an essential takeaway from this report.)</p>
<p>Another outcome of the recovery is better processes for refreshing hardware and software systems as they age. Digital technology is not static. (And certainly not as static as putting a printed book on a climate-controlled shelf.) It is difficult (at least for me) to envision the kind of comprehensive change management that will be required to build a culture of adaptability and resilience to reduce the risk of this happening again.</p>
<h2 id="some-open-questions">Some open questions…</h2>
<p>I admire the British Library’s willingness to publish this report that describes in a frank manner their vulnerabilities, the impacts of the attack, and what they are doing to address the problems. I hope they continue to share their findings and plans with the library community. Here are some things I hope to learn:</p>
<ul>
<li>To what extent was the patron data (demographic and circulation activity) in the integrated library system sought and copied out?</li>
<li>How will they prioritize, plan, and create replacement software systems that cannot be recovered or are deemed too insecure to put back on the network?</li>
<li>Describe in greater detail their changes to data backup plans and recovery tests. What can be taught to other cultural heritage institutions with similar data?</li>
<li>This is about as close to “green-field” development as you can get in an organization with many existing commitments and requirements. What change management exercises and policies helped the staff (and public) through these changes?</li>
</ul>
<p>Cyber security is a group effort. It would be easy to pin this chaos on the tech who removed a block on the account that may have been the beachhead for this attack. As this report shows, the organization allowed this environment to flourish, culminating in that one bit-flip that brought the organization down.</p>
<p>I’ve never been in that position, but I am mindful that I could someday be in a similar position looking back at what my actions or inactions allowed to happen. I’ll probably be at risk of being in that position until the day I retire and destroy my production work credentials. I hope the British Library staff and all involved in the recovery are treating themselves well. Those of us on the outside are watching and cheering them on.</p>
Peter Murray (jester@dltj.org)
https://dltj.org/
Lorcan Dempsey: So-called soft skills are hard
https://www.lorcandempsey.net/rss/65cb03078f65940001b5e17c
2024-03-08T06:26:20+00:00
<img alt="So-called soft skills are hard" src="https://www.lorcandempsey.net/content/images/2024/02/rainier.jpg" /><p>We can recognize the importance of &apossoft skills&apos while acknowledging that &apossoft&apos has multiple associations that may be misleadingly at variance with the importance of those skills. </p><p>In this post, I look at some contexts which are making such skills both more visible and more important before making some general concluding observations. It is clear that such skills are core needs for the library which is increasingly networked, relational and community focused. </p><blockquote class="kg-blockquote-alt">We can recognize the increasing importance of &apossoft skills&apos while acknowledging that &apossoft&apos has multiple associations that may be misleadingly at variance with the importance of those skills. </blockquote><p>Soft skills, and the contributions of the (often female) library workers who demonstrate them, have often been undervalued or gone unobserved. However, the value and visibility of this work is increasingly recognised, and indeed acknowledged as central. </p><p>For example, this recognition is a major focus of the important <a href="https://www.cambridge.org/core/books/social-future-of-academic-libraries/4ECBBA26E7B474067C877CEE055D1DD1?ref=lorcandempsey.net#fndtn-information">collection</a> <em>The Social Future of Academic Libraries</em>: <em>new perspectives on communities, networks and engagement. </em>It raises up the sometimes invisible relational work of library workers, and places it very much at the center of library operations and value.</p><p>What are soft skills? Examples are emotional and social intelligence, persuasion, team-working, empathy, communication, negotiation, networking, maintaining boundaries and self-care, conflict resolution, cultural awareness, advocacy, relationship-building. I have deliberately presented these as a random list to emphasize that there is no neat definition or bounded category here. </p><p>Seth Godin contrasts them with &aposvocational&apos skills and suggests calling them &aposreal&apos skills: </p><blockquote>Vocational skills can be taught: You’re not born knowing engineering or copywriting or even graphic design, therefore they must be something we can teach. But we let ourselves off the hook when it comes to decision-making, eager participation, dancing with fear, speaking with authority, <a href="https://ideas.ted.com/the-3-things-that-great-teams-have-in-common/?ref=lorcandempsey.net" rel="noopener">working in teams</a>, seeing the truth, speaking the truth, inspiring others, doing more than we’re asked, caring and being willing to change things. We underinvest in this training, fearful that these things are innate and can’t be taught. Perhaps they’re talents. And so we downplay them, calling them soft skills, making it easy for us to move on to something seemingly more urgent. // <a href="https://ideas.ted.com/soft-skills-and-real-skills/?ref=lorcandempsey.net">Seth Godin</a></blockquote><p>The phrase &apossoft skills&apos is quite problematic in several ways, including its gendered reception. I discuss some issues below. It is tempting to pre-empt that discussion and refer to such skills as CORE skills, where CORE stands for COmmunication, Relational and Empathetic, or some-such [although see my note about CORE below]. But although it sends out the right message – that what we have come to call &apossoft&apos skills are in fact &aposcore&apos skills – it is probably too confusing. </p><p>Similarly, although I think that they might be better, I don&apost use &aposstrategic&apos or &aposrelational&apos or &apossocial&apos in place of &apossoft.&apos Even though it is clear that &apossoft&apos is hopelessly inadequate when it comes to describing some of the social/relational/strategic skills required in today&aposs library. </p><h2 id="four-contexts">Four contexts</h2><p>There is a variety of contexts where the importance of soft skills is increasingly highlighted. I describe four here. </p><h3 id="1-the-relational-library">1 The relational library</h3><p>A frame is always useful. I find it helpful to think of an evolution from a library which is configured around the collection, to one which is configured around the civic, learning or research needs of the people it serves. The <strong><em>relational library</em></strong> is a helpful label in this context. </p><p>One could also point to David Lankes&apos succinct <a href="https://davidlankes.org/the-lankes-corollaries/?ref=lorcandempsey.net" rel="noreferrer">provocation</a>: &aposBad libraries build collections, good libraries build services, great libraries build communities.&apos</p><p>In an academic setting, the library is interested in more directly supporting research and learning workflows, building connections with departments, instructional practice and research groups. It is also developing richer partnerships with other campus agents, which might include, for example, the center for teaching and learning, the office of sponsored research, and student success initiatives. The library, like other campus units, will be in ongoing interaction with core university functional units such as facilities or communications. What Rebecca Bryant calls <a href="https://hangingtogether.org/category/research-support/social-interoperability/?ref=lorcandempsey.net" rel="noreferrer">social interoperability</a> is key.</p><p>In a public library setting, the library is welcoming the community to its space with a growing variety of creative activities and events. It is partnering with social and educational services, with local charities or cultural institutions, with schools and colleges. It is reaching previously overlooked or marginalised populations, it is developing special programs for particular language groups, it is providing services for immigrants. The awareness of the role of the public library as critical social infrastructure has been elevated by the publication of Eric Klinenberg&aposs <em>Palaces for the People</em>. </p><blockquote class="kg-blockquote-alt">The library provides access to the means of creative production. </blockquote><p>This deeper community engagement requires what we have called soft skills - communication, relational skills, cultural competence. Working with partners also involves these skills, along with persuasion, trust-building and negotiation. And given that library communities may vary quite a bit, there is a premium on flexibility and adaptability. </p><p>The use of &apossoft&apos seems especially misleading in the context of the current realities of public library work. Bringing the community into the library means bringing the whole community into the library. And so, library workers have often to engage with community issues, including mental health, homelessness, food insecurity, and a range of social circumstances for which they may be not very well prepared. </p><h3 id="2-challenges-to-value-and-values">2 Challenges to value and values</h3><p>In an important article about library value, Eleanor Jo Rodger characterises the public library in this way. </p><blockquote>Similarly, public libraries are society&aposs way of paying attention to learning and equity. In the United States we hold both in high esteem, so we fund public libraries with tax revenues. // <a href="https://www.webjunction.org/documents/webjunction/Value_and_Vision.html?ref=lorcandempsey.net">WebJunction</a></blockquote><p>This was written over twenty years ago. We are in a different political environment now. What happens when learning and equity are not held in high esteem, or their value is questioned? Or when the longstanding values or policies around collections are challenged?</p><p>At the same time value questions may be asked of all types of libraries given competing financial demands, mistaken impressions about the digital environment, and so on. </p><p>There are very big issues here, but in the context of this post, I want to note again the importance of advocacy, persuasion, storytelling, communication, and relationship building. In any discussion, it is important to understand the position of others, especially if one wants to persuade them of a particular course of action. </p><p>Such &apossoft&apos skills don&apost seem very soft in this context, as the librarian interacts with the mayor, the library board, the provost or the faculty committee. </p><p>Again, the recent environment in public libraries underlines this as library workers have to be prepared to talk about collection development, events or other policies in the face of organized and persistent challenges. </p><h3 id="3-collaboration-and-partnership">3 Collaboration and partnership</h3><p>Collaboration is central to library operations. Libraries also scale learning, innovation and advocacy through collaborative work. And as discussed above, libraries partner with others, whether these are other units within the parent municipality or college, or are outside.</p><p>It would actually be interesting to see some research into how much time library workers spend on collaboration and partnership. My sense is that this considerable, and that it is also growing. At the same time, libraries are looking more critically at collaboration, assessing the level of investment it requires. </p><p>At the University of Washington, I have been developing a course on library collaboration and partnerships. It has seemed to me that this is a somewhat overlooked area in library education, but also in strategy and planning, given how central it is to library operations and thinking. </p><p>In working through the course, I have been struck by how much this kind of library work also depends on what we call soft skills. </p><p>Collaboration involves communication, relationship-building, negotiation, teamwork. It involves building the trust that allows candid conversations about priorities to happen. But it is much more. Within these collaborative settings, librarians mentor colleagues from other institutions, develop confidence in committee work and presentation, and advocate for their library&aposs interests. The social and political environments that collaborative working involves depend on &apossoft&apos skills to work well and in turn they allow people to develop those soft skills. </p><figure class="kg-card kg-bookmark-card kg-card-hascaption"><a class="kg-bookmark-container" href="https://www.lorcandempsey.net/the-powers-of-library-consortia-1-how-consortia-scale-capacity-learning-innovation-and-influence/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">The powers of library consortia 1: how consortia scale capacity, learning, innovation and influence</div><div class="kg-bookmark-description">Libraries and related organizations group together in a variety of ways to get their work done. They consort where there are scale advantages: to lobby, for example, to negotiate and license, to reduce costs, or to build shared infrastructure.</div><div class="kg-bookmark-metadata"><img alt="So-called soft skills are hard" class="kg-bookmark-icon" src="https://www.lorcandempsey.net/content/images/size/w256h256/2023/02/LorcanDempseyNetIconTransparent-10023.png" /><span class="kg-bookmark-author">LorcanDempsey.net</span><span class="kg-bookmark-publisher">Lorcan</span></div></div><div class="kg-bookmark-thumbnail"><img alt="So-called soft skills are hard" src="https://www.lorcandempsey.net/content/images/size/w1200/2021/02/IMG_20170929_090933.jpg" /></div></a><p><span style="white-space: pre-wrap;">A discussion of dynamics of collaborative working</span></p></figure><h3 id="4-equity-and-empathy">4 Equity and empathy</h3><p>Library workers have been stretched and stressed in very challenging ways in recent years through the pandemic, social unrest, and the real impacts of the culture wars. The murder of George Floyd made libraries and library workers recognise the need to more purposefully identify and repair harm. Librarians have had to step up to additional roles and to handle difficult situations in the workplace. Organizational and hierarchical issues have been emphasised, as, for example, in the uneven need to be physically present during the pandemic. Some may now be concerned about the uncertain impact of AI on their work, and the potential dilution of social trust and confidence as synthesized communication or creation spreads. This is especially so in this year of elections. The cumulative emotional attrition has been draining, and empathy can be difficult. </p><p>This may be leading to a change in sensibilities and expectations around libraries and the roles of library workers:</p><blockquote>This may be unevenly manifested, but underlines the need for the library to recognize the importance of equity and empathy, in terms of both value created and values embraced. We know that libraries are social organizations supporting mental wellness, social cohesion, and personal and community development. Current experience has foregrounded these roles.</blockquote><p>It has also foregrounded the importance of empathy and understanding in the workplace, as managers and as colleagues. Empathy, transparency and communication are central. </p><p>Writing about soft skills, Emy Nelson Decker [2020] notes a claim that "many professions that rely heavily upon empathy or listening to the needs of others (i.e. soft skills) are also notorious for burn out or "compassion fatigue." </p><p>The work of Kaetrena Davis Kendrick on <a href="https://kaetrenadaviskendrick.wordpress.com/?ref=lorcandempsey.net" rel="noopener noreferrer">empathy and self-preservation</a> is very relevant here. See for example this WebJunction webinar with a recording and related resources.</p><figure class="kg-card kg-bookmark-card kg-card-hascaption"><a class="kg-bookmark-container" href="https://www.webjunction.org/events/webjunction/low-morale-in-libraries.html?ref=lorcandempsey.net"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Low Morale in Libraries: Impacts and Countermeasures</div><div class="kg-bookmark-description">Learn about important research on low morale and leave with actionable ideas for promoting a healthy work environment for all staff and cultivating empathetic leadership in libraries.</div><div class="kg-bookmark-metadata"><img alt="So-called soft skills are hard" class="kg-bookmark-icon" src="https://www.webjunction.org/apps/settings/wcm/designs/oclc/images/apple-touch-icon-precomposed.png" /><span class="kg-bookmark-author">WebJunction</span></div></div><div class="kg-bookmark-thumbnail"><img alt="So-called soft skills are hard" src="https://www.webjunction.org/content/dam/WebJunction/Images/webjunction/2023-04/social_card_wj_webinar_morale_countermeasures.jpg" /></div></a><p><span style="white-space: pre-wrap;">Webinar organized by WebJunction which includes a useful collection of additional related resources</span></p></figure><p>I was taken with this comment from her co-presenter, Sunnie Scarpa: &aposHire and train empathetic staff with healthy boundaries.&apos</p><figure class="kg-card kg-bookmark-card kg-card-hascaption"><a class="kg-bookmark-container" href="https://www.lorcandempsey.net/letter-to-an-lis-graduate-student/#grow-into-your-voicefocus-on-equity-and-empathy"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Reflecting life and career: advice to an LIS graduate student</div><div class="kg-bookmark-description">What do I value about libraries? asked a student. What is essential for a library student to know? Here are several ways in which life experiences have influenced my views.</div><div class="kg-bookmark-metadata"><img alt="So-called soft skills are hard" class="kg-bookmark-icon" src="https://www.lorcandempsey.net/content/images/size/w256h256/2023/02/LorcanDempseyNetIconTransparent-10023.png" /><span class="kg-bookmark-author">LorcanDempsey.net</span><span class="kg-bookmark-publisher">Lorcan</span></div></div><div class="kg-bookmark-thumbnail"><img alt="So-called soft skills are hard" src="https://www.lorcandempsey.net/content/images/size/w1200/2021/10/windows.png" /></div></a><p><span style="white-space: pre-wrap;">The paragraphs above draw on a fuller discussion of equity and empathy in this post, from where the quote above also comes</span></p></figure><h3 id="id"></h3><h2 id="discussionthe-new-core">Discussion - the new core</h2><p>There has been some research about soft skills in the library field, notably by Laura Saunders. She has explored the knowledge, skills and attributes (KSAs) reported as necessary to library work. She identified eleven KSAs deemed core, and of those seven (and possibly eight depending on categorization) were general and could be considered &apossoft skills.&apos </p><p>She makes this interesting note about importance:</p><blockquote>This emphasis on interpersonal and communication skills seems to align with the idea of the information professions as user-centered and customer-service oriented. Partridge, Menzies et al. asserted from their findings that “personality traits, not just qualifications, were critical to be a successful librarian or contemporary information worker” (2010, p. 271), and Saunders (2015) found that some focus-group participants said they would prioritize soft skills over hard skills or content knowledge when hiring. // <em>Saunders, 2019</em></blockquote><p>This article dates from 2019 based on survey work carried out in 2017. Given the contexts described above, I imagine that soft skills would be found to be even more urgently important today. </p><p>So soft skills are critical to library work in multiple ways and to positioning the library effectively in the community it serves. However &aposSoft&apos is potentially misleading or unhelpful in several ways. In fact the use of the word may actually be damaging in particular settings, or it may suggest something that is directly counter to the actual situation. </p><ul><li>It can suggest that such skills cannot be learned or taught. However, they can be, and how they should be tackled is an intriguing question for library education. In fact, mentoring is an important skill, as is a disposition to learning. </li><li>It can suggest that such skills are less important than so-called &aposhard&apos skills. Or are somehow easier to develop. However, as noted above, and as can be easily seen elsewhere, soft skills are very important in the workplace. And they need to be learned and practiced. </li><li>It may mean that those who have good soft skills are less valued than those with putatively harder skills, that their opinions are less valued, or that they are pushed to the sidelines of decision-making. </li><li>The language may reflect or support prejudices about gender, given the association that may be made between hard and soft, respectively, and masculine and feminine. Emy Nelson Decker notes: "This is particularly worrisome in a historically feminized field as is the case with library science."</li><li>It can suggest that so-called soft and hard skills are mutually exclusive, and that one can only optimise for one. However, consider how we want to be treated in medical settings. Soft skills are important to <a href="https://www.lorcandempsey.net/technology-is-not-on-the-outside/">technology</a> workers, say, as much as to others. </li><li>A recent <em>Journal of Engineering Education</em> editorial <a href="https://doi.org/10.1002/jee.20442?ref=lorcandempsey.net">argued</a> that it may reduce emphasis on equity and inclusion. &aposUltimately, using the term "soft skills" pushes individuals that advocate for and excel at human-focused competencies to the margins of engineering. While these skills have typically involved communication and interpersonal skills, they also involve commitments to equity and inclusion."</li></ul><p>Dissatisfaction with the term is common. I was interested to read these two comments in the context of library collaboration. </p><blockquote>The abilities required for collaboration tend to be poorly covered in professional competency statements, scattered around under multiple headings and also inaccurately labelled in the library literature as “soft skills”, indicating fundamental gaps in understanding that need to be addressed as a matter of urgency. // Sheila Corrall, <a href="https://alastore.ala.org/netwsch?ref=lorcandempsey.net"><em>Foreword</em></a></blockquote><blockquote>Most people think of collaboration as a soft skill and dismiss it with a shrug. Here’s the thing: You don’t collaborate to make people feel okay, because it’s expected of you, or to earn brownie points. You collaborate because on large-scale projects, you have no choice. // Valerie Horton, <a href="https://americanlibrariesmagazine.org/2021/11/01/the-necessity-of-collaboration/?ref=lorcandempsey.net" rel="noreferrer"><em>American Libraries Magazine</em></a> </blockquote><p>However, a quick search shows that this dissatisfaction spreads to the general business press and elsewhere. </p><blockquote>While technical skills are vital, "soft skills" are the glue that holds people, teams, and business units together. These skills encompass a wide range of abilities, including communication, problem-solving, critical thinking, emotional intelligence, and teamwork. They are the foundation upon which leaders and their teams develop trust, cooperation, and high performance. // It&aposs about time we abandoned the term &apossoft skills&apos, <a href="https://www.forbes.com/sites/danpontefract/2023/03/27/its-about-time-we-abandoned-the-term-soft-skills/?sh=2ef8b8281ff7&ref=lorcandempsey.net" rel="noreferrer"><em>Forbes</em></a></blockquote><p>The author argues for the adoption of &aposprofessional skills&apos (which itself might be challenged by some). The <a href="https://americanlibrariesmagazine.org/blogs/the-scoop/predicting-the-unpredictable/?ref=lorcandempsey.net">piece</a> by Seth Godin I reference above is worth reading in full in this context. </p><p>That said, changing the name is unlikely to happen quickly or universally. I think we would benefit from an alternative term which is generally understood. However, the bigger immediate issue is value, recognition and preparation. </p><p>For the library community, soft skills are core, and critical to the success of the relational, community focused library. They are essential and valuable skills for all library workers. And what we might have called soft skills are now needed in some very difficult front-line situations where the values of the library are challenged or where library workers are engaging with social issues and stressed populations. </p><p>This importance should be recognised ... and not only performatively. But recognised in reality, with focused systematic attention when it comes to review, professional development, promotion, recruitment, human resourcing plans, strategies, job boundaries, and so on. </p><p>And it should be recognized that soft skills are not only essential, but they are very hard.</p><div class="kg-card kg-callout-card kg-callout-card-pink"><div class="kg-callout-text">CORE. After suggesting CORE skills above (to refer to Communication, Relational and Empathy), I read Emy Nelson Decker&aposs interesting article on soft skills in academic library settings [Decker 2020]. She references the use of CORE to designate Competence in Organizational and Relational Effectiveness. </div></div><h2 id="some-references"><strong>Some references</strong></h2><p>Berdanier, C. G. P. (2022). A hard stop to the term “soft skills.” <em>Journal of Engineering Education</em>, <em>111</em>(1), 14–18. <a href="https://doi.org/10.1002/jee.20442?ref=lorcandempsey.net" rel="noreferrer">https://doi.org/10.1002/jee.20442</a></p><p>Corrall, S. (2024). Foreword: The Network is the Message. In S. Pavey, <em>The Networked Librarian: The School Librarian’s Role in Fostering Connections, Collaboration and Co-creation Across the Community</em>. Facet.</p><p>Decker, E. N. (2020). The X-factor in academic libraries: the demand for soft skills in library employees. <em>College & Undergraduate Libraries</em>, <em>27</em>(1), 17–31. <a href="https://doi.org/10.1080/10691316.2020.1781725?ref=lorcandempsey.net" rel="noreferrer">https://doi.org/10.1080/10691316.2020.1781725</a></p><p>Horton, V. (2021). The Necessity of Collaboration. <em>American Libraries Magazine</em>. <a href="https://americanlibrariesmagazine.org/2021/11/01/the-necessity-of-collaboration/?ref=lorcandempsey.net" rel="noreferrer">https://americanlibrariesmagazine.org/2021/11/01/the-necessity-of-collaboration/</a></p><p>Kendrick, K. D., & Scarpa, S. (2023, June 29). <em>Low Morale in Libraries: Impacts and Countermeasures</em> [Webinar recording and materials]. <a href="https://www.webjunction.org/events/webjunction/low-morale-in-libraries.html?ref=lorcandempsey.net">https://www.webjunction.org/events/webjunction/low-morale-in-libraries.html</a></p><p>Rodger, E. J. (2002). Value and Vision. <em>American Libraries</em>, <em>33</em>(10). <a href="https://www.webjunction.org/documents/webjunction/Value_and_Vision.html?ref=lorcandempsey.net">https://www.webjunction.org/documents/webjunction/Value_and_Vision.html</a></p><p>Saunders, L. (2019). Core and More: Examining Foundational and Specialized Content in Library and Information Science. <em>Journal of Education for Library and Information Science</em>, <em>60</em>(1), 3–34. <a href="https://doi.org/10.3138/jelis.60.1.2018-0034?ref=lorcandempsey.net" rel="noreferrer">https://doi.org/10.3138/jelis.60.1.2018-0034</a></p><p>Saunders, L., & Bajjaly, S. (2022). The Importance of Soft Skills to LIS Education. <em>Journal of Education for Library and Information Science</em>, <em>63</em>(2), 187–215. <a href="https://doi.org/10.3138/jelis-2020-0053?ref=lorcandempsey.net">https://doi.org/10.3138/jelis-2020-0053</a></p><p><strong>Photograph:</strong> I took the picture at the University of Washington, where I am currently <a href="https://ischool.uw.edu/news/2023/10/library-luminary-eager-share-practical-expertise?ref=lorcandempsey.net">based</a> in the Information School. </p><p><strong>Acknowledgements:</strong> I am grateful to Sari Feldman, Alicia Salaz, and Sharon Streams who generously commented on a draft. </p>
Lorcan
https://www.lorcandempsey.net/
Harvard Library Innovation Lab: Cracking the justice barrier: announcing the Open Legal AI Workbench
https://lil.law.harvard.edu/blog/2024/03/08/announcing-the-open-legal-ai-workbench-olaw/
2024-03-08T00:00:00+00:00
<p><img alt="" src="https://lil-blog-media.s3.amazonaws.com/olaw-banner.png" /></p>
<blockquote>
<p><em>This post is part of the Library Innovation Lab’s announcements in the context of <a href="https://lil.law.harvard.edu/about/cap-celebration/">Transform: Justice</a>, celebrating the full, unqualified release of the data from the <a href="https://case.law">Caselaw Access Project</a>.</em></p>
</blockquote>
<p>When the Lexis corporation <a href="https://en.wikipedia.org/wiki/LexisNexis#History">first launched legal research terminals</a> in the 1970s it hoped to “crack the librarian barrier,” allowing lawyers to do their own legal research from their desks instead of sending law firm librarians through paper search indexes. Today something larger is possible: we may be able to “crack the justice barrier,” allowing people to answer a larger and larger number of legal questions for themselves. <a href="https://justicegap.lsc.gov/">According to the Legal Services Corporation</a>, low-income Americans do not receive any or enough legal help for 92% of their civil legal problems, so there would be a huge public benefit to making legal resources more widely available.</p>
<p>We want academics and nonprofits at the table in discovering the next generation of legal interfaces and helping to close the justice gap. It is not at all clear yet which legal AI tools and interfaces will work effectively for people with different levels of skill, what kind of guardrails they need, and what kind of matters they can help with. We need to try a lot of ideas and effectively compare them to each other.</p>
<p>That’s why we’re releasing a common framework for scholarly researchers to build novel interfaces and run experiments: the <a href="https://github.com/harvard-lil/olaw">Open Legal AI Workbench</a> (OLAW). In technical terms, OLAW is a simple, well-documented, and extensible framework for legal AI researchers to build services using tool-based retrieval augmented generation.</p>
<p>We’re not done building this yet, but we think it’s time to share with the legal technology and open source AI communities for feedback and collaboration.</p>
<p>Out of the box, OLAW looks like this:</p>
<figure>
<div class="embed-container">
</div>
<strong>Video</strong>: OLAW’s chatbot retrieving court opinions from the CourtListener API to help answer a legal question. Information is interpreted by the AI model, which may make mistakes.
</figure>
<hr />
<h2 id="what-is-olaw-for">What is OLAW for?</h2>
<p>OLAW itself is not a useful legal AI tool, and we didn’t build it to be used as-is. Instead, OLAW is intended to rapidly prototype new ideas for legal tools. OLAW is an excellent platform for testing questions like:</p>
<ul>
<li>How are legal AI tools affected by the use of different prompts, models, or finetunings?</li>
<li>How can legal AI tools best incorporate different data sets, such as caselaw, statutes, or secondary sources?</li>
<li>What kind of search indexes are best for legal AI tools (boolean, semantic search etc.)?</li>
<li>How can users be best instructed to use legal AI tools? What interface designs cause users at different skill levels to engage with the tool effectively and manage its limitations?</li>
<li>What kind of safety guardrails and output filters are most effective and informative for legal AI tools?</li>
<li>What kind of information about the tool’s internal processes should be exposed to users?</li>
<li>What kind of questions are better or worse suited for legal AI tools, and how can tools help guide users toward effective uses and away from ineffective ones?</li>
</ul>
<p>… and many others. If you want to experiment with legal AI search tools, and you have a programmer who can write some basic Python, OLAW will give you all the knobs to turn when you get started.</p>
<hr />
<h2 id="why-is-olaw-needed-now">Why is OLAW needed now?</h2>
<p>Legal AI tooling is a wide-open design space with the potential to help a lot of people. We want to make it easier for the academic and open source communities to get involved in exploring the future of these tools.</p>
<p>The commercial legal research industry is undergoing the fastest period of exploration since the invention of the internet. While there has been incremental progress, the boolean search techniques still used by lawyers today would be recognizable to lawyers using LEXIS terminals in the 1970s. But now, everything is changing: commercial vendors like Westlaw, LexisNexis, and vLex all introduced novel AI-based search interfaces in the last year.</p>
<p>We want to support research that happens outside the legal industry as well as inside, and research that is published publicly and peer-reviewed as well as proprietary. That’s needed because lots of people who need legal help may never be profitable to serve; because lots of novel tools are now possible beyond the ideas any one company can explore; and because everyone will be better off if there is rigorous, public research available on what works and what doesn’t.</p>
<hr />
<h2 id="whats-next">What’s next?</h2>
<p>We currently have the core concept implemented: a simple, well documented testbed using tool-based retrieval augmented generation that is easy to modify. These are some directions we would like to explore next:</p>
<ul>
<li><strong>Automatic benchmarking frameworks</strong>. OLAW currently requires manual testing to evaluate the impacts of design experiments. Some impacts may be testable automatically; we would like feedback on the best way to design effective benchmarks.</li>
<li><strong>Additional tools</strong>. OLAW ships with just one tool, which runs searches against the CourtListener API. We would welcome additions of default tools that search other legal resources.</li>
<li><strong>Structured extension points</strong>. We have a standard plugin-based approach to adding tools, but other extensions such as output filters or display methods require patches to the underlying source code. We would like help identifying other extension points that would benefit from standardized interfaces for testing.</li>
</ul>
<p>We welcome the community’s input on these and other areas for improvement.</p>
<hr />
<h2 id="how-do-i-get-involved">How do I get involved?</h2>
<p>OLAW is currently best suited for programmers who can host their own web software and make their own modifications. To get started, <a href="https://github.com/harvard-lil/olaw">head over to our GitHub repo</a> to get installation instructions, file issues, send pull requests, or comment in the discussion area.</p>
<hr />
<h2 id="credits">Credits</h2>
<p>Thanks to Jeremiah Milbauer and Tom Zick for their input on this effort; all mistakes are by Jack and Matteo.</p>
<p>Logo: <a href="https://lil.law.harvard.edu/about/#jacob-rhoades">Jacob Rhoades</a>.</p>
Harvard Library Innovation Lab
https://lil.law.harvard.edu/blog/
HangingTogether: A marathon, not a sprint: implementing research information management systems (RIMS) in the US
https://hangingtogether.org/?p=14061
2024-03-07T09:53:00+00:00
<div class="wp-block-image">
<figure class="alignright size-large is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/03/markus-spiske-8fXUlfjt0G0-unsplash-scaled.jpg"><img alt="" class="wp-image-14163" height="1024" src="https://hangingtogether.org/wp-content/uploads/2024/03/markus-spiske-8fXUlfjt0G0-unsplash-683x1024.jpg" style="width: 317px; height: auto;" width="683" /></a><em><sup>Photo by <a href="https://unsplash.com/@markusspiske?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Markus Spiske</a> on <a href="https://unsplash.com/photos/a-man-running-down-a-dirt-road-in-the-woods-8fXUlfjt0G0?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Unsplash</a></sup></em></figure></div>
<p>Research information management systems (RIMS) are an area of growth and investment for US libraries, which OCLC Research has explored in several previous <a href="https://www.oclc.org/research/areas/research-collections/rim.html" rel="noreferrer noopener" target="_blank">research reports</a>. Recently the <a href="https://www.oclc.org/research/partnership.html" rel="noreferrer noopener" target="_blank">OCLC Research Library Partnership</a> hosted a webinar where we learned about the RIMS implementations at three partner institutions through presentations from: </p>
<ul>
<li>Jason Glenn, Program Director for Research Information Management Services, Carnegie Mellon University Libraries</li>
<li>Brian Mathews, Associate Dean, Research & Innovation, Carnegie Mellon University Libraries</li>
<li>Laura Simon, Research Support Librarian, Bernard Becker Medical Library, Washington University School of Medicine in St. Louis Missouri</li>
<li>Mark Zulauf, Researcher Information Systems Coordinator, University of Illinois Urbana-Champaign</li>
</ul>
<p>Each presenter shared about the origin, history, and current status of their RIMS implementation, along with information about system scope, uses, and institutional partners. While I’m providing a high level synthesis in this post, I encourage you to review the publicly available <a href="https://www.oclc.org/research/events/2024/lessons-implementing-rims-us.html" rel="noreferrer noopener" target="_blank">video recording</a> and <a href="https://www.oclc.org/research/events/2024/lessons-implementing-rims-us.html" rel="noreferrer noopener" target="_blank">slides</a>. </p>
<h4 class="wp-block-heading">RIMS support multiple use cases</h4>
<p>In the 2021 OCLC Research report, <em><a href="https://www.oclc.org/research/publications/2021/oclcresearch-rim-united-states.html" rel="noreferrer noopener" target="_blank">Research Information Management in the United States</a></em>, we identified six discrete use cases for US RIMS systems, and the webinar presenters shared about the four use cases currently being supported at their institutions: </p>
<div class="wp-block-image">
<figure class="alignright size-large is-resized"><a href="https://www.oclc.org/research/publications/2021/oclcresearch-rim-united-states.html"><img alt="" class="wp-image-9753" height="1024" src="https://hangingtogether.org/wp-content/uploads/2021/10/RIM-US-Part-1-med-791x1024.jpg" style="width: 287px; height: auto;" width="791" /></a></figure></div>
<ul>
<li><strong>Public portals </strong>that feature profiles of individual researchers affiliated with the institution, to support expertise discovery and institutional reputation management</li>
<li><strong>Metadata reuse</strong> through repurposing of RIMS data for dynamic updates to faculty or unit web pages and directories</li>
<li><strong>Strategic reporting and decision support </strong>through reports and visualizations, often in response to queries about research collaboration and impact</li>
<li><strong>Faculty activity reporting</strong>, to support annual academic progress reviews and/or tenure and promotion workflows</li>
</ul>
<h5 class="wp-block-heading">Public portals</h5>
<p>The need to support reputation management and expertise discovery through a public portal was the impetus for RIMS adoption at both the <a href="https://experts.illinois.edu/" rel="noreferrer noopener" target="_blank">University of Illinois</a> and <a href="https://profiles.wustl.edu/en/" rel="noreferrer noopener" target="_blank">Washington University School of Medicine</a>. Both institutions license the Pure system from Elsevier, and each institution has about 3,000 public faculty and researcher profiles. At Carnegie Mellon, which licenses Symplectic Elements as part of the broader Digital Science suite utilized there, about one third of 1,500 faculty profiles are now <a href="http://scholars.cmu.edu" rel="noreferrer noopener" target="_blank">publicly available</a>. </p>
<p>The expertise discovery portals support campus users in many ways at all three institutions. For instance, Laura described how <a href="https://profiles.wustl.edu/en/" rel="noreferrer noopener" target="_blank">Research Profiles</a> is used at WashU to promote mentors and enhance recruitment for students, postdocs, and residents and fellows. Today <a href="https://experts.illinois.edu/" rel="noreferrer noopener" target="_blank">Illinois Experts</a>, which has been live since 2016, has about 40,000 visitors/month, and is used to find research collaborators, support media requests and links to research outputs, and identify reviewers for fellowship and award committees. </p>
<p>These public portals promote a consistent brand image, which is explicitly leveraged in Washington University School of Medicine marketing materials, particularly as a single, aggregated referral site about the school’s research productivity. </p>
<h5 class="wp-block-heading">Metadata reuse </h5>
<p>Colleges, departments, labs, and faculty members have long maintained their own web pages. However, by leveraging the Pure API, many Illinois units now receive dynamic updates from Illinois Experts, reducing burdensome data reentry and ensuring that information is current and synchronized with other campus pages. Similarly, the WashU <a href="https://internalmedicine.wustl.edu/research/publications/" rel="noreferrer noopener" target="_blank">Department of Medicine</a> utilizes RSS feeds to maintain a current list of publications for each of its divisions. RSS feeds are also used to maintain publication lists for laboratory or individual web pages at Illinois. </p>
<h5 class="wp-block-heading">Strategic reporting and decision support</h5>
<p>RIMS are part of the toolkit that libraries are increasingly utilizing to support <a href="https://hangingtogether.org/libraries-support-data-driven-decision-making/" rel="noreferrer noopener" target="_blank">data-driven decision making</a>. Both Illinois and Carnegie Mellon use RIMS data to support timely and accurate decision support for campus needs such as accreditation, bibliometric and research impact analysis, and grant proposal preparation. RIMS data has been leveraged at both institutions to answer questions about the breadth of research in areas such as food scarcity or AI, revealing expertise spread across many campus units. Illinois has also used RIMS to explore collaboration networks, by quantifying institutional collaborations with external industry partners and identifying units where researchers have co-authored with other researchers at minority-serving institutions. </p>
<p>Carnegie Mellon, in particular, is investing in this area, working to build expertise and capacity to support data visualization and reporting for campus users, similar to the type of library-based research analytics and decision support resources in place at <a href="https://hangingtogether.org/supporting-organizational-strategy-at-virginia-tech-libraries/" rel="noreferrer noopener" target="_blank">Virginia Tech</a>. </p>
<h5 class="wp-block-heading">Faculty activity reporting (FAR)</h5>
<p>At many (and probably most) US research institutions, RIMS facilitating public profiles <a href="https://hangingtogether.org/what-is-rim/" rel="noreferrer noopener" target="_blank">are separate </a>from platforms that support faculty activity reporting, annual performance reviews, and tenure and promotion processes. Furthermore, these faculty information system (FIS) processes are often still decentralized at the college level, although campus centralization of these workflows is trending upward, <a href="https://www.oclc.org/research/publications/2021/oclcresearch-rim-united-states-part-2-case-studies.html" rel="noreferrer noopener" target="_blank">as seen </a>at institutions like UCLA, Penn State, and Texas A&M. </p>
<p>Neither Illinois nor the Washington University School of Medicine are currently supporting the faculty activity reporting (FAR) use case. However, Carnegie Mellon is supporting FAR in a limited way by leveraging Elements data to develop standardized CVs for reappointment, promotion, and tenure processes for College of Fine Arts faculty. </p>
<p>Greater interoperability between these systems offers significant potential to reduce redundant data entry practices for faculty and staff, and I see growth as likely—particularly as more institutions seek to centralize FIS workflows for increased efficiency and cost savings. </p>
<h5 class="wp-block-heading">Unsupported use cases</h5>
<p>Unsurprisingly, given weaker national mandates in the United States, the presenters didn’t mention leveraging the other two use cases described in the 2021 report: </p>
<ul>
<li><strong>Open access workflows</strong> that simplify researcher deposit processes into institutional repositories</li>
<li><strong>Compliance monitoring</strong> through the tracking and reporting of information about research activities or open research, in response to external mandates.</li>
</ul>
<p>These use cases dominate in <a href="https://www.oclc.org/research/publications/2018/oclcresearch-practices-patterns-research-information-management/report.html" rel="noreferrer noopener" target="_blank">other national environments</a> such as the UK, Australia, Belgium, Netherlands, and <a href="https://www.oclc.org/research/publications/2017/oclcresearch-convenience-compliance-rim-europe.html" rel="noreferrer noopener" target="_blank">Finland</a>, and offer potentialities for future US uses as well. </p>
<h4 class="wp-block-heading">Challenges</h4>
<p>Each of the presentations made it clear that successfully implementing a RIMS is extremely challenging, including such things as:</p>
<ul>
<li>Faculty skepticism</li>
<li>No mandates for unit or researcher buy-in</li>
<li>Churn in campus leadership, resulting in uneven support (or even awareness), which can put a RIMS program at risk of losing support</li>
<li>Tensions between institutional and researcher needs</li>
<li>Decentralization</li>
<li>Resource limitations</li>
<li>The absence of any implemented use cases to build upon (i.e., you are starting from scratch)</li>
<li>The necessity of building trust-based collaborative relationships with other campus units</li>
</ul>
<p>There are also specific limitations related to the data in the RIMS:</p>
<ul>
<li>Need to enrich with data from a broad range of internal and external sources</li>
<li>Limitations of scope and usefulness of local HR data</li>
<li>Gaps in coverage for humanities, arts, and social sciences</li>
</ul>
<h4 class="wp-block-heading">Data enrichment and expanding uses over time</h4>
<p>Mark Zulauf’s slides provide a powerful visual representation of maturation of the Illinois Experts system since its launch in 2016. At that time, it primarily aggregated publications for affiliated researchers, harvested from Scopus, with the public portal as the only user of that data. Today, the aggregated dataset is much more robust and useful for facilitating campus insights, as it also includes patents, press/media reports, honors and awards, and researcher datasets ingested from multiple campus sources. </p>
<p>The 2023 slide also visualizes the increase in use cases and data consumers. In addition to the expertise discovery portal, the data is shared via API with campus web pages and the ORCID registry. And it is also available for use for institutional analysis and reporting. The system is now widely used across campus, providing previously unavailable insights and saving time in many ways, the result of a sustained investment by the University of Illinois Library and the Office of the Vice Chancellor for Research. </p>
<figure class="wp-block-gallery has-nested-images columns-default is-cropped wp-block-gallery-1 is-layout-flex wp-block-gallery-is-layout-flex">
<figure class="wp-block-image size-large"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/Illinois_2016.jpg"><img alt="" class="wp-image-14100" height="402" src="https://hangingtogether.org/wp-content/uploads/2024/02/Illinois_2016.jpg" width="714" /></a></figure>
<figure class="wp-block-image size-large"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/Illinois_2023.jpg"><img alt="" class="wp-image-14101" height="403" src="https://hangingtogether.org/wp-content/uploads/2024/02/Illinois_2023.jpg" width="717" /></a></figure>
</figure>
<h4 class="wp-block-heading">Strategies for success</h4>
<p>The presenters described some of the strategies they applied to make forward progress with their RIMS, despite the challenges. </p>
<p><strong>Build a richer dataset.</strong> Most RIMS implementations begin with metadata harvesting from external sources like Scopus, but, as Mark described, this is really just the starting place. By partnering with other campus units, the RIMS can include local data like patents and academic honors, making for a more robust view of campus research activities. Carnegie Mellon shares this vision, with a view of adding institutional facilities and equipment to their RIMS, to provide additional insights about the connections and <a href="https://hangingtogether.org/supporting-institutional-reporting-needs-with-rim-systems/" rel="noreferrer noopener" target="_blank">ROI of these resources</a>. WashU adds local membership data from on-campus centers and institutes to showcase relationships to help support buy-in.</p>
<p><strong>Directly engage with campus units</strong>. To support metadata reuse in other campus systems, the Illinois Library worked with the web development teams in the colleges of Liberal Arts and Sciences and Education on API integration into their websites. This investment has played a critical role in securing campus buy-in for Illinois Experts, first with administrators and later with faculty. </p>
<p><strong>Tailor solutions to unit needs.</strong> Operating in a decentralized, even federated campus environment, Carnegie Mellon has worked to identify the pain points of individual colleges and develop a plan for each unit. For the Tepper School of Business, this has meant leveraging Elements data to support accreditation reporting, while the library has provided aggregated publications for analysis to the College of Engineering. </p>
<p><strong>Stay laser focused. </strong>Related to their RIMS effort that began in 2019, Laura Simon emphasized the need to stay focused on the core objective of expertise discovery (the public portal use case). For Washington University School of Medicine, a conservative approach to deliver on this goal, despite other interesting opportunities, has helped them succeed. </p>
<h4 class="wp-block-heading">This is a marathon, not a sprint</h4>
<p>A major takeaway from this webinar was that achieving success with RIMS in the US takes time. It furthermore requires focus, investment, collaboration, and commitment to the project despite the significant challenges. <a href="https://hangingtogether.org/libraries-are-essential-partners-in-research-information-management-rim/" rel="noreferrer noopener" target="_blank">Libraries are achieving success as campus leaders</a> in these implementations, by leveraging library expertise with publications metadata, scholarly communications, persistent identifiers, publications indexes, open research, bibliometrics, and much more. I hope you will take the time to watch the full webinar presentation. </p>
<figure class="wp-block-embed is-type-video is-provider-vimeo wp-block-embed-vimeo wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
</div></figure>
<p>The post <a href="https://hangingtogether.org/a-marathon-not-a-sprint-implementing-research-information-management-systems-rims-in-the-us/">A marathon, not a sprint: implementing research information management systems (RIMS) in the US</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Rebecca Bryant
https://hangingtogether.org/
Lucidworks: Is Semantic Search Enough for Ecommerce? A B2B Perspective
https://lucidworks.com/?p=28387
2024-03-07T08:00:57+00:00
<p>B2B ecommerce demands accurate search. Can semantic search deliver the precision businesses need? We explore its limits and introduce a...</p>
<p>The post <a href="https://lucidworks.com/post/is-semantic-search-enough-for-e-commerce-a-b2b-perspective/">Is Semantic Search Enough for Ecommerce? A B2B Perspective</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Brian Land
https://lucidworks.com/
Open Knowledge Foundation: Rebecca Firth: ‘A panacea of open data is needed to tackle climate-related disasters’
https://blog.okfn.org/?p=29265
2024-03-06T11:16:02+00:00
<div class="wp-block-image"><figure class="aligncenter size-large is-resized"><img alt="" class="wp-image-29267" height="480" src="https://blog.okfn.org/wp-content/files/2024/03/100-8-Rebecca-Firth-1024x576.png" width="856" /></figure></div>
<p>This is the eighth conversation of the <a href="https://blog.okfn.org/2022/01/10/100-conversations-to-inspire-our-new-direction/" rel="noreferrer noopener" target="_blank"><em>100+ Conversations to Inspire Our New Direction</em></a> (#OKFN100) project.</p>
<p><strong>Since 2023, we are meeting with more than 100 people to discuss the future of open knowledge, shaped by a diverse set of visions from artists, activists, scholars, archivists, thinkers, policymakers, data scientists, educators, and community leaders from everywhere.</strong></p>
<p>The Open Knowledge Foundation team wants to identify and discuss issues sensitive to our movement and use this effort to constantly shape our actions and business strategies to deliver best what the community expects of us and our network, a pioneering organisation that has been defining the standards of the open movement for two decades.</p>
<p>Another goal is to include the perspectives of people of diverse backgrounds, especially those from marginalised communities, dissident identities, and whose geographic location is outside of the world’s major financial powers.</p>
<p>How openness can accelerate and strengthen the struggles against the complex challenges of our time? This is the key question behind conversations like the one you can read below.</p>
<p>*</p>
<p class="has-medium-font-size"><strong>This week we had the opportunity to speak with <a href="https://www.hotosm.org/people/rebecca-firth/" rel="noreferrer noopener" target="_blank">Rebecca Firth</a>, Executive Director of <a href="https://www.hotosm.org/" rel="noreferrer noopener" target="_blank">Humanitarian OpenStreetMap Team</a> (HOT)</strong>, an international team dedicated to humanitarian action and community development through open mapping.</p>
<p>Rebecca joined HOT in 2016 after working in digital and innovation consulting. She holds a Bachelor’s and Master’s degree in Geography from the University of Cambridge, UK, where she focused on international development. Before taking on the role of Executive Director, Rebecca served as Interim Executive Director and Senior Director of Strategy & Programme. She has worked to improve HOT’s ability to provide longer-term capacity building to <a href="https://www.openstreetmap.org/" rel="noreferrer noopener" target="_blank">OpenStreetMap</a> communities through training and micro-grants, to increase the use of OpenStreetMap by NGOs and other partners, and to spread HOT’s message globally to new volunteers and partners. Rebecca also led HOT’s application for the 2020 <a href="https://www.hotosm.org/audacious" rel="noreferrer noopener" target="_blank">Audacious Project</a>. She has lived and worked in Borneo, Japan, Colombia and Peru, focusing on public health, education, disaster risk reduction and organisational management. Rebecca is currently based in London, UK.</p>
<p><strong>HOT is a new partner of <a href="https://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day</a> (ODD), a community event co-led by <a href="https://okfn.org/" rel="noreferrer noopener" target="_blank">Open Knowledge Foundation</a> and <a href="https://okfn.org/en/network/" rel="noreferrer noopener" target="_blank">Open Knowledge Network</a></strong>. This year, HOT is sponsoring <a href="https://blog.okfn.org/2024/02/28/and-the-winners-of-the-open-data-day-2024-mini-grants-are/" rel="noreferrer noopener" target="_blank">mini-grants</a> to promote local open mapping events in support of the United Nations Sustainable Development Goals. ODD is also part of the <a href="https://www.hotosm.org/opensummit23-24" rel="noreferrer noopener" target="_blank">HOT OpenSummit ’23-24</a>.</p>
<p>This conversation took place online on 27 February 2024 and was moderated by <strong>Renata Ávila</strong>, CEO of OKFN, and <strong>Lucas Pretti</strong>, OKFN’s Communications & Advocacy Director. </p>
<p>We hope you enjoy reading it.</p>
<p>*</p>
<p><strong>Renata Ávila:</strong> For a lawyer like me, it’s always been very clear what the barriers to openness are – inaccessible laws, closed databases, proprietary licences and so on. You’re a geographer by training, and I’m curious about the geographical perspective on openness. What role does openness play in your practice and work?</p>
<p><strong>Rebecca Firth:</strong> I love that, I’ve never been asked that question before. I’m sure every geographer you talk to would have a different opinion. For me, there’s something about geography: in theory, everyone can see it, but in practice, not everyone can. So what we do with HOT is sort of map places that might not be visible in other data sources, but they are very visible to the people who live in a particular place and are aware of the challenges that they face.</p>
<p>There’s this kind of strange intersection between place, which is obviously an intimate and local thing, and openness when it’s opened up to millions and millions of people. The very local nature of geography kind of collapses with the global appetite that we all have for open data. Because we’re not mapping anything that people don’t already know. It’s just that we have to put it somewhere where people can access it so that it can be used in the best possible way. Obviously, openness is a huge lever to achieve that.</p>
<p><strong>Renata Ávila:</strong> I find it very interesting how open mapping can become an infrastructure in places that lack it. I come from Guatemala – in places like this, sometimes you have the map, but the social layer is completely missing. You have lived in so many places. Based on this experience, please tell us a little bit about the perspective of community participation in mapping and the role of open mapping in critical moments.</p>
<p><strong>Rebecca Firth:</strong> There’s a lot of representation and justice that happens not just in mapping a place, but in people from that place being the ones to map that place. Because often data is something that’s used as a tool by one person against another. A good example would be indigenous land: there are a lot of people who do have data – like mining companies or resource companies – and a lot of people who don’t, like local communities. There’s a clash there.</p>
<p>So the practice of community mapping is about trying to get not just the data into the hands of people who haven’t traditionally had access to it, but the power to create it, update it, manipulate it, and figure out how to use it for their purposes. That’s only really possible through techniques that lower the barrier to entry to mapping and participating in data as much as possible. It has to happen through open source, because obviously these are communities where proprietary tools aren’t going to reach the lateral scale that we hope they will.</p>
<p>The thing that keeps me passionate about the work that HOT is doing is the premise that data shouldn’t be a cause of human suffering. All the people who are working on human suffering need access to information that is very difficult and expensive for them to get. So if we can be a part of solving that problem, that’s amazing.</p>
<p><strong>Renata Ávila:</strong> We’ve spent many years in the open movement discussing licensing, standards, interoperability and so on. And I think there are two missing layers, two unfinished pieces. One is crowdsourced participation and the community component of that – there’s always a blurred line between exploitative extractivism and meaningful participation and collaboration. The other is the governance structure. We would like to know more about how HOT is organised, how you work locally and globally, and how you connect with communities. Because I think the wider open movement has a lot to learn from this.</p>
<p><strong>Rebecca Firth:</strong> In terms of our structure, HOT is run by a group of a few hundred voting members – these are super dedicated volunteers or participants in past projects. One of the most important things they do is elect our board of directors. We’re very fortunate to be one of the few multi-million dollar non-profits in the world to have a board that is 100% elected by the community. The benefits of that are that the community is really at the forefront of all the major decisions of the organisation. And the board represents the community. That’s one of the unique things about HOT.</p>
<p>The voting members also coordinate and lead activities in a number of working groups, which are really great spaces where the board, the community and the staff can interact. It’s a meeting place where the community can engage with the staff, sharing their ideas and needs in a formalised way. So the staff are there to serve the community. That is a big part of their role as staff.</p>
<p>Of course, as a growing organisation, there are tensions. One thing that’s really hard about being an open community is also being an organisation. As an organisation, we have to meet deadlines for proposals, projects, funding, budgets and so on, which obviously don’t work on the same timescales as communities. Also, the organisation can’t grow enormously, but the community can. So our goal is not just to hire infinitely more people, but to grow the community exponentially, which is a challenge in terms of managing the different dynamics and tensions that that’s going to create.</p>
<p>As an open source movement, the sky is the limit. You can have an infinite mission, but what is your ability to actually achieve it? We set a goal a few years ago to map an area where 1 billion people live, and that was going to be the most vulnerable billion people in the world, those who are either at very high risk of disaster or experiencing very high levels of multidimensional poverty.</p>
<p>But how do you set up your organisation to do that? No one in the world can get close to 1 billion people. So we have a very decentralised structure where most of our work is done through <a href="https://www.hotosm.org/hubs/">four regional hubs</a> – Latin America and the Caribbean, Western North Africa, Eastern Southern Africa and Asia Pacific. Each of these hubs serves about 20 to 25 countries with a staff of about 15 people. Their aim is to develop leaders in the countries they serve who are experiencing the problems and have a deep understanding of the solutions to those problems.</p>
<p>I think it’s a really nice system. I’m really proud of it, but it’s also incredibly difficult.</p>
<p><strong>Lucas Pretti:</strong> Could you give us some concrete examples off the top of your head of recent open mapping projects that inspire you? I mean, what is the work that HOT is doing at the end of the day?</p>
<p><strong>Rebecca Firth:</strong> A really good example of local and global coordination working really well was the <a href="https://www.hotosm.org/projects/join-the-turkey-and-syria-earthquake-response/" rel="noreferrer noopener" target="_blank">response to the earthquake in Turkey and Syria</a> in February last year, just over a year ago. This was a community led disaster response and a good example of the power of local communities really changing the way disaster response happens. </p>
<p>This response was in collaboration with and led by a Turkish open mapping community called <a href="https://yercizenler.org/en/home/" rel="noreferrer noopener" target="_blank">Yer Çizenler</a>. They sprang into action very quickly when this event happened in collaboration with us. The affected areas were very densely populated and only partially mapped. So we did remote mapping of the affected areas in both Turkey and Syria, and we had almost 7,000 volunteers around the world who joined forces to map about 1.5 million homes and 66,000 kilometres of roads.</p>
<p>It was great in terms of the amount of mapping that was done, but of course mapping is pointless if it’s not used. So the key role of HOT was to make sure that we had partnerships with responding organisations, including government and local communities. I was really proud of this case because maps were used at every single stage, including search and rescue, which is often the hardest thing to get maps used for because you need them in the right hands incredibly quickly. We also had individual doctors, medics, people facilitating the delivery of medical care, people setting up infrastructure for temporary shelters, and a story of someone trying to get electricity to a tent city.</p>
<p>Today, OpenStreetMap is the standard expectation of any humanitarian responder in a crisis. I got a figure the other day that there have been 330,000 downloads of HOT data for humanitarian and development interventions in the last three years, so our data has been used for impact 330,000 times. It’s really amazing the scale we’ve reached, something I’m sure the early dreamers who created HOT out of the Haiti earthquake response in 2010 would be very proud of. </p>
<p><strong>Renata Ávila:</strong> At OKFN we’ve been working hard on standards and data interoperability in projects like <a href="https://frictionlessdata.io/" rel="noreferrer noopener" target="_blank">Frictionless Data</a>. This year we are developing the <a href="https://opendataeditor.okfn.org/" rel="noreferrer noopener" target="_blank">Open Data Editor</a>, which will be a very simple, no-code solution for data manipulation and publishing. Since you specifically mentioned data, I’m curious about the friction you face when working with data. When I say friction, I also mean social friction, institutional friction and so on.</p>
<p><strong>Rebecca Firth: </strong>Indeed, we face many of them. On the institutional side, things are getting better. There’s an expectation in almost every part of our global economy that decisions are going to be based on data, and good data is going to be required, that’s a trend that’s happened in the world over the last 10 years. And I think that’s helpful for us.</p>
<p>In terms of social friction, obviously how you map and who decides how you map is a really contentious issue and one that a lot of people would have different perspectives on. I can think of some personal experiences I’ve had with this. We once did some local mapping with a community in Peru, and we were tagging the village houses made with adobe, which is the name of the local mud bricks that the houses are made of. At some point, that got changed back on the map by someone in another country saying that the buildings should be brick or whatever. So we are standing in this village at this moment and we can see that this is made of adobe. Who has the power in this interaction? The indigenous person in the community or the person who knows how to do the mass undoing of edits? Open communities also generally reflect (and sometimes amplify) the power dynamics of the world. I think part of HOT’s role is really important to help navigate that.</p>
<p>On the technical side, what we’re trying to work on with our team is how to lower the barrier to entry for mapping and using maps. When I started, the tools were just incredibly difficult. They were all open, but it’s not really open if you can’t figure out how to use it. One of the parts of our vision is that everyone can access and contribute to the map, and that open map data is available and used for impact.</p>
<p>So one of the frictions we have is making sure that the process is really open. I’m not as well placed as most members of your community to debate the exact meaning of the word open, but for me, openness is not just about open data, it’s about open processes, it’s about open policies, it’s about making sure that everything is actually accessible and freely usable. We see so many examples of open data that is still impossible for people to use, whether it’s because it’s ridiculously large and you have to pay for cloud hosting, or because it’s a PDF that you can’t really work with. It’s a huge frustration. I wish the open community would take more seriously the definition of openness as accessibility rather than availability, because there’s such a big difference between the two.</p>
<p><strong>Renata Ávila:</strong> Absolutely. I’m glad you mentioned that. We recently <a href="https://blog.okfn.org/2023/10/10/open-knowledge-foundation-joins-the-digital-public-goods-alliance/" rel="noreferrer noopener" target="_blank">joined the Digital Public Good Alliance</a> (DPGA), and that brings me to the importance of standards and the horizontal effects they can have on communities. In particular, the importance of fighting to get those standards adopted by big players, big governments, big aid agencies and so on. </p>
<p>Some of the communities that intersect with both openness and maps are those working to mitigate the climate crisis. Of course, there is an element of unpredictability in natural disasters like earthquakes. Coming from Guatemala, I can tell you that you can never predict when the next big one is going to hit. But we can certainly predict other impending disasters. How can members of the open knowledge community work better with members of your communities to join our efforts in trying to solve the most pressing problems of our time, such as the climate crisis?</p>
<p><strong>Rebecca Firth:</strong> I didn’t think the conversation would go in that direction, and I’m glad it did because thematically I think it’s so important for people in open communities to engage with why climate change is such a big deal. Like all NGOs, we are used to working on impact areas by categories, such as public health, disaster response, gender equality, displacement, safe migration, climate resilience and sustainability. But what is happening now is that we have climate-related disasters, which lead to displacement, which leads to disease outbreaks, which disproportionately affect women and girls, and so on. So all of these impacts are now completely overlapping and cross-cutting. We need open data highways behind and above all of these areas. Never before has there been a greater need to have a panacea of data that touches on all of these issues, rather than siloed efforts.</p>
<p>Climate is a very, very local thing. The way you experience climate change is going to be radically different depending on where you live. At the moment there are climate models produced by scientists and universities that show a whole country as red, amber or green, based on a rating that is simply not the experience of the people who live there. Even at the city level, there are huge differences for people who live in vulnerable housing, or at the bottom of a hill, or in places where there is no shade, and so on. Experts are looking at this problem at a global level, but the point is how to visualise it locally and add some truth to these reports.</p>
<p>Sometimes I have conversations with people who don’t want to fund climate mitigation work because they’re interested in funding emissions reductions. They say “We’re not there yet, it’s 10, 20 years away”. And that is not true! We are working with communities that are affected by climate change right now. It’s just that these funders don’t know about it because they see a global model that turns a particular country or locality green.</p>
<p>I really think there’s an important role for open communities. The thing that would help us collaborate and get there faster is a really honest commitment to a minimum-viable product. Here’s an example. There’s this amazing project in Liberia with <a href="https://www.ilabliberia.org/" rel="noreferrer noopener" target="_blank">iLab Liberia</a> where they’re trying to map the resilience of buildings in coastal cities to flash floods. And they did it by mapping how deep the foundations are by the number of fingers. The map shows where all the buildings are with foundations that are one finger deep, two fingers deep, three fingers deep and so on. This has a huge impact on how resilient that building will be before the next flood.</p>
<p>Something similar happened in Tanzania, where they tried to record historical flash floods according to each resident’s memory of how far the water reached their body. I would consider that really good data. </p>
<p>That’s the kind of thing our communities need to work together on the most. What is the minimum data needed to solve this problem? If we can get that, we’ll be fine. But if we’re arguing about data models and schemas and not everything being perfect, then we’re never going to get out of that conversation.</p>
<p><strong>Lucas Pretti:</strong> I really like that. I think we are on the right path of collaboration, starting with a very minimum viable product, which is <a href="https://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day 2024</a> as part of <a href="https://www.hotosm.org/opensummit23-24" rel="noreferrer noopener" target="_blank">HOT OpenSummit ’23-24</a>. Your <a href="https://blog.okfn.org/2024/02/28/and-the-winners-of-the-open-data-day-2024-mini-grants-are/" rel="noreferrer noopener" target="_blank">sponsorship through mini-grants focused on open mapping activities</a> was a game changer this year. I think our two organisations share a recent practice of moving away from centralised, self-focused events towards supporting community events. I’d like you to talk about that. Do you think it’s a trend among global organisations that are as embedded in communities as we are?</p>
<p><strong>Rebecca Firth:</strong> You are right, we used to have this thing called the <a href="https://summit.hotosm.org/" rel="noreferrer noopener" target="_blank">HOT Summit</a>, which was a wonderful event, but it was a conference for us, limited to 200 people who could attend. I think it basically did not do what it was supposed to do. So, thanks to the community working group and the community staff at HOT, we took a completely different approach and asked ourselves, where are people talking about open mapping and open data and how can we support them to do that better?</p>
<p>So they came up with the idea of the OpenSummit. The idea is that HOT can support a range of different global events, from conferences to workshops to just hosting a session at another big event, and so on. It really opens up who can participate. Last year we supported 13 events. There were 113 sessions on open mapping attended by 300 people. And we also had 122 scholarships for community mappers to go to those events. So it’s been amazing in terms of really opening us up and getting out of ourselves. </p>
<p>I think it has a similar ethos to Open Data Day. I mean, we both want people to build partnerships and networks and collaborations to do their own thing. What we’ve learned from this is yet another example of how sticky the community is. Leaders are nurtured through these events because it’s the relationships they spark that keep them going. The more events we can support, the better chance we have of finding people who want to lead this mapping work in their countries.</p>
<p><strong>Renata Ávila:</strong> Beyond Open Data Day, the same thing is happening with the <a href="https://okfn.org/en/network/" rel="noreferrer noopener" target="_blank">Open Knowledge Network</a> and our <a href="https://network.okfn.org/specialist/" rel="noreferrer noopener" target="_blank">Global Directory</a>. It is almost like having different red phones everywhere that you can just call when something is happening in a country. Exponential change happens when local community members take action and share their knowledge to help the wider community.</p>
<p><strong>Rebecca Firth:</strong> Building on that, I think one mistake we’ve made in the past is communicating only in terms of big numbers: we’re going to cover an area of “one billion people”, we need “one million volunteers”, we’re going to work in “94 countries”. These huge top lines are obviously important when you’re defining a key overarching mission for an organisation, and we are doing them. But, the reality is often that the majority of local mapping is done by less than 10 people. </p>
<p>So practically, in the work we do each day, we don’t need to bring 3,000 people to a conference about HOT. We need to turn those three local mappers into six local mappers, and that will lead us to double the amount of local data available. And that will get us to one billion! One thing I’ve learned from the community working groups is that mass global campaigns may reach a lot of people, but not in a way that deeply nourishes a local community, and that’s what we need to focus on.</p>
Open Knowledge Foundation
https://blog.okfn.org
David Rosenthal: Microsoft's Archival Storage Research
tag:blogger.com,1999:blog-4503292949532760618.post-6134157773402508210
2024-03-05T22:41:38+00:00
<table border="2" cellpadding="5" cellspacing="0" class="tr-caption-container" rules="groups" style="float: right; margin-left: 1em; text-align: right;">
<caption align="bottom"><a href="https://doi.org/10.1063/1.5007621">2016 Media Shipments</a></caption>
<tbody>
</tbody>
<colgroup span="1">
</colgroup><colgroup span="3">
</colgroup><tbody>
<tr>
<td colspan="1"><br /></td>
<th align="center">Exabytes</th>
<th align="center">Revenue</th>
<th align="center">$/GB</th>
</tr>
</tbody><tbody>
<tr>
<th>Flash</th><td>120</td><td>$38.7B</td><td>$0.320</td>
</tr>
<tr>
<th>Hard Disk</th><td>693</td><td>$26.8B</td><td>$0.039</td>
</tr>
<tr>
<th>LTO Tape</th><td>40</td><td>$0.65B</td><td>$0.016</td>
</tr>
</tbody></table>
Six years ago I wrote <a href="https://blog.dshr.org/2018/03/archival-media-not-good-business.html"><i>Archival Media: Not a Good Business</i></a> and included this <a href="https://doi.org/10.1063/1.5007621">table</a>. The argument went as follows:<br />
<ul>
<li>The value that can be extracted from data decays rapidly with time.</li>
<li>Thus companies would rather invest in current than archival data.</li>
<li>Thus archival media and systems are a niche market.</li>
<li>Thus archival media and systems lack the manufacturing volume to drive down prices.</li>
<li>Thus although quasi-immortal media have low opex (running cost), they have high capex (purchase cost).</li>
<li>Especially now interest rates are non-zero, the high capex makes the net present value of their lifetime cost high.</li>
<li>Archival media compete with legacy generic media, which have mass-market volumes and have already amortized their R&D, so have low capex but higher opex through their shorter service lives.</li>
<li>Because they have much higher volumes and thus much more R&D, generic media have much higher Kryder rates, meaning that although they need to be replaced over time, each new unit at approximately equal cost replaces several old units, reducing opex.</li>
<li>Especially now interest rates are non-zero, the net present value of the lower capex but higher opex is likely very competitive.</li>
</ul>
Below the fold I look into why, despite this, Microsoft has been pouring money into archival system R&D for about a decade.<br />
<span><a name="more"></a></span>
<h3>Background</h3>
Eleven years ago Facebook announced they were building entire <a href="https://blog.dshr.org/2013/02/facebooks-cold-storage.html">data centers for cold storage</a>. They expected the major reason for reads accessing this data would be subpoenas. Eighteen months later I was finally able to report on Kestutis Patiejunas' talk explaining the technology and why it made sense in <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html"><i>More on Facebook's "Cold Storage"</i></a>. They used two different technologies, the first exploiting legacy generic media in the form of mostly-powered-down hard drives. Facebook's <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html">design for this was</a>:<br />
<blockquote>
aimed at limiting the worst-case power draw. It exploits the fact that this storage is at the bottom of the storage hierarchy and can tolerate significant access latency. Disks are assigned to groups in equal numbers. One group of disks is spun up at a time in rotation, so the worst-case access latency is the time needed to cycle through all the disk groups. But the worst-case power draw is only that for a single group of disks and enough compute to handle a single group.<br />
<br />
Why is this important? Because of the synergistic effects knowing the maximum power draw enables. The power supplies can be much smaller, and because the access time is not critical, need not be duplicated. Because Facebook builds entire data centers for cold storage, the data center needs much less power and cooling. It can be more like cheap warehouse space than expensive data center space. Aggregating these synergistic cost savings at data center scale leads to really significant savings.
</blockquote>
Patiejunas figured out that, only at cloud scale, the economics of archival storage could be made to work by reducing the non-media, non-system costs. This insight led to the second technology, <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html">robots full of long-lived optical media</a>:<br />
<blockquote>
It has 12 Blu-ray drives for an entire rack of cartridges holding 10,000 100TB Blu-ray disks managed by a robot. When the robot loads a group of 12 fresh Blu-ray disks into the drives, the appropriate amount of data to fill them is read from the currently active hard disk group and written to them. This scheduling of the writes allows for effective use of the limited write capacity of the Blu-ray drives. If the data are ever read, a specific group has to be loaded into the drives, interrupting the flow of writes, but this is a rare occurrence. Once all 10,000 disks in a rack have been written, the disks will be loaded for reads infrequently. Most of the time the entire Petabyte rack will sit there idle.
</blockquote>
In theory Blu-ray disks have a 50-year life, but this is <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html">irrelevant</a>:
<blockquote>
No-one expects the racks to sit in the data center for 50 years, at some point before then they will be obsoleted by some unknown new, much denser and more power-efficient cold storage medium
</blockquote>
Eight years ago in <a href="https://blog.dshr.org/2016/05/the-future-of-storage.html"><i>The Future Of Storage</i></a> I explained:<br />
<blockquote>
Every few months there is another press release announcing that some new, <a href="http://blog.dshr.org/2014/06/more-on-long-lived-media.html">quasi-immortal medium</a> such as 5D quartz or stone DVDs has solved the problem of long-term storage. But the problem stays resolutely unsolved. Why is this? Very long-lived media are inherently more expensive, and are a niche market, so they lack economies of scale. Seagate could easily make <a href="http://www.digitalpreservation.gov/news/events/other_meetings/storage09/docs/5-4_Anderson-seagate-v3_archive_study.pdf">disks with archival life</a>, but they did a study of the market for them, and discovered that no-one would pay the relatively small additional cost. The drives currently marketed for "archival" use have a shorter warranty and a shorter MTBF than the enterprise drives, so they're not expected to have long service lives.<br />
<br />
The fundamental problem is that long-lived media only make sense at very low Kryder rates. Even if the rate is only 10%/yr, after 10 years you could store the same data in 1/3 the space. Since space in the data center racks or even at Iron Mountain isn't free, this is a powerful incentive to move old media out. If you believe that Kryder rates will get back to 30%/yr, after a decade you could store 30 times as much data in the same space.
</blockquote>
The key parameter of these archival storage systems isn't the media life, it is the write bandwidth needed to keep up with the massive flow of <a href="https://blog.dshr.org/2014/09/more-on-facebooks-cold-storage.html">data to be archived</a>:<br />
<blockquote>
While a group of disks is spun up, any reads queued up for that group are performed. But almost all the I/O operations to this design are writes. Writes are erasure-coded, and the shards all written to different disks in the same group. In this way, while a group is spun up, all disks in the group are writing simultaneously providing huge write bandwidth. When the group is spun down, the disks in the next group take over, and the high write bandwidth is only briefly interrupted.
</blockquote>
The lesson Facebook taught a decade ago was that the keys to cost-effective archival storage were first, massive scale, and second, high write bandwidth. Microsoft has learned the lesson and has been working to develop cloud-scale, high write bandwidth systems using two quasi-immortal media, DNA and silica.<br />
<h3>DNA</h3>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgqldpbCFfWyenDsCcOHRkumVjQEiOMB8Vxv6nSacYWJVuyYmy4pLbk2AnmPuRRZJu9b4hyphenhyphenYElZf_U6DKOulSPK9IwY4aA5BMIXYVSJMkFDTI0k3m__IdN4IGOieXr6Y4pep7zJJo1Or96-7pbPZYKoRseeMepNGUp2IVlwX5aLJEI0VfWDPA9RfKnVvPqA/s964/DNA-Prototype.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="122" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgqldpbCFfWyenDsCcOHRkumVjQEiOMB8Vxv6nSacYWJVuyYmy4pLbk2AnmPuRRZJu9b4hyphenhyphenYElZf_U6DKOulSPK9IwY4aA5BMIXYVSJMkFDTI0k3m__IdN4IGOieXr6Y4pep7zJJo1Or96-7pbPZYKoRseeMepNGUp2IVlwX5aLJEI0VfWDPA9RfKnVvPqA/w200-h122/DNA-Prototype.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://spectrum.ieee.org/dna-data-storage">Source</a></td></tr></tbody></table>
I have been tracking the development of DNA as a storage medium since 2012, when I wrote <a href="https://blog.dshr.org/2012/10/forcing-frequent-failures.html"><i>Forcing Frequent Failures</i></a> based on a <a href="http://www.extremetech.com/extreme/134672-harvard-cracks-dna-storage-crams-700-terabytes-of-data-into-a-single-gram">Harvard team</a> writing 700KB. Much of the recent progress has been driven by the collaboration between <a href="https://www.microsoft.com/en-us/research/project/dna-storage/">Microsoft Research and the University of Washington</a>. In 2019 they published the first <a href="https://doi.org/10.1038/s41598-019-41228-8">automated write-to-read prototype system</a> (pictured), <a href="https://spectrum.ieee.org/dna-data-storage">which was slow</a>:<br />
<blockquote>
This single-channel device, which occupied a tabletop, had a throughput of 5 bytes over approximately 21 hours, with all but 40 minutes of that time consumed in <a href="https://news.microsoft.com/source/features/innovation/hello-data-dna-storage/">writing “HELLO” into the DNA</a>.
</blockquote>
<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhKXQ3KU9GFfnb9O-ZpTrrytpkUxaRqpYZmL-Y-facKjxQtn019OboY0cL7hiIfq5MuiiGlN4x8cPWNQSymVftq6QYwoZCAsocqcEhr9Z7cZQ-oaTS7CqWWvLOoRqfcamVsRoPC6syBHVnNfYT-cHuQLDFoBQJWNIoMqidkJ5-r3No6nGlV1bZtemwEmruM/s857/Gartner.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="139" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhKXQ3KU9GFfnb9O-ZpTrrytpkUxaRqpYZmL-Y-facKjxQtn019OboY0cL7hiIfq5MuiiGlN4x8cPWNQSymVftq6QYwoZCAsocqcEhr9Z7cZQ-oaTS7CqWWvLOoRqfcamVsRoPC6syBHVnNfYT-cHuQLDFoBQJWNIoMqidkJ5-r3No6nGlV1bZtemwEmruM/w200-h139/Gartner.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://spectrum.ieee.org/dna-data-storage">Source</a></td></tr></tbody></table>
Rob Carlson's <a href="https://spectrum.ieee.org/dna-data-storage"><i>The Quest for a DNA Data Drive</i></a> provides a useful overview of the current state of the art. Alas he starts by using one of my pet hates, the graph showing an immense gap between the "requirements" for data storage and the production of storage media. Carlson <a href="https://spectrum.ieee.org/dna-data-storage">captions the graph</a>:<br />
<blockquote>
Prior projections for data storage requirements estimated a global need for about 12 million petabytes of capacity by 2030. The research firm Gartner recently issued new projections, raising that estimate by 20 million petabytes. The world is not on track to produce enough of today’s storage technologies to fill that gap.
</blockquote>
<b><rant></b> Carlson's point is to suggest that there is a huge market for DNA storage. But this ignores economics. There will always be a "requirement" to store more data than the production of storage media, because some data is not valuable enough to justify the cost of storing it. The "gap" could only be filled if media were free. Customers will buy the storage systems they can afford and prioritize the data in them according to the value that can be extracted from it. <b></rant></b>.<br />
<br />
The size of the market for DNA storage systems depends upon their cost. In 2018's <a href="https://blog.dshr.org/2018/02/dnas-niche-in-storage-market.html"><i>DNA's Niche in the Storage Market</i></a> I imagined myself as the marketing person for a DNA storage system and posed these challenges:<br />
<blockquote>
Engineers, your challenge is to increase the speed of synthesis by a factor of a quarter of a trillion, while reducing the cost by a factor of fifty trillion, in less than 10 years while spending no more than $24M/yr.<br />
<br />
Finance team, your challenge is to persuade the company to spend $24M a year for the next 10 years for a product that can then earn about $216M a year for 10 years.
</blockquote>
Carlson is realistic about the <a href="https://spectrum.ieee.org/dna-data-storage">engineering challenges</a>:<br />
<blockquote>
For a DNA drive to compete with today’s archival tape drives, it must be able to write about 2 gigabits per second, which at demonstrated DNA data storage densities is about 2 billion bases per second. To put that in context, I estimate that the total global market for synthetic DNA today is no more than about 10 terabases per year, which is the equivalent of about 300,000 bases per second over a year. The entire DNA synthesis industry would need to grow by approximately 4 orders of magnitude just to compete with a single tape drive.
</blockquote>
One of the speed-ups that is needed is <a href="https://spectrum.ieee.org/dna-data-storage">chip-based massive parallelism</a>:<br />
<blockquote>
One of our goals was to build a semiconductor chip to enable high-density, high-throughput DNA synthesis. <a href="https://www.science.org/doi/10.1126/sciadv.abi6714">That chip</a>, which we completed in 2021, demonstrated that it is possible to digitally control electrochemical processes in millions of 650-nanometer-diameter wells.
</blockquote>
It was faster, but it's chemistry had <a href="https://spectrum.ieee.org/dna-data-storage">problems</a>:<br />
<blockquote>
The main problem is that it employs a volatile, corrosive, and toxic organic solvent (<a href="https://pubchem.ncbi.nlm.nih.gov/compound/Acetonitrile">acetonitrile</a>), which no engineer wants anywhere near the electronics of a working data center.<br />
<br />
Moreover, based on a sustainability analysis of a theoretical DNA data center performed my colleagues at Microsoft, I conclude that the volume of acetonitrile required for just one large data center, never mind many large data centers, would become logistically and economically prohibitive.
</blockquote>
Although it is the industry standard, this isn't the only way to <a href="https://spectrum.ieee.org/dna-data-storage">write DNA</a>:<br />
<blockquote>
Fortunately, there is a different emerging technology for constructing DNA that does not require such solvents, but instead uses a benign salt solution. Companies like <a href="https://www.dnascript.com/">DNA Script</a> and <a href="https://molecularassemblies.com/">Molecular Assemblies</a> are commercializing automated systems that use enzymes to synthesize DNA. These techniques are replacing traditional chemical DNA synthesis for some applications in the biotechnology industry. The current generation of systems use either simple plumbing or light to control synthesis reactions. But it’s difficult to envision how they can be scaled to achieve a high enough throughput to enable a DNA data-storage device operating at even a fraction of 2 gigabases per second.
</blockquote>
The Microsft/UW chip is an <a href="https://spectrum.ieee.org/dna-data-storage">alternative way to control enzymatic synthesis</a>:<br />
<blockquote>
The University of Washington and Microsoft team, collaborating with the enzymatic synthesis company Ansa Biotechnologies, recently took the first step toward this device. Using our high-density chip, we successfully demonstrated <a href="https://doi.org/10.1021/acssynbio.3c00044">electrochemical control of single-base enzymatic additions</a>.
</blockquote>
The link is to <a href="https://doi.org/10.1021/acssynbio.3c00044"><i>Spatially Selective Electrochemical Cleavage of a Polymerase-Nucleotide Conjugate</i></a> by Jake A. Smith <i>et al</i>:<br />
<blockquote>
Novel enzymatic methods are poised to become the dominant processes for de novo synthesis of DNA, promising functional, economic, and environmental advantages over the longstanding approach of phosphoramidite synthesis. Before this can occur, however, enzymatic synthesis methods must be parallelized to enable production of multiple DNA sequences simultaneously. As a means to this parallelization, we report a polymerase-nucleotide conjugate that is cleaved using electrochemical oxidation on a microelectrode array. The developed conjugate maintains polymerase activity toward surface-bound substrates with single-base control and detaches from the surface at mild oxidative voltages, leaving an extendable oligonucleotide behind. Our approach readies the way for enzymatic DNA synthesis on the scale necessary for DNA-intensive applications such as DNA data storage or gene synthesis.
</blockquote>
This has, as the article points out, potential for dramatically reducing the current cost of writing DNA, but it is still many orders of magnitude away from being competitive with a tape drive. The <a href="https://blog.dshr.org/search?q=Pangloss&max-results=20&by-date=true">good Dr. Pangloss</a> can continue to enjoy the optimism for many more years.<br />
<h3>Silica</h3>
The idea of writing data into fused silica with a femtosecond laser is at least a <a href="https://www.storagenewsletter.com/2014/10/31/rd-hitachi-rw-of-digital-data-in-fused-silica-glass/">decade and a half old</a><br />
<blockquote>
In 2009, Hitachi focused on fused silica known for its excellent heat and water resistance as a recording medium for long-term digital storage. After proposing the use of computed tomography to read data recorded with a femtosecond pulse laser, fused silica glass was confirmed as an effective storage medium. Applying this technology, it is possible to achieve multi-layer recording by changing the laser’s focal point to form microscopic regions (dots) with differing refractive indices. In 2012, a method was developed with Kyoto University to read the recorded dots in 4 layers using an optical microscope (recording density equivalent to a CD), and in 2013, 26 layer recording was achieved (recording density equivalent to a DVD). In order to increase recording density for practical applications, one means is to increase the number of recording layers. At the 100-layer level of recording density equivalent to a Blu-ray disc however issues arose in dot quality degradation and read errors resulting from crosstalk of data recorded in other layers.
</blockquote>
But in the last few years Microsoft Research has taken this idea and run with it, as they report in a 68-author paper at SOSP entitled <a href="https://doi.org/10.1145/3600006.3613208"><i>Project Silica: Towards Sustainable Cloud Archival Storage in Glass</i></a>. It is a fascinating paper that should be read by anyone interest in archival storage.
Like <a href="https://blog.dshr.org/2018/02/dnas-niche-in-storage-market.html">me</a>, the authors are skeptical of the near-term prospects for <a href="https://doi.org/10.1145/3600006.3613208">DNA storage</a>:<br />
<blockquote>
Technologies like DNA storage offer the promise of an extremely dense media for long-term data storage. However, the high costs and low throughputs of oligonucleotide synthesis and sequencing continue to hamper the technology’s feasibility. The total amount of data demonstrably stored in DNA remains on the order of MBs, and building a functional storage system that can offer reasonable SLAs underpinned by DNA is a substantial challenge. Alternative DNA-based technologies like dNAM attempt to bypass costly sequencing and synthesis steps, sacrificing density down to densities comparable with magnetic tape.
</blockquote>
Hence their focus on a medium with, in theory, a somewhat lower volumetric density. From their <a href="https://doi.org/10.1145/3600006.3613208">abstract</a>:<br />
<blockquote>
This paper presents Silica: the first cloud storage system for archival data underpinned by quartz glass, an extremely resilient media that allows data to be left in situ indefinitely. The hardware and software of Silica have been co-designed and co-optimized from the media up to the service level with sustainability as a primary objective. The design follows a cloud-first, data-driven methodology underpinned by principles derived from analyzing the archival workload of a large public cloud service.
</blockquote>
Their analysis of the workload of a tape-based cloud archival storage system in Section 2 <a href="https://doi.org/10.1145/3600006.3613208">shows that</a>:<br />
<blockquote>
on average for every MB read there are 47 MBs written, and for every read operation there are 174 writes. We can see some variation across months, but writes always dominate by over an order of magnitude.<br />
...<br />
Small files dominate the workload, with 58.7% of the reads for files of 4 MiB or smaller. However, these reads only contribute 1.2% of the volume of data read. Files larger than 256 MiB comprise around 85% of bytes read but less than 2% of total read requests. Additionally, there is a long tail of request sizes: there is ∼ 10 orders of magnitude between the smallest and largest requested file sizes.<br />
...<br />
We observe a variability in the workload within data centers, with up to 7 orders of magnitude difference between the median and the tail, as well as large variability across different data centers.<br />
...<br />
At the granularity of a day, the peak daily [ingress] rate is ∼16x higher than the mean daily rate. As the aggregation time increases beyond 30 days, the peak over mean ratio decreases significantly down to only ∼2, indicating that the average write rate is similar across different 30-day windows.<br />
...<br />
To summarize, as expected for archival storage, the workload is <i>heavily write-dominated</i>. However, unexpectedly, the IO operations are dominated by <i>small file accesses</i>.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgP3MmIfoEohS5K_2OPkmJsNV7rfL-qRf-eNHdGT-PFJPzZjwoq-X6pDHC0SJvqbDk7NOWKuO9aDBNiOdKKcFu9kpsqT3Hv5HFbgir51mzUV1YAJ08OUwnK2NMJWyVbZxKJUNe_zoSL8LuN/s647/FacebookWarmFig3.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="192" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgP3MmIfoEohS5K_2OPkmJsNV7rfL-qRf-eNHdGT-PFJPzZjwoq-X6pDHC0SJvqbDk7NOWKuO9aDBNiOdKKcFu9kpsqT3Hv5HFbgir51mzUV1YAJ08OUwnK2NMJWyVbZxKJUNe_zoSL8LuN/w200-h192/FacebookWarmFig3.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-muralidhar.pdf">Figure 3</a></td></tr></tbody></table>
Subramanian Muralidhar and a team from Facebook, USC and Princeton had a paper at the 2014 OSDI that described Facebook's warm layer above the two cold storage layers and below <a href="https://www.usenix.org/event/osdi10/tech/full_papers/Beaver.pdf">Haystack</a>, the hot storage layer. Section 3 of <a href="https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-muralidhar.pdf"><i>f4: Facebook's Warm BLOB Storage System</i></a> provides workload data for this layer, which filters out most of the IOPS before they hit the archival layers. I explained in <a href="https://blog.dshr.org/2014/10/facebooks-warm-storage.html"><i>Facebook's Warm Storage</i></a> that:<br />
<blockquote>
A BLOB is a Binary Large OBject. Each type of BLOB contains a single type of immutable binary content, such as photos, videos, documents, etc.<br />
...<br />
Figure 3 shows that the rate of I/O requests to BLOBs drops rapidly through time. The rates for different types of BLOB drop differently, but all 9 types have dropped by 2 orders of magnitude within 8 months, and all but 1 (profile photos) have dropped by an order of magnitude within the first week.<br />
<br />
The vast majority of Facebook's BLOBs are warm, as shown in Figure 5 - notice the scale goes from 80-100%. Thus the vast majority of the BLOBs generate I/O rates at least 2 orders of magnitude less than recently generated BLOBs.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhdiS9TuQFo-H4BD1VrPF_uw833Zbrc0cA2A_SuoOfp2tam3Fo7tNqEDjRPtXck9zJ9zm5af1wRHK69ipZjJKp45gHZa7PWMPeisjA6oqWnufM_2GCW7PA7sHbxjDdSvkMLnuRHeAB6HnpF/s595/FacebookFigure5.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="149" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhdiS9TuQFo-H4BD1VrPF_uw833Zbrc0cA2A_SuoOfp2tam3Fo7tNqEDjRPtXck9zJ9zm5af1wRHK69ipZjJKp45gHZa7PWMPeisjA6oqWnufM_2GCW7PA7sHbxjDdSvkMLnuRHeAB6HnpF/w200-h149/FacebookFigure5.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-muralidhar.pdf">Figure 5</a></td></tr></tbody></table>
Note these important differences between Microsoft's and Facebook's storage hierarchies:<br />
<ul>
<li>Microsoft stores generic data and depends upon user action to migrate it down the hierarchy to the archive layer, whereas Facebook stores 9 specific types of application data which is migrated automatically based upon detailed knowledge of the workload for each of the types.</li>
<li>Because Facebook can migrate data automatically, it can interpose a warm layer above the archive layer of the hierarchy, and because it has detailed knowledge about the behavior of each of the data types it can make good decision about when to move each type down the hierarchy.</li>
<li>Because the warm layer responds to the vast majority of the read requests and schedules the downward migrations, Facebook's archive layer's IOPS are a steady flow of large writes with very few reads, making efficient use of the hardware.</li>
</ul>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhOK0u8vnqW0yd42tTDcpfQpqklYoJx1PkPhusifIk_aYgSXNluVydDdAswnbB_WAScPTI0for7eQFstp7fGX6T6IVEElNeKHzUS0OSxQw6lXTkbnVeev7SKLmdTT-E6-sNU4QO4Hk0xInSwXDvPqLhs8-x6nrq_Vxz7MWl7E1S4Zhiu15hOjvDGYXdPeEi/s903/SilicaFig2.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="104" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhOK0u8vnqW0yd42tTDcpfQpqklYoJx1PkPhusifIk_aYgSXNluVydDdAswnbB_WAScPTI0for7eQFstp7fGX6T6IVEElNeKHzUS0OSxQw6lXTkbnVeev7SKLmdTT-E6-sNU4QO4Hk0xInSwXDvPqLhs8-x6nrq_Vxz7MWl7E1S4Zhiu15hOjvDGYXdPeEi/w200-h104/SilicaFig2.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://doi.org/10.1145/3600006.3613208">Figure 2</a></td></tr></tbody></table>
Contrast Facebook's scheduled ingest flow with the bursty ingest rate shown in Figure 2 of the Silica paper, <a href="https://doi.org/10.1145/3600006.3613208">which finds that</a>:<br />
<blockquote>
At the granularity of a day, the peak daily rate is ∼16x higher than the mean daily rate.
</blockquote>
Another interesting aspect of the Silica design is that the technologies, and thus the hardware, used for writing and reading are completely different. The authors point out the <a href="https://doi.org/10.1145/3600006.3613208">implications for the design</a>:<br />
<blockquote>
As different technologies are used to read and write, after a platter is written it must be fully read using the same technology that will be used to read it subsequently. This happens before a platter is stored in the library and any staged write data is deleted.<br />
<br />
This design has an interesting consequence: during the period when user data is being written into the library, the workload is going to become <i>read</i>-dominated. Every byte written must be read to be verified, in addition to the user reads. The read bandwidth has to be provisioned for peak user read rate, however the read workloads are very bursty, so read drive utilization is extremely low on average. Thus, the verification workload simply utilizes what would otherwise be idle read drives.
</blockquote>
Using separate write and read drives has <a href="https://doi.org/10.1145/3600006.3613208">two advantages</a>:<br />
<blockquote>
This allows independent scaling of read and write throughput. Additionally, this design allows us to create the first storage system that offers true <i>air-gap-by-design</i>: the robotics are unable to insert a glass platter into a write drive once the glass media has been written.
</blockquote>
Since the majority of reads are for verification, the design needs to make specific provision for <a href="https://doi.org/10.1145/3600006.3613208">user reads</a>:<br />
<blockquote>
To enable high drive efficiency, two platters can be mounted simultaneously in a read drive: one undergoing verification, and one servicing a customer read. Customer traffic is prioritized over verification, with the read drive switching away when a platter is mounted for a customer read. As soon as the customer platter stops being accessed, the read drive has the ability to quickly switch to the other platter and continue verification. This is similar to avoiding head-of-line blocking of mice flows by elephant flows in networked systems.
</blockquote>
Like Facebook's, the prototype Silica systems are <a href="https://doi.org/10.1145/3600006.3613208">data-center size</a>:<br />
<blockquote>
A <i>Silica library</i> is a sequence of contiguous write, read, and storage racks interconnected by a platter delivery system. Along all racks there are parallel horizontal rails that span the entire library. We refer to a side of the library (spanning all racks) as a <i>panel</i>. A set of <i>free roaming</i> robots called <i>shuttles</i> are used to move platters between locations.<br />
...<br />
A read rack contains multiple read drives. Each read drive is independent and has slots into which platters are inserted and removed. The number of shuttles active on a panel is limited to twice the number of read drives in the panel. The write drive is full-rack-sized and writes multiple platters concurrently.
</blockquote>
Their performance evaluation focuses on the ability to respond to read requests within 15 hours. Their cost evaluation, like Facebook's, focuses on the savings from using warehouse-type space to house the equipment, although is isn't clear that they have actually done so. The rest of their cost evaluation is somewhat hand-wavy, as is natural for a system that <a href="https://doi.org/10.1145/3600006.3613208">isn't yet in production</a>:<br />
<blockquote>
The Silica read drives use polarization microscopy, which is a commoditized technique widely used in many applications and is low-cost. Currently, system cost in Silica is dominated by the write drives, as they use femtosecond lasers which are currently expensive and used in niche applications. This highlights the importance of resource proportionality in the system, as write drive utilization needs to be maximized in order to minimize costs. As the Silica technology proliferates, it will drive up the demand for femtosecond lasers, commoditizing the technology.
</blockquote>
I'm skeptical of this last point. Archival systems are a niche in the IT market, and one on which companies are loath to spend money, The only customers for systems like Silica are the large cloud providers, who will be reluctant to commit their archives to technology owned by a competitor. Unless a mass-market application for femtosecond lasers emerges, the scope for cost reduction is limited.<br />
<h3>Conclusion</h3>
Six years ago <a href="https://blog.dshr.org/2018/02/dnas-niche-in-storage-market.html">I wrote</a>:<br />
<blockquote>
<a href="http://blog.dshr.org/2016/12/the-medium-term-prospects-for-long-term.html">time-scales in the storage industry are long</a>. Disk is a <a href="https://en.wikipedia.org/wiki/History_of_IBM_magnetic_disk_drives#Early_IBM_HDDs">60-year-old technology</a>, tape is at least <a href="https://en.wikipedia.org/wiki/IBM_7_track">65 years old</a>, CDs are <a href="https://en.wikipedia.org/wiki/Compact_disc">35 years old</a>, flash is <a href="https://www.google.com/patents/US5095344">30 years old</a> and has yet to impact bulk data storage.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj-ZpPqvkf3GXj_kN5rc__s2L5rvYJFx7FAmoP_wx6B-74cGM0d6zq6Y1HZdiuKwCRMx91LIhSAOod9lRwnBjlKQh8mwXLNSbQqy3nRhk6fYJRY4W8jniBiE4E8Bp5dgQZF_H6ghOlIIkdYl1JQK8wkgW3TKLT0Z92Pb2BNOgRiRyHlYVCeR-yk3CZ4CA/s800/BitShipments.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="83" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj-ZpPqvkf3GXj_kN5rc__s2L5rvYJFx7FAmoP_wx6B-74cGM0d6zq6Y1HZdiuKwCRMx91LIhSAOod9lRwnBjlKQh8mwXLNSbQqy3nRhk6fYJRY4W8jniBiE4E8Bp5dgQZF_H6ghOlIIkdYl1JQK8wkgW3TKLT0Z92Pb2BNOgRiRyHlYVCeR-yk3CZ4CA/w200-h83/BitShipments.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://digitalpreservation.gov/meetings/DSA2023/loc_dsa2023_website_0104_lauhoff_Storage%20Landscape%20__0326.pdf">Source</a></td></tr></tbody></table>
Six years on flash has finally impacted the bulk storage market, but it isn't predicted to ship as many bits as hard disks for another four years, when it will be a 40-year-old technology. Actual demonstrations of DNA storage are only 12 years old, and similar demonstrations of silica media are 15 years old. History suggests it will be decades before these technologies impact the storage market.<br />
<br />
David. (noreply@blogger.com)
https://blog.dshr.org/
HangingTogether: Advancing IDEAs: Inclusion, Diversity, Equity, Accessibility, 5 March 2024
https://hangingtogether.org/?p=14222
2024-03-05T20:30:01+00:00
<p class="has-small-font-size"><em>The following post is one in a regular <a href="https://hangingtogether.org/tag/IDEA/" rel="noreferrer noopener" target="_blank">series</a> on issues of Inclusion, Diversity, Equity, and Accessibility, compiled by a team of OCLC contributors.</em></p>
<figure class="wp-block-image size-large"><a href="https://hangingtogether.org/wp-content/uploads/2024/03/rolands-zilvinskis-cPxRBHechRc-unsplash-scaled.jpg"><img alt="Hand holding a blank framed sign in front of a blurry leafy background." class="wp-image-14223" height="683" src="https://hangingtogether.org/wp-content/uploads/2024/03/rolands-zilvinskis-cPxRBHechRc-unsplash-1024x683.jpg" width="1024" /></a>Photo by <a href="https://unsplash.com/@rolzay?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Rolands Zilvinskis</a> on <a href="https://unsplash.com/photos/person-holding-rectangular-white-and-black-frame-cPxRBHechRc?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Unsplash</a>
</figure>
<h2 class="wp-block-heading">Teaching with primary sources, the African American Jewish community</h2>
<p>Using resources primarily from the Library of Congress (OCLC Symbol: DLC), the <a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.loc.gov%2Fprograms%2Fteachers%2Fabout-this-program%2Fteaching-with-primary-sources-partner-program%2F&data=05%7C02%7Cproffitm%40oclc.org%7C0b15cbd47a814f2d872708dc396da681%7C516a75d7dc984163a03ff918d2a2bc9a%7C0%7C0%7C638448390304761533%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=Al5xZZTbjbdZ7dm%2F6vh1qWDFl6CecSWjpZwleng69ws%3D&reserved=0">Teaching with Primary Sources</a> (TPS) Partner Program funds grants to grow a nationwide network of organizations that create and share educational programming and tools. The <a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Ftpsteachersnetwork.org%2F&data=05%7C02%7Cproffitm%40oclc.org%7C0b15cbd47a814f2d872708dc396da681%7C516a75d7dc984163a03ff918d2a2bc9a%7C0%7C0%7C638448390304770967%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=b2VqTYitRinScq9x5PKLorcipFNGo%2FyTvaFFBWOt1Wg%3D&reserved=0">TPS Teachers Network</a> put together a collection of resources, “<a href="https://nam12.safelinks.protection.outlook.com/?url=https%3A%2F%2Ftpsteachersnetwork.org%2Falbum%2F104145-celebrating-black-history-month-the-african-american-jewish-community&data=05%7C02%7Cproffitm%40oclc.org%7C0b15cbd47a814f2d872708dc396da681%7C516a75d7dc984163a03ff918d2a2bc9a%7C0%7C0%7C638448390304778324%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=yGFn%2BeYYE8gVtvwIIDyo26d5rezILyIq9zdjHsNx64U%3D&reserved=0">Celebrating Black History Month and the African American Jewish Community</a>,” that looks at the complex relationship between these two communities.</p>
<p style="padding-left: 40px;">Although February is designated in the United States as “Black History Month” these resources provide content of continuing interest and value. To borrow playwright Tony Kushner’s phrase, I am “an intensely secular Jew” who has had a lifelong interest in and fascination with Jewish kinship to other groups. The TPS album brings together resources on topics ranging from entertainer Sammy Davis Jr. to Black Jewish rabbis, cantors, and congregations. <em>Contributed by Jay Weitz.</em></p>
<h2 class="wp-block-heading">ALA CORE Interest Group week sessions on DEI</h2>
<p><a href="https://www.ala.org/core/continuing-education/interest-group-week">ALA CORE Interest Group week</a> takes place the first week in March and features 30 different programs that are free for anyone to attend. The following IG sessions are particularly interesting to those working on DEI efforts:</p>
<ul>
<li>5 March 2024 2:00 p.m. CST: Conducting a Pilot for Library of Congress Demographic Group Terms Elizabeth Hobart, Interim Head of Cataloging and Metadata Services, Penn State (OCLC Symbol: UPM) – <a href="https://ala-events.zoom.us/webinar/register/WN_bOD0pDxfS4muS1P0CPSc_Q#/registration">Register</a></li>
<li>8 March 2024 – 10:00 a.m. CST: Homosaurus Usage in the OCLC Database: an Exploratory Analysis – Paromita Biswas (Continuing Resources Metadata Librarian), Amanda Mack (Cataloger in the Film & Television Archive), and Erica Zhang (Metadata Librarian for Open Access), UCLA (OCLC Symbols: CLU & UCFTA) – <a href="https://ala-events.zoom.us/webinar/register/WN_Z9IIqWyaSxmVfMuQRLhKqQ#/registration">Register</a></li>
<li>8 March 2024 – 3:00 p.m. CST Diversity Audits and the Role of Technical Services Staff”, presented by Rachel Fischer, Member Services Librarian for Technical Services, Cooperative Computer Services (CCS) (OCLC Symbol: JED) – <a href="https://ala-events.zoom.us/meeting/register/tJMocuGhrDooHtQo8jJM8N2R6eTFGgtK_q5u#/registration">Register</a></li>
</ul>
<p style="padding-left: 40px;">IG Week provides a good resource for everyone working in cataloging & metadata areas. In addition to the DEI-focused presentations, there will be lots of good learning opportunities around next-generation metadata, workflows, and professional development. <em>Contributed by </em><a href="https://www.oclc.org/research/people/urban-richard.html"><em>Richard J. Urban.</em></a></p>
<h2 class="wp-block-heading">Applying toponymic justice to library spaces</h2>
<p>Authors Natalia Fernández, Jane Nichols, and Diana Park explain toponymic justice was enacted in the renaming of a library classroom at the Oregon State University (OCLC Symbol: ORE)’s Valley Library. In the article, “<a href="https://www.inthelibrarywiththeleadpipe.org/2024/engaging-in-toponymic-justice/">Engaging in Toponymic Justice: Proactively Naming the Nishihara Family Classroom</a>” (posted 7 February 2024 in <em>In the Library with the Lead Pipe</em>), they characterize the renaming of the OSU library classroom as “proactive naming,” that results in a name reflecting values, an inspiring person or other meaningful name, regardless of funding. (The classroom was temporarily named “2nd Floor West Classroom.”) The name “Nishihara Family Classroom,” primarily after Janet Nishihara, Director of OSU’s Education Opportunities Program, but including “family” for Nishihara siblings who had been student workers at the OSU Libraries. The classroom’s door contains text that gives context to the classroom’s name within the library space: “This room is named in honor of the Nishihara Family for the dedication to student learning and success.”</p>
<p style="padding-left: 40px;">The authors place this OSU example in the context of the trend for many institutions to reevaluate their naming policies and change the names of existing spaces named after controversial people. However, their article is one part of a much larger and complex conversation about toponymic justice. The library classroom had a temporary generic name in 2019 when it opened. The renaming of the Louisiana State University (OCLC Symbol: LUU) main library building in 2020 from “Middleton Library” to “LSU Library” (a temporary name) is a more complex case. The library was named for Middleton in 1978 because of his accomplishment of having the new library built while he was university president. However, Middleton’s <a href="https://www.theadvocate.com/baton_rouge/news/education/troy-h-middletons-name-removed-from-lsu-library-hours-after-board-approval/article_a50055f4-b23a-11ea-847e-fb4e868ff514.html">pro-segregationist views</a> made keeping that name untenable. LSU’s decision to temporarily rename the library “LSU Library” may have been an imperfect solution, but it reflects the reality that removing a controversial name may be easier than providing a meaningful one. <em>Contributed by </em><a href="https://www.oclc.org/research/people/james-kate.html"><em>Kate James</em></a><em>.</em></p><p>The post <a href="https://hangingtogether.org/advancing-ideas-inclusion-diversity-equity-accessibility-5-march-2024/">Advancing IDEAs: Inclusion, Diversity, Equity, Accessibility, 5 March 2024</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Merrilee Proffitt
https://hangingtogether.org/
Lucidworks: Gen AI Transforms Search Experiences: 4 Takeaways
https://lucidworks.com/?p=28530
2024-03-04T22:09:46+00:00
<p>Learn about the power of generative AI in shaping search experiences in 2024 and beyond.</p>
<p>The post <a href="https://lucidworks.com/post/gen-ai-transforms-search-experiences-4-takeaways/">Gen AI Transforms Search Experiences: 4 Takeaways</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Lila Schoenfield
https://lucidworks.com/
HangingTogether: The OCLC Research Library Partnership and the art of gathering
https://hangingtogether.org/?p=14114
2024-03-04T15:56:25+00:00
<div class="wp-block-image">
<figure class="alignleft size-full is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/03/Director-Letter-Feb-2024.png"><img alt="Image is a collage made up of an abstract diagram, a close up of hands gesturing an a computer, and a word cloud" class="wp-image-14116" height="788" src="https://hangingtogether.org/wp-content/uploads/2024/03/Director-Letter-Feb-2024.png" style="width: 469px; height: auto;" width="940" /></a></figure></div>
<p>A new year always offers an opportunity to reflect on the past and plan intentionally for the year to come. Lately, my mind has been on the art of gathering, thanks to <a href="https://search.worldcat.org/title/1161986001">this inspiring book</a> by Priya Parker.</p>
<p>In looking forward and back, I recognize the value of connecting both in-person and virtually. Whether we gather in the same physical space or online, I always appreciate the unstructured moments where we check in with one another, sharing our current perspectives and pathways. Having the opportunities to learn from each other and experiencing the shared desire to care and connect creates positive energy.</p>
<p>The RLP team works to center the human connection in our programming and bring that flow of energy to our partner network.</p>
<h2 class="wp-block-heading"><strong>Metadata Managers: new vision and energy</strong></h2>
<p>The power of community and connection was a theme in our January kickoff meetings of the Metadata Managers Focus Group. Senior Program Officer Richard Urban has actively stewarded this group, and its Planning Group, which is tasked with re-envisioning focus and activities, honoring the expertise and time commitment of all those involved. We’re excited to welcome these new members:</p>
<ul>
<li>Liz Bodian, Metadata Technologies Librarian at Brandeis University</li>
<li>Susan Dahl, Director, Content Services at University of Calgary</li>
<li>Chingmy Lam, Manager, Metadata Services at University of Sydney Library</li>
<li>Chloe Misorski, Cataloging Librarian at Cleveland Museum of Art</li>
<li>Helen Williams, Metadata Manager at LSE Library</li>
</ul>
<p>Look for summaries coming soon from our January meetings, as well as announcements about upcoming focus group meetings, and related programming. See the most recent activities and full list of the <a href="https://www.oclc.org/research/areas/data-science/metadata-managers.html">Metadata Managers Planning Group</a>.</p>
<h2 class="wp-block-heading"><strong>SHARES delivers value</strong></h2>
<p>The <a href="https://www.oclc.org/research/activities/shares/workgroups.html#execgroup">SHARES Executive Group</a> is another dynamic group in the OCLC RLP, lending expertise and strategic vision to our efforts. We are pleased to welcome four new SHARES Executive Group members: </p>
<ul>
<li>Marilyn Creswell, University of Michigan Law School </li>
<li>Vicky Flood, University of Manchester</li>
<li>Sylvie Larsen, University of Pennsylvania </li>
<li>Kerry Kristine McElrone, Swarthmore College </li>
</ul>
<p>Senior Program Officer Dennis Massie continues to energize the <a href="https://www.oclc.org/research/partnership/shares-intro.html">SHARES community</a> with his weekly town halls (190 and counting). Recently, he engaged specifically with SHARES institutions in the UK and Ireland to gain insights into collection sharing challenges there and to discuss patterns, trends, and opportunities indicated by analysis of their FY23 ILL activity. The numbers demonstrate that each of these UK and Irish institutions draws significant value from SHARES participation, especially borrowing physical items; the numbers also reveal there are multiple viable approaches to utilizing SHARES for maximum benefit depending upon your situation. </p>
<h2 class="wp-block-heading"><strong>It’s all about the data</strong></h2>
<p>In each of our areas of focus, we see the growing need to use data to better understand impact, recognize trends, and make better decisions. Earlier this month, we hosted <a href="https://hangingtogether.org/exploring-the-challenges-and-opportunities-of-research-data-management-rdm/">a facilitated discussion on data-driven decision making in libraries</a>, exploring how insights can support research information management, collection management, and the library’s value proposition to institutional stakeholders. This session was part of the <a href="https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023">OCLC and LIBER joint series</a>, “Building for the future: Opportunities and responsibilities for state-of-the-art services.” <br /><br />Our next facilitated discussion session will take place Wednesday, 17 April 2024, focusing on AI, machine learning, and data science, and <a href="https://www.oclc.org/oclc-forms/en/events/2024/liber-webinar-ai-machine-learning-data-science.html?_gl=1*1i0l7ua*_gcl_au*MTMyMTM2OTE0My4xNzA3NzUwNzI1">you can join us</a>. </p>
<h2 class="wp-block-heading"><strong>RLP Leadership Roundtables</strong></h2>
<p>I’ve mentioned our exemplary Metadata Managers and SHARES groups, which offer a welcoming venue for connection and deep participation and the opportunity to influence RLP and OCLC Research programming. I’m excited to share that we’re launching <a href="https://www.oclc.org/research/partnership/engagement/rlp-leadership-roundtables.html">new leadership roundtable discussions</a> that will address challenges in the key areas of research support and special collections, particularly as they relate to the large ecosystem of changes we face today: staffing demographics, impact assessment, and collaboration opportunities.</p>
<p>The RLP draws from a unique, international mix of independent, academic, national, and museum libraries, and we know there is value in bringing our partners together to learn from each other. Senior Program Officers Rebecca Bryant (research support) and Chela Scott Weber (archives and special collections) moderate the discussions.</p>
<h2 class="wp-block-heading"><strong>Benefits of participation</strong></h2>
<p>It’s a great privilege to meet with peers. Whether an event is virtual or in person, some of the most compelling moments occur during the more social, less structured times where we can reconnect individually with peers and colleagues and see the world from their unique perspectives. Learning about the wide variety of local challenges and opportunities that you face is essential for us to synthesize the common themes and recognize “scalable moments” where we can devote energy and attention to shed light on the current landscape.</p>
<p>Our leadership networks, workshops, and other virtual convenings that center person-to-person interaction and leverage our peer network are what make RLP programs so vibrant and enriching. In our busy world, it can be hard to extend beyond a familiar professional peer group. Let RLP help ease the process of expanding your organizational networks through a variety of programming and interest groups, each with different levels of time and attention investment.</p>
<h2 class="wp-block-heading"><strong>Looking forward</strong></h2>
<p>The talented <a href="https://www.oclc.org/research/people/rlp.html">RLP team of program officers,</a> along with the broader OCLC Research team, are eager to extend our work, hosting conversations—and <em>gathering</em>—learning as we go and growing as a community.</p>
<p>Thank you for your time and attention, but most of all, your continued support.<br /><br /></p>
<p>The post <a href="https://hangingtogether.org/the-oclc-research-library-partnership-and-the-art-of-gathering/">The OCLC Research Library Partnership and the art of gathering</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Rachel Frick
https://hangingtogether.org/
Artefacto: Show your work – finding & creating inspiration in the library sector
https://www.artefacto.org.uk/?p=1609
2024-03-04T09:32:53+00:00
We love seeing case studies, posts and articles from libraries and library staff celebrating their wins, whether big and small. But we also learn a lot from libraries sharing their experiences when things don’t necessarily go to plan. And we’ve learnt from publishing more than 230 issues of our library newsletter that library professionals also [...]<p><a class="understrap-read-more-link" href="https://www.artefacto.org.uk/show-your-work-inspiration-in-the-library-sector/">Continue Reading...</a></p>
<p><a href="https://www.artefacto.org.uk/show-your-work-inspiration-in-the-library-sector/" rel="nofollow">Source</a></p>
Artefacto
https://www.artefacto.org.uk
Digital Library Federation: DLF Digest: March 2024
https://www.diglib.org/?p=30786
2024-03-01T14:00:49+00:00
<div class="wp-block-image"><img alt="DLF Digest logo: DLF logo at top center "Digest" is centered. Beneath is "Monthly" aligned left and "Updates" aligned right." class="wpa-warning wpa-suspicious-alt aligncenter size-medium wp-image-26917" height="159" src="https://www.diglib.org/wp-content/uploads/sites/3/2023/08/DLFdigest2023-300x159.png" width="300" /></div>
<div></div>
<div>
<p><i><span style="font-weight: 400;">A monthly round-up of news, upcoming </span></i><a href="https://www.diglib.org/groups/"><i><span style="font-weight: 400;">working group</span></i></a><i><span style="font-weight: 400;"> meetings and events, and </span></i><a href="https://www.clir.org/"><i><span style="font-weight: 400;">CLIR</span></i></a><i><span style="font-weight: 400;"> program updates from the </span></i><a href="https://www.diglib.org/"><i><span style="font-weight: 400;">Digital Library Federation</span></i></a><span style="font-weight: 400;">. <a href="https://www.diglib.org/category/dlf-digest/"><em>See all past Digests here</em></a>. </span></p>
<p><span style="font-weight: 400;"><br />
Hello DLF community – it’s March! Last month we had the pleasure of opening and receiving a lot of proposals for our </span><a href="https://forum2024.diglib.org"><span style="font-weight: 400;">in-person DLF Forum happening in Michigan in July</span></a><span style="font-weight: 400;">. Keep your eye out for community voting opening early next week, where you can see submitted proposals and vote for your favorites. DLF working groups were also busy last month and made some great plans for the future. Make sure to stay up to date on group meetings by reviewing the </span><a href="https://digital-conferences-calendar.info/"><span style="font-weight: 400;">DLF Community Calendar</span></a><span style="font-weight: 400;">.</span></p>
<p><span style="font-weight: 400;">-Team DLF</span></p>
<h2><span style="font-weight: 400;">This month’s news:</span></h2>
<ul>
<li><b>Applications Open: </b><a href="https://www.clir.org/recordings-at-risk/apply-for-an-award/"><span style="font-weight: 400;">CLIR invites applications</span></a><span style="font-weight: 400;"> from collecting organizations for the digital reformatting of audio and audiovisual materials for the </span><a href="https://www.clir.org/recordings-at-risk/"><span style="font-weight: 400;">Recordings at Risk</span></a><span style="font-weight: 400;"> grant program now through April 17, 2024.</span></li>
<li><b>Learn IIIF Basics:</b><span style="font-weight: 400;"> An upcoming 5-day workshop (March 18-22) will teach attendees the basics about the International Image Interoperability Framework (IIIF). No prior knowledge of IIIF is required. </span><a href="https://www.eventbrite.com/e/march-2024-iiif-online-training-5-day-course-tickets-795358006207"><span style="font-weight: 400;">Learn more about the workshop and register.</span></a></li>
<li><b>Register: </b><a href="https://iiif.io/event/2024/los-angeles/?mc_cid=eeb66a2c05&mc_eid=e802b62d11#register"><span style="font-weight: 400;">Registration is now open</span></a><span style="font-weight: 400;"> for the IIIF Annual Conference, to be held in Los Angeles, June 4-7. </span></li>
<li><b>Training Opportunities Available: </b><span style="font-weight: 400;">IIIF is hosting a series of free online training sessions throughout March and April. </span><a href="https://iiif.io/news/2024/02/09/world-training/"><span style="font-weight: 400;">Learn more about the World Training Events and register.</span></a></li>
<li><b>Register:</b> <a href="https://netpreserve.org/ga2024/registration/"><span style="font-weight: 400;">Registration is now open</span></a><span style="font-weight: 400;"> for the International Internet Preservation Consortium’s (IIPC) General Assembly and Web Archiving Conference in Paris, April 24-26.<br />
</span></li>
</ul>
<h2><span style="font-weight: 400;">This month’s open DLF group meetings:</span></h2>
<p><span style="font-weight: 400;">For the most up-to-date schedule of DLF group meetings and events (plus NDSA meetings, conferences, and more), bookmark the </span><a href="https://www.diglib.org/opportunities/calendar/"><span style="font-weight: 400;">DLF Community Calendar</span></a><span style="font-weight: 400;">. Can’t find meeting call-in information? Email us at </span><a href="mailto:info@diglib.org"><span style="font-weight: 400;">info@diglib.org</span></a><span style="font-weight: 400;">. Reminder: Team DLF working days are Monday through Thursday.</span></p>
<ul>
<li style="font-weight: 400;"><b>Born-Digital Access Working Group (BDAWG):</b><span style="font-weight: 400;"> Tuesday, March 5, 2pm ET / 11am PT.</span></li>
<li style="font-weight: 400;"><b>Digital Accessibility Working Group (DAWG): </b><span style="font-weight: 400;">Wednesday, March 6, 2pm ET / 11am PT. </span></li>
<li style="font-weight: 400;"><b>Assessment Interest Group (AIG) Cultural Assessment Working Group (CAWG): </b><span style="font-weight: 400;">Monday, March 11, 2pm ET/11am PT.</span></li>
<li style="font-weight: 400;"><b>AIG Cost Assessment Working Group: </b><span style="font-weight: 400;">Monday, March 11, 3pm ET/12pm PT.</span></li>
<li style="font-weight: 400;"><b>AIG Metadata Assessment Working Group: </b><span style="font-weight: 400;">Thursday, March 14, 1:15pm ET / 10:15am PT.</span></li>
<li style="font-weight: 400;"><b>AIG User Experience Working Group:</b><span style="font-weight: 400;"> Friday, March 15, 11am ET / 8am PT. </span></li>
<li style="font-weight: 400;"><b>Committee for Equity and Inclusion (CEI):</b><span style="font-weight: 400;"> Monday, March 25, 3pm ET / 12pm PT. </span></li>
<li style="font-weight: 400;"><b>Climate Justice Working Group: </b><span style="font-weight: 400;">Wednesday, March 27, 12pm ET / 9am PT. </span></li>
<li style="font-weight: 400;"><b>AIG Metadata Working Group: </b><span style="font-weight: 400;">Thursday, March 28, 1:15pm ET / 10:15am PT.</span></li>
<li style="font-weight: 400;"><b>DAWG Policy & Workflows subgroup:</b><span style="font-weight: 400;"> Friday, March 29, 1pm ET / 11am PT. </span></li>
</ul>
<p><i><span style="font-weight: 400;">DLF groups are open to ALL, regardless of whether or not you’re affiliated with a </span></i><a href="https://www.diglib.org/about/members/"><i><span style="font-weight: 400;">DLF member organization</span></i></a><i><span style="font-weight: 400;">. </span></i><a href="https://www.diglib.org/groups/"><i><span style="font-weight: 400;">Learn more about our working groups on our website</span></i></a><i><span style="font-weight: 400;">. Interested in scheduling an upcoming working group call or reviving a </span></i><a href="https://www.diglib.org/groups/past/"><i><span style="font-weight: 400;">past group</span></i></a><i><span style="font-weight: 400;">? </span></i><a href="https://www.diglib.org/dlf-organizers-toolkit/"><i><span style="font-weight: 400;">Check out the DLF Organizer’s Toolkit</span></i></a><i><span style="font-weight: 400;">. As always, feel free to get in touch at </span></i><a href="mailto:info@diglib.org"><i><span style="font-weight: 400;">info@diglib.org</span></i></a><i><span style="font-weight: 400;">. </span></i></p>
<h2><span style="font-weight: 400;">Get Involved / Connect with Us</span></h2>
<p><span style="font-weight: 400;">Below are some ways to stay connected with us and the digital library community: </span></p>
<ul>
<li style="font-weight: 400;"><a href="https://share.hsforms.com/1MhcafbpARxGCIS1OQD6rKgc21y3"><b>Subscribe</b><span style="font-weight: 400;"> to the DLF Forum newsletter</span></a><span style="font-weight: 400;">.</span></li>
<li style="font-weight: 400;"><b>Join, start, or revive</b><span style="font-weight: 400;"> a working group and </span><a href="https://wiki.diglib.org/Main_Page"><span style="font-weight: 400;">browse their work on the DLF Wiki</span></a><span style="font-weight: 400;">.</span></li>
<li style="font-weight: 400;"><a href="https://lists.clir.org/cgi-bin/wa?A0=DLF-ANNOUNCE"><b>Subscribe</b><span style="font-weight: 400;"> to our community listserv, DLF-Announce</span></a><span style="font-weight: 400;">.</span></li>
<li style="font-weight: 400;"><a href="https://digital-conferences-calendar.info/"><b>Bookmark</b><span style="font-weight: 400;"> our Community Calendar</span></a><span style="font-weight: 400;">.</span></li>
<li style="font-weight: 400;"><a href="https://www.diglib.org/about/join/"><b>Learn more</b><span style="font-weight: 400;"> about becoming a DLF member organization</span></a><span style="font-weight: 400;">. </span></li>
<li style="font-weight: 400;"><b>Follow us</b><span style="font-weight: 400;"> on </span><a href="https://www.instagram.com/clirdlf/"><span style="font-weight: 400;">Instagram</span></a><span style="font-weight: 400;">, </span><a href="https://www.facebook.com/CLIRDLF/"><span style="font-weight: 400;">Facebook</span></a><span style="font-weight: 400;">, </span><a href="https://www.linkedin.com/company/digital-library-federation/"><span style="font-weight: 400;">LinkedIn</span></a><span style="font-weight: 400;">, </span><a href="https://www.youtube.com/user/DLFCLIR"><span style="font-weight: 400;">YouTube</span></a><span style="font-weight: 400;">, </span><a href="https://twitter.com/CLIRDLF"><span style="font-weight: 400;">and X</span></a><span style="font-weight: 400;">. </span></li>
<li style="font-weight: 400;"><b>Contact us</b><span style="font-weight: 400;"> at </span><a href="mailto:info@diglib.org"><span style="font-weight: 400;">info@diglib.org</span></a><span style="font-weight: 400;">.</span></li>
</ul>
</div>
<p>The post <a href="https://www.diglib.org/dlf-digest-march-2024/" rel="nofollow">DLF Digest: March 2024</a> appeared first on <a href="https://www.diglib.org" rel="nofollow">DLF</a>.</p>
Aliya Reich
https://www.diglib.org
Mita Williams: Boolean is Dead AND I feel fine
https://librarian.aedileworks.com/?p=1448
2024-02-29T19:34:57+00:00
Leonard Bernstein!
Mita Williams
https://librarian.aedileworks.com
Harvard Library Innovation Lab: The Cloud
https://lil.law.harvard.edu/blog/2024/02/29/the-cloud/
2024-02-29T00:00:00+00:00
<p><img alt="The moment the cloud explodes" src="https://lil-blog-media.s3.amazonaws.com/exploding-cloud.webp" /></p>
<p>How do you make the invisible visible? This is the central premise of The Cloud, a project that I’ve been working on as a technologist in residence at LIL. The idea originated as a visual joke about the cloud: the vaporous metaphor we use to describe the distributed servers that host remotely-run software and infrastructure.</p>
<p>Initially, I was curious about the effects of the applications and services LIL runs (see <a href="https://lil.law.harvard.edu/blog/2024/02/08/the-cost-of-a-digital-archive/">this post</a> for a more detailed look at the cost of Perma.cc), and thought it would be interesting to visualize when users were interacting with one of our apps. From there the idea grew and melded with my other interests: what if, rather than just notifying us that users were interacting with us, we could be reminded of something specific, such as the carbon emissions of an action? What would that look like?</p>
<p>Obviously, it had to include a cloud. And when do you see clouds? In a thunderstorm! What if a cloud emitted little lightning strikes every time someone created a Perma Link?</p>
<p><img alt="A cloud with a lightning bolt descending from it." src="https://lil-blog-media.s3.amazonaws.com/LightningCloud.webp" /></p>
<p>That seemed a little too hard.
What if instead, it rained?</p>
<p><img alt="A cloud in the middle of a rain shower." src="https://lil-blog-media.s3.amazonaws.com/RainyCloud.webp" /></p>
<p>I pursued this idea a bit, finding resources to create a smart water pump that I could program to respond to our API. But I hesitated–all that water around all those devices–the idea seemed too wet to execute.</p>
<p>What if we found a way to represent rain that wasn’t literal rain? That could work. We could use LEDs, and program them to look like rain was falling every time someone created a new link.</p>
<p><img alt="A cloud against a screen of LEDs." src="https://lil-blog-media.s3.amazonaws.com/LEDCloud-small.webp" /></p>
<p>This seemed like a physically reasonable project, but it wasn’t quite hitting the mark. Why was the cloud raining every time someone hit our API? What did that have to do with carbon emissions and climate change?</p>
<p>Instead, I decided to simplify the conceit. A user scrolls through various digital activities (such as a Google search or mining bitcoin) and their associated carbon footprints, culled from various sources.</p>
<figure>
<img alt="Four screenshots of the Cloud application, showing different amounts of carbon emissions on the screen." src="https://lil-blog-media.s3.amazonaws.com/sample-cloud.webp" />
Four screenshots from the Cloud app
</figure>
<p>The cloud responds by growing at each step, until it ultimately explodes.</p>
<div class="embed-container">
</div>
<p>I was lucky enough to be able to try it out on students, staff, and faculty at Harvard Law School’s Caspersen Student Center.</p>
<p><img alt="The cloud application and attachment set up in the Casperson Student Center." src="https://lil-blog-media.s3.amazonaws.com/cloud_display.webp" /></p>
<p><img alt="Looking over the shoulder of a student using the app." src="https://lil-blog-media.s3.amazonaws.com/Overshoulder.webp" /></p>
<p><img alt="Two smiling students interact with the Cloud app." src="https://lil-blog-media.s3.amazonaws.com/student_1.webp" /></p>
<p><img alt="A student interacts with the Cloud app." src="https://lil-blog-media.s3.amazonaws.com/student_2.webp" /></p>
<p>Running the pop-up events provided invaluable insights. Some users were shocked and said that they had never previously considered the carbon impact their digital lives might have. There were those who were simply delighted by the project and found the cloud itself a relatively benign presence. Others found the cloud and its conceit somewhat terrifying–especially as it grew larger and appeared to be on the edge of bursting. Many wanted to touch it, and most wanted to know what steps they could take to lower their personal footprints. This is a tough question to answer given that the sources of impact are so diffuse, but some suggestions include extending the life of your electronics and avoiding passive consumption. Putting pressure on companies to reveal the environmental impacts of their products could also be an effective tactic because having that information would enable users to make more informed choices about their purchases. The takeaways from the pop-ups confirmed my suspicion: that this is an area ripe for further education.</p>
<p>I hope to bring the Cloud to other locations (such as libraries), to continue to use it as an education tool. The code itself is available in a <a href="https://github.com/harvard-lil/the-cloud">GitHub repo</a> with instructions for how to build the cloud attachment for anyone who wants to create their own.</p>
<p><img alt="A glowing cloud floats above the Seattle skyline." src="https://lil-blog-media.s3.amazonaws.com/Cloud2-print.webp" /></p>
<p><strong>Special thanks to</strong>:</p>
<ul>
<li>Amitabh Shrivastava</li>
<li>Ben Steinberg</li>
<li>Greg Leppert</li>
<li>Tal Nagourney</li>
</ul>
Rebecca Kilberg
https://lil.law.harvard.edu/blog/
Open Knowledge Foundation: And the winners of the Open Data Day 2024 Mini-Grants are. . .
https://blog.okfn.org/?p=29212
2024-02-28T11:19:58+00:00
<div class="wp-block-image"><figure class="aligncenter size-large is-resized"><a href="https://blog.okfn.org/wp-content/files/2024/02/odd-mini-grants-results.png"><img alt="" class="wp-image-29257" height="291" src="https://blog.okfn.org/wp-content/files/2024/02/odd-mini-grants-results-1024x676.png" width="442" /></a></figure></div>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p>We at the <a href="https://okfn.org/" rel="noreferrer noopener" target="_blank">Open Knowledge Foundation</a> (OKFN) are excited to announce the list of organisations that have been awarded mini-grants to help them host <a href="http://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day</a> (ODD) events and activities across the world. </p>
<p><strong>Our team received a total of 305 applications</strong> and was greatly impressed by the quality of the event proposals. In 2024, we are running <a href="https://blog.okfn.org/2024/01/24/open-data-day-2024-mini-grants-open-call/" rel="noreferrer noopener" target="_blank">two separate calls</a> to accommodate the diverse interests in our community. The first call was for the general community, and the second was specifically for activities related to open mapping.</p>
<div class="wp-block-spacer" style="height: 40px;"></div>
<hr class="wp-block-separator" />
<div class="wp-block-spacer" style="height: 40px;"></div>
<h2>General Mini-Grant Winners</h2>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p><strong>This call was open to any practices and disciplines carried out by open data communities around the world</strong> – such as hackathons, tool demos, artificial intelligence, climate emergency, digital strategies, open government, citizen participation, automation, monitoring, etc. <strong>A total of 18 events will receive</strong> a<strong> grant amount of USD 300 each</strong>, thanks to the sponsorship of <a href="https://okfn.org/en/gambia/" rel="noreferrer noopener" target="_blank">Jokkolabs Banjul</a> (Gambia), <a href="https://okfn.org/en/" rel="noreferrer noopener" target="_blank">Open Knowledge Foundation</a> (OKFN), <a href="https://okfn.de/" rel="noreferrer noopener" target="_blank">Open Knowledge Germany</a>, <a href="https://www.datopian.com/" rel="noreferrer noopener" target="_blank">Datopian</a> and <a href="https://linkdigital.com.au/" rel="noreferrer noopener" target="_blank">Link Digital</a>.</p>
<figure class="wp-block-image size-large"><a href="https://blog.okfn.org/wp-content/files/2024/01/sponsors-general-2.png"><img alt="" class="wp-image-29168" height="143" src="https://blog.okfn.org/wp-content/files/2024/01/sponsors-general-2-1024x143.png" width="1024" /></a></figure>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p>Here are the winning proposals by country, in alphabetical order:</p>
<div class="wp-block-spacer" style="height: 30px;"></div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://twitter.com/meninasdageo" rel="noreferrer noopener" target="_blank">Meninas da Geo</a></strong></p>
<p class="has-medium-font-size"><img alt="🇧🇷" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e7-1f1f7.png" style="height: 1em;" /> Brazil</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Belém<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“SDGs in the Amazon”</strong> – Understanding the impact of open data in the Amazon, the site of COP30 in 2025.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://facebook.com/DRMAfrica" rel="noreferrer noopener" target="_blank">Disaster Risk Management in Africa – DRM Africa</a></strong></p>
<p class="has-medium-font-size"><strong><img alt="🇨🇩" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e8-1f1e9.png" style="height: 1em;" /> </strong>Democratic Republic of the Congo</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Goma<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Open Data for climate risk-informed societies”</strong> – Leveraging open data to mitigate adverse effects of lakes and sea level rise in the African great lakes region.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://women4sustainability.org/" rel="noreferrer noopener" target="_blank">Women for Sustainability Africa</a></strong></p>
<p><img alt="🇬🇭" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1ec-1f1ed.png" style="height: 1em;" /> Ghana<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“EcoSolutions: Harnessing Tools for Climate Resilience”</strong> – To empower participants with the knowledge and skills to navigate the diverse array of tools available for addressing climate challenges.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.dialogos.org.gt" rel="noreferrer noopener" target="_blank">Diálogos A.C.</a></strong></p>
<p class="has-medium-font-size"><img alt="🇬🇹" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1ec-1f1f9.png" style="height: 1em;" /> Guatemala</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Guatemala City<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Data and Drinks: Girls at the table”</strong> – To “put on the table” the importance of data with a gender perspective to move towards gender equality, contribute to closing gender gaps and overcoming gender stereotypes.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.sweadindia.org" rel="noreferrer noopener" target="_blank">Society for Women’s Education and Awareness Development (SWEAD)</a></strong></p>
<p class="has-medium-font-size"><img alt="🇮🇳" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1ee-1f1f3.png" style="height: 1em;" /> India</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Cuddalore<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Village Leaders Conclave: Navigating the Climate Crisis with Open Data”</strong> – To empower 100 elected village-level leaders in Cuddalore district, Tamil Nadu, India, with the knowledge and tools to address the climate emergency through open data-driven decision-making and collaborative local action.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://mdi.ac.in/" rel="noreferrer noopener" target="_blank">Management Development Institute</a></strong></p>
<p class="has-medium-font-size"><img alt="🇮🇳" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1ee-1f1f3.png" style="height: 1em;" /> India</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Gurgaon<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“NoCode-LowCode GeoAI Workshop for Sustainable Climate Action”</strong> – To empower participants with tools for low carbon economy and meaningful climate action, fostering innovation and collaboration through multi-modal open data and open source software such as KNIME.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://labmgf.dica.polimi.it/" rel="noreferrer noopener" target="_blank">Politecnico di Milano, Department of Civil and Environmental Engineering</a></strong></p>
<p class="has-medium-font-size"><img alt="🇮🇹" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1ee-1f1f9.png" style="height: 1em;" /> Italy</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Milan<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Mapping Climate Change in 4D: Belvedere Glacier’s Open Geo Data for Education and Research”</strong> – To conduct an innovative teaching workshop dedicated at familiarizing students of the GIS course with raster data and point cloud processing using real data from an alpine glacier, which is experiencing an extreme retreat due to climate change.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.facebook.com/pastoralistpeoplesinitiative?mibextid=ZbWKwL" rel="noreferrer noopener" target="_blank">Pastoralist Peoples’ Initiative</a> </strong></p>
<p class="has-medium-font-size"><strong><img alt="🇰🇪" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f0-1f1ea.png" style="height: 1em;" /> </strong>Kenya</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Marsabit<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Empowering Young Changemakers: Harnessing Open Data & Indigenous Knowledge for Climate Action”</strong> – To empower young people from Kenyan pastoralist communities in Marsabit County to address the climate emergency using open data and indigenous knowledge.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://github.com/CodeForAfrica/" rel="noreferrer noopener" target="_blank">Code for Africa</a> </strong></p>
<p class="has-medium-font-size"><strong><img alt="🇰🇪" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f0-1f1ea.png" style="height: 1em;" /> </strong>Kenya</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Nairobi<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Open Data for Environmental Monitoring”</strong> – To celebrate and promote the impact of open data in environmental monitoring, through showcasing the impact of sensors.AFRICA’s citizen science initiative and inspiring participants to explore and innovate with open data for climate-resilient cities in Africa.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://twitter.com/OpenTechNP" rel="noreferrer noopener" target="_blank">Open Tech Community</a></strong></p>
<p class="has-medium-font-size"><img alt="🇳🇵" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f3-1f1f5.png" style="height: 1em;" /> Nepal</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Dhumbarahi<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Open Data for Green & Circular Economy”</strong> – To discuss and create visualisations/stories regarding the use of open data in green and circular economy. The event is going to be in a workshop style where there will be participants from college/university clubs.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><a href="https://web.facebook.com/resourceconnects" rel="noreferrer noopener" target="_blank"><strong>Resource Connects for Education Initiative</strong></a></p>
<p class="has-medium-font-size"><strong><img alt="🇳🇬" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f3-1f1ec.png" style="height: 1em;" /> </strong>Nigeria</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Sokoto<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Let’s Count 4SDGs”</strong> – To enhance community awareness and engagement in best practices for Open Data in achieving the Sustainable Development Goals (SDGs).</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://inspireit.com.ng/" rel="noreferrer noopener" target="_blank">Technology for Inspiration Initiative – InspireIT</a></strong></p>
<p class="has-medium-font-size"><strong><img alt="🇳🇬" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f3-1f1ec.png" style="height: 1em;" /> </strong>Nigeria</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Owerri<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Climate-Induced Displacement: Understanding Impacts on African Women through Open Data”</strong> – To raise awareness, facilitate informed discussions, and propose data-driven strategies to address the unique challenges faced by African women affected by climate-induced displacement, leveraging open data for better understanding and sustainable solutions.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://meta.wikimedia.org/wiki/Learnovation_Network_Foundation" rel="noreferrer noopener" target="_blank">Learnovation Network Foundation (LNF)</a></strong></p>
<p class="has-medium-font-size"><strong><img alt="🇳🇬" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f3-1f1ec.png" style="height: 1em;" /> </strong>Nigeria</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Ilorin<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Wikidata Loves SDGs Nigeria”</strong> – To harness the power of Wikidata to open up and update data related to SDGs, including fields of work, targets, indicators, organisations working on SDG topics, organisations whose field of work encompasses any of the SDGs, and SDG advocates in Nigeria.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.facebook.com/profile.php?id=100089495294944" rel="noreferrer noopener" target="_blank">MUMSA Initiative</a></strong></p>
<p class="has-medium-font-size"><strong><img alt="🇳🇬" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f3-1f1ec.png" style="height: 1em;" /> </strong>Nigeria</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Ningi<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Hacking for Healthy Food & Green Futures: An Open Data Challenge for Ningi Youth”</strong> – To empower Ningi youth to use open data to develop innovative solutions for food security, mental health, and climate change, contributing to SDGs 2, 3, and 13.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://digitalgrassroots.org/" rel="noreferrer noopener" target="_blank">Digital Grassroots</a></strong></p>
<p class="has-medium-font-size"><strong><img alt="🇳🇬" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f3-1f1ec.png" style="height: 1em;" /> </strong>Nigeria</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Zaria<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Open Data as a Human Right Workshop: Empowering Law Students for Sustainable Development”</strong> – Empower law students by framing open data as a fundamental human right, exploring its intersection with digital rights, and highlighting its role in advancing sustainable development.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://ruralaiduganda.org/" rel="noreferrer noopener" target="_blank">Rural Aid Foundation</a></strong></p>
<p class="has-medium-font-size"><img alt="🇺🇬" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1fa-1f1ec.png" style="height: 1em;" /> Uganda</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Kibaale<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Empowering migrant and refugee women to use open data to hold duty bearers accountable for quality sexual reproductive health services”</strong> – Mobilize and orient a pool of 60 rural migrant refugee women and girls and 10 women-led community organisations on the concept of open data and how to use open data to hold duty bearers accountable in providing quality SRHR services.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="http://detroitography.com" rel="noreferrer noopener" target="_blank">DETROITography</a></strong></p>
<p class="has-medium-font-size"><img alt="🇺🇸" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1fa-1f1f8.png" style="height: 1em;" /> United States</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Detroit<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Neighborhood Data Discovery”</strong> – The event will focus on presenting neighborhood-level data on SDG indicators for participants to learn about and explore as well as an opportunity to provide feedback and future direction to measuring SDGs at the neighborhood level.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://utopix.cc/serie/femicidios/" rel="noreferrer noopener" target="_blank">UTOPIX Femicide Monitor</a></strong></p>
<p class="has-medium-font-size"><img alt="🇻🇪" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1fb-1f1ea.png" style="height: 1em;" /> Venezuela</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Caracas<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Bootcamp INFOTOPIA version 2.0: Learning how to monitor and infographics gender-based violence”</strong> – Training for organizations to monitor and visualise open data on gender-based violence in the Capital District.</p>
</div>
</div>
<hr class="wp-block-separator" />
<div class="wp-block-spacer" style="height: 30px;"></div>
<h2>Open Mapping Mini-Grant Winners</h2>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p><strong>This call was specifically seeking to promote events related to open mapping</strong> – such as the use and promotion of geodata, mapathons, environmental monitoring, disaster response, community mapping, land productivity analysis, etc. <strong>A total of 8 open mapping events will receive a grant amount of USD 300 each</strong>, thanks to the sponsorship of <a href="https://www.hotosm.org/" rel="noreferrer noopener" target="_blank">Humanitarian OpenStreetMap</a> (HOT).</p>
<figure class="wp-block-image size-large"><a href="https://blog.okfn.org/wp-content/files/2024/01/sponsors-mapping-2.png"><img alt="" class="wp-image-29171" height="143" src="https://blog.okfn.org/wp-content/files/2024/01/sponsors-mapping-2-1024x143.png" width="1024" /></a></figure>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p>Here are the winning proposals by country, in alphabetical order:</p>
<div class="wp-block-spacer" style="height: 30px;"></div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://sites.google.com/view/youthmappersufba/p%C3%A1gina-inicial" rel="noreferrer noopener" target="_blank">YouthMappers UFBA</a></strong></p>
<p class="has-medium-font-size"><img alt="🇧🇷" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e7-1f1f7.png" style="height: 1em;" /> Brazil</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Salvador<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Communities Mapping Communities: Brazil-Africa Connection”</strong> – Empower vulnerable communities in Brazil and Africa through the exchange of knowledge facilitated by open mapping data.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://github.com/GeOsmFamily" rel="noreferrer noopener" target="_blank">GeOsm Family</a></strong></p>
<p class="has-medium-font-size"><img alt="🇨🇲" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e8-1f1f2.png" style="height: 1em;" /> Cameroon</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Yaoundé<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Build a mapping community for kids”</strong> – Children’s initiation to mapping and territories geolocalisation.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://rumbo.digital/" rel="noreferrer noopener" target="_blank">Asociación Rumbo Digital</a></strong></p>
<p class="has-medium-font-size"><img alt="🇨🇴" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e8-1f1f4.png" style="height: 1em;" /> Colombia</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Bogotá<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Environmental Mapping: Collecting Colombia’s biodiversity data through urban trees”</strong> – Introduce new mappers to open data collection applied to urban biodiversity with OSM notes.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.facebook.com/youthmappersUAO?_rdc=1&_rdr" rel="noreferrer noopener" target="_blank">YouthMappersUAO</a></strong></p>
<p class="has-medium-font-size"><img alt="🇨🇮" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e8-1f1ee.png" style="height: 1em;" /> Cote d’Ivoire</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Bouaké<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“WaterPointMapping”</strong> – To produce a participatory map of the areas where agro-pastoralists have access to water in the dry season, in order to improve the OpenStreetMap database quantitatively and qualitatively.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.facebook.com/profile.php?id=100087635536676" rel="noreferrer noopener" target="_blank">Media Sensitive to Disasters – MSD Network</a></strong></p>
<p class="has-medium-font-size"><img alt="🇨🇩" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1e8-1f1e9.png" style="height: 1em;" /> Democratic Republic of the Congo</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Goma<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Forced displacements open mapping”</strong> – Identify newly established displaced camps in eastern Congo war-torn regions for humanitarian assistance.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.instagram.com/youthmappers.upi/" rel="noreferrer noopener" target="_blank">UPI YouthMappers</a></strong></p>
<p class="has-medium-font-size"><img alt="🇮🇩" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1ee-1f1e9.png" style="height: 1em;" /> Indonesia</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Bandung<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Bus-friendly: mapping participation guiding blocks & halt for equality of public transportation users and engage disabled voices”</strong> – Evaluate the condition of the Non-BRT Trans Metro Pasundan halt in terms of accessibility for people with disabilities and marginalised people (SDGs 11 & 10) using OSM & Wikipedia.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://www.facebook.com/UPRIYouthMappers/" rel="noreferrer noopener" target="_blank">UP Resilience Institute YouthMappers</a></strong></p>
<p class="has-medium-font-size"><img alt="🇵🇭" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f5-1f1ed.png" style="height: 1em;" /> Philippines</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Quezon City<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“PedalMap: Engaging Biking Communities in Open Mapping for Sustainable Development Goals”</strong> – Through a combination of remote mapping and field sessions, we plan to collaborate with biking communities to collect 360 satellite images through Mapillary and enhance biking-related OpenStreetMap (OSM) data.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column">
<p class="has-medium-font-size"><strong><a href="https://ecomappers.org/" rel="noreferrer noopener" target="_blank">EcoMappers</a></strong></p>
<p class="has-medium-font-size"><img alt="🇷🇼" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f1f7-1f1fc.png" style="height: 1em;" /> Rwanda</p>
<p class="has-small-font-size"><img alt="📍" class="wp-smiley" src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4cd.png" style="height: 1em;" /> Kigali City<br /><strong> </strong></p>
</div>
<div class="wp-block-column">
<p><strong>“Mapping Nyarugenge High-Risk Zone for Disaster Preparedness”</strong> – To map disaster-prone areas and provide training to youth on mapping tools for effective disaster response.</p>
</div>
</div>
<div class="wp-block-columns">
<div class="wp-block-column"></div>
</div>
<hr class="wp-block-separator" />
<div class="wp-block-spacer" style="height: 30px;"></div>
<h3>About Open Data Day</h3>
<div class="wp-block-spacer" style="height: 30px;"></div>
<p><a href="https://opendataday.org/" rel="noreferrer noopener" target="_blank">Open Data Day</a> (ODD) is an annual celebration of open data all over the world. Groups from many countries create local events on the day where they will use open data in their communities. ODD is led by the <a href="https://okfn.org/" rel="noreferrer noopener" target="_blank">Open Knowledge Foundation</a> (OKFN) and this year’s edition is co-organised by <a href="https://okfn.org/en/gambia/" rel="noreferrer noopener" target="_blank">Jokkolabs Banjul</a> (Gambia), <a href="https://okfn.de/" rel="noreferrer noopener" target="_blank">Open Knowledge Germany</a>, <a href="https://okfn.org/en/ghana/" rel="noreferrer noopener" target="_blank">Open Knowledge Ghana</a>, and <a href="https://oknp.org/" rel="noreferrer noopener" target="_blank">Open Knowledge Nepal</a>, all members of the <a href="https://okfn.org/en/network/" rel="noreferrer noopener" target="_blank">Open Knowledge Network</a>.</p>
<p>As a way to increase the representation of different cultures, since 2023 we offer the opportunity for organisations to host an Open Data Day event on the best date between March 2nd and 8th. All outputs are open for everyone to use and re-use.</p>
<p>In 2024, Open Data Day is also a part of the <a href="https://www.hotosm.org/opensummit23-24" rel="noreferrer noopener" target="_blank">HOT OpenSummit ’23-24 initiative</a>, a creative programme of global event collaborations that leverages experience, passion and connection to drive strong networks and collective action across the humanitarian open mapping movement</p>
<p>For more information, you can reach out to the Open Knowledge Foundation team by emailing <a href="mailto:opendataday@okfn.org" rel="noreferrer noopener" target="_blank">opendataday@okfn.org</a>. You can also join the <a href="https://groups.google.com/forum/#!forum/open-data-day" rel="noreferrer noopener" target="_blank">Open Data Day Google Group</a> to ask for advice or share tips and get connected with others.</p>
Lucas Pretti
https://blog.okfn.org
HangingTogether: Bibliotheken ondersteunen datagedreven besluitvorming
https://hangingtogether.org/?p=14049
2024-02-28T08:16:00+00:00
<p class="has-small-font-size"><em>Met dank aan Vincent Jordaan, OCLC, voor het vertalen van de oorspronkelijke </em><a href="https://hangingtogether.org/libraries-support-data-driven-decision-making/" rel="noreferrer noopener" target="_blank"><em>Engelstalige blogpost</em></a><em>.</em></p>
<p>Het <a href="https://www.oclc.org/research/partnership.html" rel="noreferrer noopener" target="_blank">OCLC Research Library Partnership</a> (RLP) en LIBER (een vereniging van Europese onderzoeksbibliotheken) organiseerden op 7 februari 2024 een begeleide discussie over besluitvorming op basis van data. De bijeenkomst maakte deel uit van de lopende reeks <a href="https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023" rel="noreferrer noopener" target="_blank">Building for the future</a> (Bouwen aan de toekomst), waarin we onderzoeken hoe bibliotheken werken aan innovatieve dienstverlening, zoals beschreven in de <a href="https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023" rel="noreferrer noopener" target="_blank">LIBER-strategie 2023-2027</a>.</p>
<figure class="wp-block-image alignleft size-large is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/s-o-c-i-a-l-c-u-t-r0saAQNjEjQ-unsplash-scaled.jpg"><img alt="This image shows three women seated at a table working at computers." class="wp-image-13881" height="683" src="https://hangingtogether.org/wp-content/uploads/2024/02/s-o-c-i-a-l-c-u-t-r0saAQNjEjQ-unsplash-1024x683.jpg" style="width: 488px; height: auto;" width="1024" /></a>Photo by <a href="https://unsplash.com/@socialcut?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">S O C I A L . C U T</a> on <a href="https://unsplash.com/photos/3-women-sitting-on-chair-in-front-of-table-with-laptop-computers-r0saAQNjEjQ?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Unsplash</a>
</figure>
<p>Het OCLC RLP-team werkte samen met leden van de LIBER-werkgroepen <a href="https://libereurope.eu/working-group/research-data-management/" rel="noreferrer noopener" target="_blank">Research Data Management</a> en <a href="https://libereurope.eu/working-group/liber-data-science-in-libraries-working-group/" rel="noreferrer noopener" target="_blank">Data Science in Libraries</a> om de discussievragen op te stellen. Net zoals in ons vorige gesprek over research data management, hebben we geprobeerd de discussie praktisch te houden. We vroegen de deelnemers om hun huidige en toekomstige inspanningen te delen. Ook wilden we graag hun gedachten horen over de rol en waarde van de bibliotheek bij het ondersteunen van datagedreven besluitvorming. De discussies in kleine groepen werden gefaciliteerd door enthousiaste vrijwilligers van <a href="https://libereurope.eu/working-groups/" rel="noreferrer noopener" target="_blank">LIBER-werkgroepen</a> en OCLC.</p>
<p>De virtuele bijeenkomst werd bijgewoond door deelnemers van 35 instellingen in 15 landen uit Europa, Noord-Amerika en Azië. Ondanks de vele regionale en nationale verschillen waren er verschillende hoofdthema’s die in de zeven discussiegroepen naar voren kwamen.</p>
<h4 class="wp-block-heading">Wat betekent datagedreven besluitvorming voor bibliotheken?</h4>
<p>We stelden deelnemers deze vraag in een online poll. We bereikten een vrij sterke consensus dat datagedreven besluitvorming betekent “het gebruik van bewijs om beslissingen te onderbouwen en de resultaten ervan te evalueren”. Hoewel we in deze discussie de term “datagedreven” hebben gebruikt, erkennen we dat anderen de voorkeur geven aan “datagestuurd” of “databewust”.</p>
<p>In de gesprekken kwam naar voren dat kwaliteitsdata belangrijk zijn voor het onderbouwen van beslissingen. Ook kwam naar voren dat we voorzichtig moeten zijn en data alleen als hulpmiddel moeten gebruiken bij besluitvorming. Het is belangrijk om data te begrijpen binnen de juiste context. Deze data moeten niet enkel worden beschouwd als vervanging voor andere kwalitatieve onderzoeksmethoden om kennis te vergaren.</p>
<figure class="wp-block-image size-full"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/dddm.jpg"><img alt="" class="wp-image-13954" height="445" src="https://hangingtogether.org/wp-content/uploads/2024/02/dddm.jpg" width="921" /></a><em><sup>Reacties op een online poll met een vraag over de betekenis van “datagedreven” besluitvorming</sup></em></figure>
<h4 class="wp-block-heading">Hoe maken bibliotheken gebruik van datagedreven besluitvorming?</h4>
<p><strong>Er zijn talloze manieren waarop bibliotheken datagedreven besluitvorming gebruiken. </strong>We hoorden van deelnemers die <a href="https://hangingtogether.org/category/collective-collections/" rel="noreferrer noopener" target="_blank">gezamenlijke inspanningen</a> beschreven voor collectiebeheer. Hierbij werkt een groep bibliotheken samen om hun collecties samen te beheren, beslissingen over collectiebehoud te ondersteunen, en nog veel meer. Ook kunnen uitleenstatistieken worden gebruikt voor collectieontwikkeling en het saneren van collecties.</p>
<p>De deelnemers spraken ook over het analyseren van data met betrekking tot het gebruik van de bibliotheekgebouwen. Denk daarbij aan het automatisch registreren van hoeveel mensen een ruimte binnenkomen en verlaten of het analyseren van Wi-Fi-gebruik. Op deze manier kunnen ze de drukte in elke ruimte meten en onderbouwen ze beslissingen over ruimtebeheer.</p>
<p>De aanwezigen benadrukten ook de groeiende rol van de bibliotheek in onderzoeksanalyse, ter ondersteuning van institutionele doelstellingen. In het Verenigd Koninkrijk is de bibliotheek vaak verantwoordelijk voor het beheer van data over de wetenschappelijke kennis van de instelling, voor rapportage aan het nationale Research Excellent Framework (REF)<a href="https://feeds.feedburner.com/Hangingtogetherorg#_edn1" id="_ednref1">[i]</a>.</p>
<p>Ook op andere plekken ondersteunen bibliotheekmedewerkers institutionele inspanningen om inzicht te krijgen in de onderzoeksproductiviteit, de voortgang naar open onderzoeksdoelen en het identificeren van mogelijke samenwerkingsverbanden. Bibliotheken creëren ook specifieke functies om een breed scala aan onderzoeksdata te beheren en beschikbaar te maken voor hergebruik. Een onderwerp wat ook terugkwam in een <a href="https://libereurope.eu/article/data-curation-an-interview-with-matthias-towe/" rel="noreferrer noopener" target="_blank">recent LIBER-interview met Matthias Töwe</a>, Data Curator bij de ETH Zürich Library.</p>
<h4 class="wp-block-heading">Datagedreven besluitvorming ondersteunen is een uitdaging</h4>
<p><strong>Bibliotheken worden overspoeld met data.</strong> Verschillende deelnemers beschreven het gevoel overweldigd te worden door alle beschikbare gegevens. Alleen al de hoeveelheid maakt het lastig om data effectief te beheren, op te schonen en te gebruiken. Ook is het soms moeilijk om te weten welke data beschikbaar zijn, omdat ze verspreid zijn over vele silo’s binnen de organisatie. Daarom is meer organisatie en transparantie noodzakelijk.</p>
<p><strong>Samenwerking is vereist</strong>, ongeacht de omvang. Het <a href="https://crl.acrl.org/index.php/crl/article/view/24618/32438" rel="noreferrer noopener" target="_blank">analyseren van collecties van meerdere instellingen</a> vergt aanzienlijke investeringen en betrokkenheid van diverse belanghebbenden in verschillende instellingen en bibliotheekafdelingen. Zelfs bij het oplossen van lokale operationele vraagstukken moeten bibliotheekmedewerkers <a href="https://hangingtogether.org/social-interoperability-getting-to-know-all-about-you/" rel="noreferrer noopener" target="_blank">sociale interoperabiliteit</a> toepassen. Een deelnemer merkte op: “we hebben data van anderen nodig”, om eigen taken te volbrengen.</p>
<p><strong>Gebruikers die om data en rapporten vragen, kunnen vaak niet helder uitleggen wat ze precies nodig hebben.</strong> Dit blijkt een veelvoorkomend probleem te zijn, zoals uit onze online poll over de spanningen en uitdagingen van samenwerking rond datagedreven besluitvorming bleek. Een kleine groep besprak de noodzaak om “referentie-interviews” toe te passen bij het praten met gebruikers van data, om zo beter te begrijpen welke vragen ze willen beantwoorden en deze te verduidelijken.</p>
<h4 class="wp-block-heading">Wat biedt de bibliotheek aan meerwaarde voor datagedreven besluitvorming?</h4>
<p>We vroegen de deelnemers om in kleine groepen te discussiëren over de algehele waarde die de bibliotheek biedt bij het faciliteren van beslissingen op basis van data:</p>
<p><strong>Bibliotheken weten alles over metadata. </strong>De vaardigheden en kennis van bibliotheekmedewerkers over bibliotheekdata zijn van onschatbare waarde voor het beheren van collecties en meer. Deze deskundigheid op het gebied van metadata is duidelijk een sterk punt, maar wordt vaak over het hoofd gezien. Een deelnemer uitte zijn zorgen over het feit dat bibliotheekexpertise te gemakkelijk wordt afgedaan als “alleen boeken”. Dit gebeurt zonder de overdraagbaarheid en waarde van deze vaardigheden te erkennen, zoals ervaring met complexe bedrijfssystemen, vaardigheid met databeheer en de consistente toepassing van regels, standaarden en beleid.</p>
<p><strong>Bibliotheken gebruiken data om op een verantwoorde manier bronnen te beheren. </strong>Activiteiten zoals gedeelde drukwerkcollecties en andere collectieve collecties vertrouwen op verzamelde data over bibliotheekcollecties. Deze data worden gebruikt om beslissingen te nemen over collectieontwikkeling, retentie en langdurig en kosteneffectief beheer van wetenschappelijke data. Verschillende deelnemers beschreven ook hoe data over collecties en het gebruik van bibliotheekgebouwen werden gebruikt. Dit gebeurde om beslissingen te nemen over toekomstig ruimtegebruik. Bibliotheken moeten namelijk aantonen dat ze hun middelen goed benutten, zodat ze blijvende financiering ontvangen.</p>
<p><strong>Ondersteunende diensten voor onderzoek gaan verder dan de traditionele bibliotheek en zijn zeer zichtbaar voor andere belanghebbenden op de campus. </strong>Bibliotheekondersteuning op het gebied van bijvoorbeeld research data management, onderzoeksinformatie en het beheer van data voor nationale rapportagevereisten, in lijn met de strategische prioriteiten van de campus, trekt vaak de meeste aandacht van niet-bibliotheekbelanghebbenden.<br /> Deelnemers uit het Verenigd Koninkrijk en Hong Kong benadrukten bijvoorbeeld de centrale rol van de bibliotheek bij het verzamelen van de wetenschappelijke data van de instelling. Dit ter ondersteuning van nationale rapportageverplichtingen en om analyses te maken van de output en impact van institutioneel onderzoek. Een Canadese deelnemer beschreef de aanstelling van een bibliometrische bibliothecaris, die nu leiding geeft aan een informeel netwerk van business intelligence officers binnen de universiteit. De groep biedt nu ondersteuning bij het nemen van beslissingen over naleving, beoordeling en financiering.<br /> Bibliotheken onderzoeken ook hoe ze een reeks indicatoren kunnen definiëren die inzicht geven in open onderzoeksactiviteiten, zoals beschreven in een <a href="https://hangingtogether.org/supporting-open-research-at-the-university-of-manchester-libraries/" rel="noreferrer noopener" target="_blank">recente RLP-webinarpresentatie</a> door Scott Taylor van de Universiteit van Manchester.</p>
<h4 class="wp-block-heading">Hoe kunnen bibliotheken hun waarde beter communiceren en welke strategieën kunnen ze hiervoor gebruiken?</h4>
<p><strong>Het is belangrijk dat bibliotheekbestuurders actief pleiten voor de rol en toegevoegde waarde van de bibliotheek. </strong>We hebben veel voorbeelden gehoord van bibliotheken die ondersteuning bieden bij institutionele besluitvorming. Maar het blijft nog steeds een uitdaging om niet-bibliotheekbelanghebbenden ervan te overtuigen dat de bibliotheek een waardevolle bijdrage levert. Een terugkerende zorg is dat mensen simpelweg niet aan de bibliotheek denken. Deze zorg kwam ook naar voren in de <a href="https://hangingtogether.org/exploring-the-challenges-and-opportunities-of-research-data-management-rdm/" rel="noreferrer noopener" target="_blank">vorige begeleide discussie over research data management</a>. Daarom is het van belang dat bibliotheekleiders vastberaden zijn in het benadrukken van de kennis en vaardigheden van bibliotheekmedewerkers. Ze moeten partners begeleiden om de bibliotheek op nieuwe en eigentijdse manieren te zien en te waarderen.</p>
<p><strong>Gebruik een ‘doelenboom’ om duidelijk te maken wat belangrijk is en om dit intern en extern te communiceren</strong>. Een Britse deelnemer aan een van de kleine groepsdiscussies vertelde hoe ze zo’n doelenboom had gemaakt voor haar metadatateam. Daarin stond in grote lijnen wat ze wilden bereiken en hoe dit past binnen de doelen van de bibliotheek en de universiteit. Zo liet ze zien dat de catalogiseerders niet alleen maar “in de hoek zaten en boeken doornamen”, maar een belangrijke rol hebben in het beheren van kwaliteitsmetadata, wat weer helpt bij allerlei bedrijfsbehoeften. De andere deelnemers aan de discussie waren enthousiast over dit idee. Het lijkt een goede manier om het team te versterken en ervoor te zorgen dat iedereen dezelfde doelen voor ogen heeft.</p>
<p><strong>Visualisaties en data storytelling zijn nodig.</strong> Een sterk thema in de gesprekken in kleine groepen was dat de data alleen niet genoeg zijn. Bibliotheekmedewerkers moeten ook vaardigheden ontwikkelen in het presenteren van data. Ze moeten het gebruik van visualisaties beheersen om effectief te communiceren en enthousiasme te wekken voor de resultaten.</p>
<div class="wp-block-image">
<figure class="alignright size-full is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/Feel.jpg"><img alt="" class="wp-image-13922" height="378" src="https://hangingtogether.org/wp-content/uploads/2024/02/Feel.jpg" style="width: 470px; height: auto;" width="621" /></a><em><sup>Word cloud gemaakt tijdens de bijeenkomst</sup></em></figure></div>
<p><strong>Bibliotheekmedewerkers moeten hun kennis bijspijkeren, zowel individueel als in teamverband.</strong> Ze zijn goed in het beheren van data, maar ze missen vaak training in data-analyse met programma’s zoals Power BI en Tableau. Verschillende deelnemers hebben verteld over het aanleren van deze vaardigheden. Iemand uit Hong Kong beschreef bijvoorbeeld hoe haar instelling een groep heeft opgericht om samen data-analyse te verkennen, waardoor ze van elkaar kunnen leren. Een andere deelnemer uit Nederland doet iets vergelijkbaars met een lokale werkgroep die vaardigheden in datavisualisatie aanleert en een breder praktijknetwerk opbouwt. Over het algemeen zeiden de deelnemers dat er behoefte is aan bijscholing van het huidige personeel en dat er in de toekomst ook nieuwe mensen met goede technische vaardigheden nodig zijn voor data-analyse.</p>
<p>We sloten de bijeenkomst af met de uitnodiging aan de deelnemers om in één woord te zeggen over hoe ze zich voelden na het gesprek. Ze gaven aan dat ze zich geïnspireerd, geïnformeerd en aangemoedigd voelden.</p>
<h4 class="wp-block-heading">Doe mee aan de volgende begeleide discussie over AI, Machine Learning en Data Science</h4>
<p>De volgende sessie in deze reeks over geavanceerde diensten vindt plaats op 17 april. Tijdens deze bijeenkomst verkennen we gezamenlijk de uitdagingen en kansen van AI, machine learning en data science. De focus ligt op de manieren waarop onderzoeksbibliotheken vooruitstrevende technologieën gebruiken, of willen gebruiken, om werkprocessen in de bibliotheek, metadata en meer te verbeteren. Door gestructureerde discussies in kleine groepen te faciliteren, nodigen we deelnemers uit om ideeën te delen en op te doen over hun visies op de toekomst van AI en datawetenschap. Tegelijkertijd willen we gericht de uitdagingen onderzoeken waarmee bibliotheken worden geconfronteerd bij <a href="https://www.oclc.org/research/publications/2019/oclcresearch-responsible-operations-data-science-machine-learning-ai.html" rel="noreferrer noopener" target="_blank">het verantwoord toepassen van opkomende technologieën</a>. <a href="https://www.oclc.org/oclc-forms/en/events/2024/liber-webinar-ai-machine-learning-data-science.html" rel="noreferrer noopener" target="_blank">Meld je vandaag nog aan</a> om een plekje te bemachtigen.</p>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<p><a href="https://feeds.feedburner.com/Hangingtogetherorg#_ednref1" id="_edn1">[i]</a> Het Research Excellence Framework (REF) is een systeem dat wordt gebruikt in het Verenigd Koninkrijk om de kwaliteit van het onderzoek aan universiteiten en onderzoeksinstellingen te evalueren en te beoordelen.</p>
<p>The post <a href="https://hangingtogether.org/bibliotheken-ondersteunen-datagedreven-besluitvorming/">Bibliotheken ondersteunen datagedreven besluitvorming</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Rebecca Bryant
https://hangingtogether.org/
Ed Summers: congressedits notes
https://inkdroid.org/2024/02/28/congressedits/
2024-02-28T05:00:00+00:00
<p>
Ok, this is a blast from the past. Here are some rough notes for a
conversation today with <a href="http://gvptsites.umd.edu/calvo/">Ernesto</a> about how <a href="https://en.wikipedia.org/wiki/CongressEdits">congressedits</a>
worked, and Wikipedia data more generally. I couldn’t summon the will to
create a slidedeck, but I was taking some notes, and it seemed easiest
to just drop them here. Ernesto does some amazing <a href="https://ilcss.umd.edu/research">work</a> studying the political
dimensions of the web.
</p>
<p>
This is really just a list of links to follow…
</p>
<h3 id="what-was-it">
What was it?
</h3>
<p>
<a href="https://en.wikipedia.org/wiki/File:Screenshot_of_@congressedits_Tweet_1045422483082551302.png"><img src="https://inkdroid.org/images/congressedits-example.png" /></a>
</p>
<ul>
<li>
archive: <a href="https://edsu.github.io/congressedits-archive/">https://edsu.github.io/congressedits-archive/</a>
</li>
<li>
article about it: <a href="https://source.opennews.org/articles/automating-transparency/">https://source.opennews.org/articles/automating-transparency/</a>
</li>
<li>
anon: <a href="https://github.com/edsu/anon">https://github.com/edsu/anon</a>
</li>
<li>
other anon bots: <a href="https://github.com/edsu/anon?tab=readme-ov-file#community">https://github.com/edsu/anon?tab=readme-ov-file#community</a>
</li>
</ul>
<h3 id="how-did-it-work">
How did it work?
</h3>
<ul>
<li>
anonymous editing of wikipedia uses IP address of the user as the
username: e.g. <a href="https://en.wikipedia.org/w/index.php?diff=1210806587&oldid=1200740098">https://en.wikipedia.org/w/index.php?diff=1210806587&oldid=1200740098</a>
</li>
<li>
what is my IP address: <a href="https://whatismyipaddress.com/">https://whatismyipaddress.com/</a>
</li>
<li>
IP ranges for Congress from GovTrack: <a href="https://github.com/govtrack/govtrack.us-web/blob/031ea63cc4dda2a24c341834f081f46519f7f7f9/website/middleware.py#L15-L32">https://github.com/govtrack/govtrack.us-web/blob/031ea63cc4dda2a24c341834f081f46519f7f7f9/website/middleware.py#L15-L32</a>
</li>
<li>
previous work on wikistream: <a href="https://wikistream.toolforge.org">https://wikistream.toolforge.org</a>
</li>
<li>
Event Stream API: <a href="https://wikitech.wikimedia.org/wiki/Event_Platform/EventStreams">https://wikitech.wikimedia.org/wiki/Event_Platform/EventStreams</a>
</li>
<li>
example of listening to Event Stream API with Python: <a href="https://gist.github.com/edsu/80bf00eacbe97378568acf4964bb7e23">https://gist.github.com/edsu/80bf00eacbe97378568acf4964bb7e23</a>
</li>
</ul>
<h3 id="research">
Research
</h3>
<ul>
<li>
Wikipedia XTools: <a href="https://xtools.wmcloud.org/ec">https://xtools.wmcloud.org/ec</a>
for example, search for edits from the US Senate: 156.33.0.0/16
</li>
<li>
CIDR: <a href="https://en.wikipedia.org/wiki/Classless_Inter-Domain_Routing">https://en.wikipedia.org/wiki/Classless_Inter-Domain_Routing</a>
</li>
<li>
Internet registries: <a href="https://en.wikipedia.org/wiki/Regional_Internet_registry">https://en.wikipedia.org/wiki/Regional_Internet_registry</a>
</li>
<li>
Registration Data Access Protocol: <a href="https://en.wikipedia.org/wiki/Registration_Data_Access_Protocol">https://en.wikipedia.org/wiki/Registration_Data_Access_Protocol</a>
</li>
<li>
ARIN: <a href="https://search.arin.net/rdap/">https://search.arin.net/rdap/</a>
</li>
<li>
Another example: tracking Ukrainian Websites: <a href="https://github.com/edsu/gov-ua">https://github.com/edsu/gov-ua</a>
</li>
</ul>
Ed Summers
https://inkdroid.org/
Lucidworks: Win B2B Buyers with Smarter Search and More
https://lucidworks.com/?p=28496
2024-02-28T00:09:47+00:00
<p>Elevate your B2B digital experience. Learn how search, personalization, and self-service tools win over modern B2B buyers. </p>
<p>The post <a href="https://lucidworks.com/post/win-b2b-buyers-with-smarter-search-and-more/">Win B2B Buyers with Smarter Search and More</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Brian Land
https://lucidworks.com/
Lucidworks: 8 Pitfalls of Gen AI in Search — and How to Avoid Them
https://lucidworks.com/?p=28332
2024-02-27T15:23:15+00:00
<p>In the rush to embrace the future, it's easy to overlook the pitfalls of adopting new technologies. Large language models...</p>
<p>The post <a href="https://lucidworks.com/post/8-pitfalls-of-gen-ai-in-search/">8 Pitfalls of Gen AI in Search — and How to Avoid Them</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Lucidworks
https://lucidworks.com/
Digital Library Federation: Shira Peltzman Elected 2024 NDSA Vice Chair
https://www.diglib.org/?p=30823
2024-02-27T15:05:53+00:00
<p><span style="font-weight: 400;">Shira Peltzman, in her second year as a member of the NDSA Coordinating Committee, has been elected by the NDSA Leadership as its 2024 Vice Chair and 2025 Chair. The Vice Chair’s duties include: </span></p>
<ul>
<li style="font-weight: 400;"><span style="font-weight: 400;">Managing the annual process to elect new CC members.</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">Facilitating the new member application process.</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">Convening quarterly meetings for the Co-Chairs of Working Groups and Interest Groups.</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">Participating in quarterly meetings between NDSA and CLIR.</span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">Along with the Chair, ensuring the NDSA Code of Conduct is carried out.</span></li>
</ul>
<p><span style="font-weight: 400;">Shira Peltzman (1st CC term, 2023-2025) is the Associate Director for Preservation Digital Strategies at Yale University Library where she provides leadership and direction for digital preservation, media preservation, and preservation imaging. In her role she serves as an advocate for sustainable stewardship and works with stakeholders across campus to champion ambitious preservation initiatives that support enduring access to Yale’s digital collections.</span></p>
<p><span style="font-weight: 400;">Please join me in congratulating Shira on this new role!</span></p>
<p>The post <a href="https://www.diglib.org/shira-peltzman-elected-2024-ndsa-vice-chair/" rel="nofollow">Shira Peltzman Elected 2024 NDSA Vice Chair</a> appeared first on <a href="https://www.diglib.org" rel="nofollow">DLF</a>.</p>
kussmann
https://www.diglib.org
HangingTogether: Collaboration is optional
https://hangingtogether.org/?p=14000
2024-02-27T15:03:06+00:00
<div class="wp-block-image">
<figure class="alignleft size-large is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2023/10/greyhound-scaled.jpg"><img alt="" class="wp-image-13281" height="683" src="https://hangingtogether.org/wp-content/uploads/2023/10/greyhound-1024x683.jpg" style="width: 392px; height: auto;" width="1024" /></a><em><sup>This dog has the option to chomp the hand poking his nose. So far, he has chosen not to exercise it. Photo by <a href="https://unsplash.com/@benwilliams?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Benjamin Williams</a> on <a href="https://unsplash.com/photos/a-dog-is-being-petted-by-a-person-MEljQmUhgQw?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Unsplash</a></sup></em></figure></div>
<p>I really like the concept of options as a way of thinking about future opportunity. In this post, I’d like to make a case that adopting an “options perspective” can strengthen library decision-making in a range of scenarios – including the decision whether to collaborate. But let me start with a few brief remarks about options that will help clarify application of this concept to libraries. </p>
<p>An option bestows upon its owner the <em>right, but not the obligation</em> to do something. For example, in finance, a call option grants the right, but not the obligation, to purchase a security (say, a particular stock) at a specified price at any time before the option expires. Why is a call option valuable? If the market price of the stock rises above the price specified in the option contract (the “strike price”), you can exercise the option, purchase the security at the strike price, and then sell it on the open market for a profit. But there is no obligation to exercise the option; if the market price remains below the strike price, then the option will likely be allowed to expire unexercised.</p>
<p>If you boil away the details of particular types of options and the specific features of the option contract, I think you are left with two general points:</p>
<ol>
<li>Options represent an <em>opportunity</em> to do something in the future.</li>
<li>Possessing that opportunity is <em>valuable</em>.</li>
</ol>
<p>Call options – and other types of financial options – are not free. In fact, option values are monetized and traded in financial markets. This leads to a third general point about options:</p>
<ol start="3">
<li>There is a <em>cost</em> to acquire an option.</li>
</ol>
<p>In other words, you pay an upfront cost to acquire an option, in the hopes of exercising it for a larger return sometime in the future. So you must expend resources to get an option; it’s not just a choice that already exists on its own.</p>
<h2 class="wp-block-heading">The options perspective</h2>
<p>Enough about financial options. What does this have to do with libraries? Let me illustrate the connection through examples having to do with digital preservation and research data management.</p>
<p>As I mentioned at the outset, I like the concept of options, and the reason is because I have found that it pops up in all sorts of contexts, and often provides some unexpected insights. In 2010, I wrote an appendix for the report <a href="https://www.sdsc.edu/assets/docs/pub/BRTF_Final_Report.pdf" rel="noreferrer noopener" target="_blank"><em>Sustainable Economics for a Digital Planet: Ensuring Long-term Access to Digital Information</em></a> in which I made the case that the concept of options deepens our understanding of investing in digital preservation. Preserving a digital object is an uncertain enterprise – in particular, it is often unknown whether future usage of the object will justify the expense of preserving it. But this decision need not be made once and for all at the outset; by making initial commitments to retain the object for a finite period – say a few years – the repository has for all intents and purposes “purchased” the option to preserve the object for a longer period, a decision which can then be made at a later time after reevaluating usage patterns for the object over the initial retention period.</p>
<p>The example of digital preservation satisfies the three basic features of options enumerated above:</p>
<ol>
<li><em>Opportunity:</em> the opportunity to preserve the digital object long term</li>
<li><em>Value:</em> the potential benefits of ongoing usage of the object</li>
<li><em>Cost: </em>the expense of initial preservation actions</li>
</ol>
<p>The scenario laid out above is not merely a thought exercise, but a useful framework for addressing a real-world problem for many academic libraries: data set retention. For example, the <a href="https://databank.illinois.edu/">Illinois Data Bank</a> is a “public access repository for publishing research data from the University of Illinois at Urbana-Champaign.” The <a href="https://databank.illinois.edu/policies#preservation_policy">preservation policy</a> associated with a deposited data set includes a commitment to preserve the data set for a minimum of five years. After this period, the Data Bank reserves the right to review the data set and determine if it will be retained or deaccessioned. This is implicitly an option-based approach to data curation: an initial investment to retain the data set for a limited period sets up an option to continue to preserve it long term. The review described in the preservation policy is, essentially, a decision whether or not to exercise that option.</p>
<p>Contrast this to an extreme case: a once-and-for-all decision at the time the data is ready for deposit to either accept the data set and commit to retaining it indefinitely, or not accept it at all. In either eventuality, an element of choice, or flexibility, is lost: either the ability to deaccession a data set if its predicted future value does not warrant its preservation cost, or the ability to resolve some of the uncertainty over the data set’s future value by preserving it for a limited time, and then make a more informed decision about long term retention later. And of course, if the object is not retained at the outset – if the option to preserve is not created through the initial investment in curation – a potentially valuable data set could be lost forever.</p>
<h2 class="wp-block-heading">The option to collaborate</h2>
<p>But the utility of the options framework does not end here. Our latest OCLC Research report,<a> </a><a href="https://www.oclc.org/research/publications/2023/rdm-collaboration/rdm-library-collaboration-case-studies.html"><em>Building Research Data Management Capacity: Case Studies in Strategic Library Collaboration</em></a>, highlights another area where an option-focused perspective yields useful insight: library collaboration.</p>
<p>Our report documents several case studies of multi-institutional collaborations in the RDM space. One theme running through these case studies was the importance of trust among collaborating partners. Trusted partners not only improve the chances of success for current collaborative efforts, but also open up opportunities for expanding collaboration into new areas. As we observe in the report:</p>
<p class="is-style-default">“Another example of an intangible benefit is the accumulation of trust through the shared experience of collaboration. Trust, in turn, is important to the success and the prospect of future partnerships. . . . In this sense, collaborating is itself a benefit of collaboration, building up a shared foundation that can create an ‘option to collaborate’ for the future.”</p>
<p>In other words, investing in a collaboration – even a small-scale effort with limited objectives – cultivates among the partners a shared experience of working together, which can be leveraged as other opportunities for collaboration arise. This <em>intangible benefit</em> – the creation of an option to collaborate in the future – sits alongside any direct, transactional benefit an institution receives from participating in the partnership, but is probably rarely accounted for in any cost-benefit analysis of participating in a collaboration.</p>
<p>The report illustrates this idea with the example of the Texas Data Repository (TDR), a service that allows researchers affiliated with a member institution of the Texas Digital Library (TDL) to publish their data sets. As the report notes, an important incentive to participate in the TDR was that it “was designed and operated by the TDL, a trusted entity with a track record of building community and shared capacities among its membership.” The experience of working with TDL partners on past collaborative efforts created a viable “option to collaborate” on future joint endeavors – an option that was indeed exercised when the opportunity to build shared data repository capacity arose. </p>
<h2 class="wp-block-heading">“Part of the value of collaboration is collaborating”</h2>
<p>A similar observation is found in another recent OCLC Research report, <em><a href="https://www.oclc.org/research/publications/2023/sustaining-art-research/sustaining-art-research-collections-case-studies.html" rel="noreferrer noopener" target="_blank">Sustaining Art Research Collections: Case Studies in Collaboration</a></em>. This report explores the experiences of several art museum libraries partnering with academic libraries as part of a strategy for achieving long-term sustainability for their collections. The report notes:</p>
<p>“. . . [A]n important intangible benefit of any partnership is the creation of a shared history of collaboration between partners that can be leveraged in the future. As staff from different institutions accumulate experience working together, a measure of trust and confidence in the relationship grows. This ends up representing an ‘option to collaborate’ that can be exercised in the future—either on an entirely new effort, or on extending existing collaborations into new activities. Part of the value of collaboration is collaborating, and this should not be overlooked when assessing the benefits returned from working with other institutions.”</p>
<p>For example, in a case study detailing the partnership between the Hirsch Library at the Museum of Fine Arts, Houston and the nearby Fondren Library at Rice University, we found that individuals we spoke to at both institutions emphasized the <em>value of the relationship</em><strong> </strong>as distinct from the value of the current collaboration. They believed that “regardless of the benefits perceived from the original agreement, the relationship between the two institutions is valuable and should be protected and preserved.” These partnering institutions saw that the full value of a collaboration goes beyond the immediate transactional benefits, to include the value created by cultivating an option to work together in the future.</p>
<p>In short, collaboration shares the same option-like features we saw with the digital preservation and data set retention examples mentioned earlier: a value in investing in something that creates the opportunity to make choices at a later time. And like those examples, there is insight to be gained about collaboration from thinking about it from an options-based perspective.</p>
<p>In making the case for the option value in collaboration, I am <em>not</em> suggesting that libraries should enter into every collaborative opportunity that comes along in the expectation of creating valuable options to collaborate that can be exercised in the future. Instead, the potential value of a trusted relationship with a partnering institution that can catalyze future collaborations should be considered alongside the many other factors that play into <a href="https://www.oclc.org/research/publications/2022/strategic-collaboration/strategic-collaboration-report.html" rel="noreferrer noopener" target="_blank">treating collaboration as a strategic choice</a>. In doing so, libraries will be addressing the recommendation put forward in <a href="https://www.oclc.org/research/publications/2023/rdm-collaboration/rdm-library-collaboration-case-studies.html"><em>Building Research Data Management Capacity</em></a>: <em>value the intangible benefits of collaboration</em>.</p>
<h2 class="wp-block-heading">Insights from options improve decision making</h2>
<p>More generally, the findings from our research on library collaboration, as well as our work in other areas of strategic interest to libraries, suggest that decision making can be improved by adopting an options perspective. <em>When confronted with decisions that involve future opportunity, it is valuable to factor that opportunity into assessments of costs and benefits.</em> Can a modest investment now lead to an expansion of future choices or flexibility?</p>
<p>Many decision makers probably consider this option value at least implicitly as part of their decision making process: for example, repository managers know that if effort is not made to retain and curate a data set now, the ability to use the data set later may be irrevocably lost. But the existence of an option value may be less clear in considering collaborative opportunities and the value that flows from them. In these situations, we may need to look a little harder for the option value. But as the findings from <a href="https://www.oclc.org/research/publications/2023/rdm-collaboration/rdm-library-collaboration-case-studies.html" rel="noreferrer noopener" target="_blank"><em>Building Research Data Management Capacity</em></a> indicate, the option value is often there, and it can be significant.</p>
<p>The post <a href="https://hangingtogether.org/collaboration-is-optional/">Collaboration is optional</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Brian Lavoie
https://hangingtogether.org/
Jodi Schneider: A Retraction Notice Not Retrieved: Wrong DOI
https://jodischneider.com/blog/?p=2881
2024-02-25T02:53:00+00:00
<p><em>Part 2 of an occasional series on the <a href="https://infoqualitylab.org/projects/risrs2020/bibliography/">Empirical Retraction Lit bibliography</a></em></p>
<p>Our systematic search for the <em><a href="https://infoqualitylab.org/projects/risrs2020/bibliography/">Empirical Retraction Lit bibliography</a></em> EXCLUDES retraction notices or retracted publications using database filters. Still, some turn up. (Isn’t there always a metadata mess?)</p>
<p>While most retraction notices and retracted publications can be excluded at the title screening stage, a few make it through to the abstract screening, and, for items with no abstracts, to the full-text screening. Today’s example is “<a href="https://doi.org/10.1308/rcsann.2014.94">Retraction of unreliable publication</a>“. Kept at the title-screening stage**; no abstract; so it’s part of the full-text screening. PubMed metadata would have told us it’s a “Retraction of Publication” – but this particular record came from Scopus.</p>
<p>The Zotero-provisioned article, “<a href="https://doi.org/10.1308/rcsann.2014.157">Clinical guidelines: too much of a good thing</a>“, had nothing to do with retraction so I went back to the record (which had this <a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-84897800625&doi=10.1308%2fxxx&partnerID=40&md5=890fe628194eaaf73333dcd49c8d8df3">link with the Scopus EID</a>). To see what went wrong, I searched Scopus for <strong>EID(2-s2.0-84897800625)</strong> which finds the Scopus record, complete with an incorrect DOI: 10.1308/xxx which today takes me to a <a href="https://doi.org/10.1308/rcsann.2015.97.4.326">third article with another DOI</a>.***</p>
<figure class="wp-block-image size-full"><a href="https://jodischneider.com/blog/wp-content/uploads/2024/02/search-result.png"><img alt="" class="wp-image-2883" height="452" src="https://jodischneider.com/blog/wp-content/uploads/2024/02/search-result.png" width="800" /></a></figure>
<p>Scopus Preview is even more interesting because it shows the EMTREE terms “note” and “retracted article” (which are not so accurate in my opinion):</p>
<figure class="wp-block-image size-full"><a href="https://jodischneider.com/blog/wp-content/uploads/2024/02/scopus-preview.png"><img alt="" class="wp-image-2886" height="538" src="https://jodischneider.com/blog/wp-content/uploads/2024/02/scopus-preview.png" width="800" /></a></figure>
<p>In my <a href="https://doi.org/10.1007/s11192-020-03631-1">2020 <em>Scientometrics</em> article</a>, I cataloged challenges in getting to the full-text retraction notice for a single article. It’s not clear how common such errors are, nor how to systematically check for errors. </p>
<p>I’m continuing to think about this, since, for <a href="https://infoqualitylab.org/projects/risrs2020/">RISRS II</a>, I’m on the lookout for metadata disasters (in research-ese: What are the implications of specific instances of successes and failures in the metadata pipeline, for designing consensus practices?) </p>
<p>This particular retrieval error is due to the wrong DOI – which could affect any article (not just retraction notices). I’ve reported the DOI error to the Scopus document correction team.</p>
<p>It’s helpful that working on the <em><a href="https://infoqualitylab.org/projects/risrs2020/bibliography/">Empirical Retraction Lit bibliography</a></em> surfaces anomalous situations.</p>
<p></p>
<p>**Keeping “Retraction of unreliable publication” for abstract screening may seem overgenerous. But consider the title “Retractions”. Surely “Retractions” is the title of a bulk retraction notice! Nope, <a href="https://doi.org/10.1162/REST_a_00469">it’s a research article in the <em>Review of Economics and Statistics</em> by Azoulay, Furman, Krieger, and Murray</a>. Thanks, folks. While plurals are more likely than singulars to signal research articles and editorials I try to keep vague/ambiguous titles for a closer look.</p>
<p>***For 10.1308/xxx Crossref just lists this latest article. Same with Scopus.</p>
<figure class="wp-block-image size-full"><a href="https://jodischneider.com/blog/wp-content/uploads/2024/02/crossref-10.1308xxx.png"><img alt="" class="wp-image-2887" height="304" src="https://jodischneider.com/blog/wp-content/uploads/2024/02/crossref-10.1308xxx.png" width="800" /></a></figure>
<p>But my university library system has multiple results – a mystery!</p>
<figure class="wp-block-image size-full"><a href="https://jodischneider.com/blog/wp-content/uploads/2024/02/Illinoisbento-10.1308xxx-1.png"><img alt="" class="wp-image-2892" height="679" src="https://jodischneider.com/blog/wp-content/uploads/2024/02/Illinoisbento-10.1308xxx-1.png" width="800" /></a></figure>
jodi
https://jodischneider.com/blog
Lucidworks: Unlock User Intent with Gen AI: Thrive in a Post-Cookie World
https://lucidworks.com/?p=28479
2024-02-22T23:44:40+00:00
<p>Explore how retailers use Generative AI to navigate the privacy-first landscape and decode user intent.</p>
<p>The post <a href="https://lucidworks.com/post/unlock-user-intent-with-gen-ai-thrive-in-a-post-cookie-world/">Unlock User Intent with Gen AI: Thrive in a Post-Cookie World</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Brian Land
https://lucidworks.com/
Lucidworks: How Intranet Search Powers Effective Content Management
https://lucidworks.com/?p=28473
2024-02-22T19:13:38+00:00
<p>It’s time to lose the frustration and find what you need. Intranet search solutions build efficient content discovery and team collaboration.</p>
<p>The post <a href="https://lucidworks.com/post/how-intranet-search-powers-effective-content-management/">How Intranet Search Powers Effective Content Management</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Lucidworks
https://lucidworks.com/
David Rosenthal: Competition-proofing
tag:blogger.com,1999:blog-4503292949532760618.post-2647250638123695568
2024-02-22T16:00:00+00:00
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjfeROHBB3VcUVaQSjLlxDb4rpKVWB9soukI9gSNqLzUNXEc_JEAhk8VgH8Va-Wc-Y6FRtQnDoBjx1N3rQuol19rdxj70OTe4lp8DwlpTsHsF-Mc4ryTHaEBeg95wDkhFONw9s5Yhhc7yrRK5t1BrMz2sG5yaqakcK9Lfpp3AKyWSV8Dk5zfnUdDkDpEBMg/s1141/MarketCaps.jpeg" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="138" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjfeROHBB3VcUVaQSjLlxDb4rpKVWB9soukI9gSNqLzUNXEc_JEAhk8VgH8Va-Wc-Y6FRtQnDoBjx1N3rQuol19rdxj70OTe4lp8DwlpTsHsF-Mc4ryTHaEBeg95wDkhFONw9s5Yhhc7yrRK5t1BrMz2sG5yaqakcK9Lfpp3AKyWSV8Dk5zfnUdDkDpEBMg/w200-h138/MarketCaps.jpeg" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.ft.com/content/c43661bb-a087-4a54-a4b4-a6549d9156c3">Source</a></td></tr></tbody></table>
Apart from getting started in the midst of one of Silicon Valley's regular downturns, another great thing about the beginnings of Nvidia was that instead of insisting on the "minimum viable product" our VCs, Sutter Hill and Sequoia, gave us the time to develop a real architecture for a family of chips. It enabled us to get an amazing amount of functionality into a half-micron gate array; <a href="https://blog.dshr.org/2014/12/hardware-io-virtualization.html">I/O virtualization</a>, a DMA engine, a graphics processor that rendered curved surfaces directly, not by approximating them with triangles, a sound engine and support for game controllers. As I write, after a three decade-long history of bringing innovations to the market, Nvidia is <a href="https://www.ft.com/content/c43661bb-a087-4a54-a4b4-a6549d9156c3">America's third most valuable company</a>. <br />
<br />
I've written <a href="https://blog.dshr.org/search/label/venture%20capital">several times</a> about how in pursuit of a quicker buck, VCs have largely discarded the slow process of building an IPO-ready company like Nvidia in favor of building one that will be acquired by one of the dominant monopolists. These VCs don't support innovation. Even if their acquisition-bound companies do innovate in their short lives, their innovations are rarely tested in the market after the acuisition.<br />
<br />
Below the fold I discuss a new paper that presents a highly detailed look at the mechanisms the dominant companies use to neutralize the threats startups could pose to their dominance.<br />
<span><a name="more"></a></span>
<br />
In <a href="https://dx.doi.org/10.2139/ssrn.4713845"><i>Coopting Disruption</i></a> law professors Mark Lemley (Stanford) and Matthew Wansley (Cardozo) ask a good question:<br />
<blockquote>
Our economy is dominated by five aging tech giants—Alphabet, Amazon, Apple, Meta, and Microsoft. Each of these firms was founded more than twenty years ago: Apple and Microsoft in the 1970s, Google and Amazon in the 1990s, and Facebook in 2004. Each of them grew by successfully commercializing a disruptive technology—personal computers (Apple), operating systems (Microsoft), online shopping (Amazon), search engines (Google), and social networks (Facebook). Each of them displaced the incumbents that came before them. But in the last twenty years, no company has commercialized a new technology in a way that threatens the tech giants. Why?
</blockquote>
The TL;DR of Lemley and Wansley's answer to their question <a href="https://dx.doi.org/10.2139/ssrn.4713845">is</a>:<br />
<blockquote>
While there are many reasons for the tech giants’ continued dominance, we think an important and overlooked one is that they have learned how to coopt disruption. They identify potentially disruptive technologies, use their money to influence the startups developing them, strategically dole out access to the resources the startups need to grow, and seek regulation that will make it harder for the startups to compete. When a threat emerges, they buy it off. And after they acquire a startup, they redirect its people and assets to their own innovation needs.
</blockquote>
<a href="https://dx.doi.org/10.2139/ssrn.4713845">They observe that</a>:<br />
<blockquote>
a company that is started with the goal of being swallowed by a tech giant probably isn’t contributing much to society.
</blockquote>
<h3>Introduction</h3>
They start by identifying the advantages and disadvantages the incumbents possess in their efforts to monetize innovations. Their list of advantages is:<br />
<ul>
<li>"large incumbents can take advantage of economies of scale" not just in manufacturing, but also in marketing an distribution by exploiting their existing customer relationships.</li>
<li>"Large incumbents can also take advantage of economies of scope. Innovation creates “involuntary spillovers”—new knowledge that has economic value beyond the specific product that the firm was developing."</li>
<li>"large incumbents can access capital at a lower cost" for example from retained earnings from their cash cows.</li>
<li>"large incumbents may have another potential advantage—a longer investment time horizon" even more so now with the compression of VC time horizons.</li>
</ul>
Their list of incumbents disadvantages in innovation is more interesting:<br />
<ol>
<li>"their success will cannibalize their own market share" or "More generally, a monopolist has diminished incentives to introduce new products, improve product quality, or lower prices because any new sales generated replace its existing sales." Economists call this "Arrow's replacement effect"; more specifically: "The general lesson is, all else equal, the larger a firm’s market share and the less it is threatened by competition, the weaker its incentives to innovate. So we should expect large incumbents to not innovate much. And if they can dispense with the competitors rather than have to compete with them, they will do that."</li>
<li>"their managers prefer to deliver incremental innovations to their existing customers". Unlike Arrow's theory, "Christensen’s theory of disruptive innovation, ... focuses on the career incentives of middle managers ... Incumbent managers have an incentive to deliver sustaining innovations—incremental improvements in quality to the firm’s existing products that will please its existing customers. But they have substantial disincentives to pursue projects that upset the apple cart, even if doing so would bring new customers to the firm" The fundamental problem is that "Housing an innovation project inside a firm with diverse lines of business creates conflict with those other businesses. Some firm assets—cash, cloud computing, equipment, facilities, and engineers’ time—are rivalrous and finite, so executives must be willing to fight internal constituencies to devote those resources to innovation." Ingenuity, NASA's wildly successful Mars helicopter is a good example, as Eric Berger reports in <a href="https://arstechnica.com/space/2024/02/before-ingenuity-ever-landed-on-mars-scientists-almost-managed-to-kill-it/"><i>Before Ingenuity ever landed on Mars, scientists almost managed to kill it</i></a>. It was competing for cost, weight and risk with Perseverance's primary mission.</li>
<li>"their single veto point decision-making structure encourages risk-aversion" More specifically: "Inside a large incumbent, decisions about whether to fund an innovative project must pass through one veto point. In the venture capital market, many competing investors independently decide whether to finance an innovative idea. Inside a firm, an employee with an innovative idea must pitch an idea to managers who ultimately report to one executive gate-keeper. In the venture capital market, if a would-be startup founder pitches an idea to ten VC firms, and nine of them are not persuaded, the idea gets funded." The advantage of market-based finance over internal finance applies not just to the initiation but also the continuation of an innovation project. Inside a firm, an executive who has soured on a project can terminate it. In the venture capital market, when a startup’s initial investors grow skeptical, the company can still pitch outsiders on infusing more cash." The authors make this important point (my emphasis): "And while economists often describe markets as efficient, there is no reason to believe individual corporate executives make efficient (or even rational) decisions. Just ask Twitter. Markets work not because private executives make good decisions but because the ones who make bad decisions get driven out. But <b>that dynamic only works with competition</b>."</li>
<li>"they cannot appropriately compensate employees working on innovation projects." The reason they cannot is that: "Startups solve this problem by giving employees stock options. Every employee with significant equity knows that if the startup successfully exits, they will be rewarded. Stock in a large, diversified public company does not create similar incentives. The incentives are diluted because the value of the stock will be affected by too many variables unrelated to the success of the specific innovation project." And that: "large firms do not recognize internal “property rights” to innovations that employees develop. If they did, employees might become reluctant to share information. But not protecting internal property rights gives innovative employees incentive to leave. If employees at a large firm found their own startup and raise venture capital to fund it, they will earn a much greater share of the profits of the innovation."</li>
</ol>
The authors go on to describe five techniques incumbents use to neutralize the threat of disruption that innovative startups might pose; network effects, self-preferencing, paying for defaults, cloning, and coopting the disruptor. They claim other have described the first four, but they don't amount to an adequate explanation for why the tech giants haven't been disrupted. I will summarize each of the four in turn..
<h3>Network effects</h3>
Nearly three decades ago W. Brian Arthur, in <a href="http://www.amazon.com/Increasing-Returns-Dependence-Economics-Cognition/dp/0472064967"><i>Increasing Returns and Path Dependence in the Economy</i></a> explained how increasing returns to scale, or network effects, of technology markets typically led to them being dominated by one player. Consider a new market opened up by a technologcal development. Several startups enter, for random reasons one gets bigger then the others, network effects amplify its advantage in a feedback loop.<br />
<br />
This effect is more important now, as the the giants' business models have evolved to become <a href="https://dx.doi.org/10.2139/ssrn.4713845">platforms</a>:<br />
<blockquote>
The tech giants’ core businesses are built on platforms. A platform is an intermediary in a two-sided market. It connects users on one side of the market with users on the other side for transactions or interactions.<br />
...<br />
Platforms tend to exhibit network effects—the addition of a new user increases the value of a platform to existing users and attracts new users.
</blockquote>
This is precisely the mechanism Brian Arthur described, but applied to a business model that has since been enabled by the Internet.<br />
<h3>Self-preferencing</h3>
Self-preferencing happens when a platform isn't just a two-sided market, but one in which the platform itself is <a href="https://dx.doi.org/10.2139/ssrn.4713845">a vendor</a>:<br />
<blockquote>
Amazon, for example, both invites third party vendors to sell their products in its online marketplace and sells its own house brands that compete with those vendors. Amazon has a powerful advantage in that competition. It has access to data on all of its competitors—who their customers are, which products are selling well, and which prices work best. And it controls which ads consumers see when they search for a specific product. Assuming Amazon uses that information to prefer its own products to those of its competitors (either by pricing strategically or by promoting its own products in search results) – something alleged but not yet proven in a pending antitrust case -- the result is to bias competition. Vendors cannot realistically protest Amazon’s self-preferencing (or just go elsewhere) because Amazon has such a dominant share in the online retail market.
</blockquote>
<h3>Paying for defaults</h3>
The value of the default position is notorious <a href="https://dx.doi.org/10.2139/ssrn.4713845">because</a>:<br />
<blockquote>
Alphabet pays Apple a reported $18 billion (with a b) each year for Google to be the default search engine on iOS devices. Android and iOS together account for 99% of the U.S. mobile operating system market. Consequently, almost everyone who uses a smartphone in America is accustomed to Google search. Alphabet claims that “competition is just a click away.” But research and experience have shown that defaults can be somewhat sticky. So controlling the default position can give Alphabet (or whoever wins the Apple bid) an advantage. That said, someone has to be the default, and it might be better for consumers if the default is the search engine most users already prefer. The real problem might be the idea of paying for placement, whoever wins the bidding war.
</blockquote>
<h3>Cloning</h3>
There are many examples of a tech giant tryng to neutralize the threat from a startup by using the threat of cloning their product to force the startup to sell itself, or of actually cloning the product and using their market power to swamp the startup. Meta's addition of Reels to Instagram in response to Tik Tok is an obvious example. But the authors make two good points: <a href="https://dx.doi.org/10.2139/ssrn.4713845">First</a>:<br />
<blockquote>
Cloning is only objectionable if the tech giant wins out not by competition on the merits, but by exclusionary conduct.
</blockquote>
Second, that cloning <a href="https://dx.doi.org/10.2139/ssrn.4713845">often fails</a>:<br />
<blockquote>
Google+, Google’s effort to build a social media service that combined the best of Facebook and Twitter was an abject failure. Apple’s effort to control the music world’s move to streaming by offering its own alternative to Spotify hasn’t prevented Spotify from dominating music streaming and eclipsing the once-vibrant (and Apple-dominated) market for music downloads. Meta’s effort to copy Snap, then TikTok, by introducing Stories and Reels has not proven terribly successful, and certainly has not prevented those companies from building their markets.
</blockquote>
The fact that the giants can clone a startup's product leads the authors to <a href="https://dx.doi.org/10.2139/ssrn.4713845">ask</a>:<br />
<blockquote>
If the product is cloneable, then why would you buy the company and burn cash paying off its VCs?
</blockquote>
There are two possible answers. It may be faster and easier, though likely not cheaper, to "acquihire" the startup's talent than to recruit equivalent talent in the open market. Or it may be faster and easier, though likely not cheaper, to acquire the company and its product rather than cloning it.
<h3>Inadequate Explanation</h3>
The authors use the example of <a href="https://dx.doi.org/10.2139/ssrn.4713845">Microsoft</a>:<br />
<blockquote>
Microsoft enjoyed strong network effects in the 1990s as the dominant maker of operating system software – far more dominant than it is today. It cloned internet browser technology from upstarts like Netscape, and it engaged in anticompetitive conduct designed to ensure that it, not Netscape, became the browser of choice.82 But Microsoft’s victory over Netscape was short-lived. New startups – Mozilla and then Google – came out of nowhere and took the market away from it. Microsoft still benefits from network effects, and it still uses cloning and self-preferencing to send users to its Edge browser. But it doesn’t work. Microsoft employed all the tools of a dominant firm in a network market, but it still faced disruption.
</blockquote>
So these four techniques aren't an explanation for the recent dearth of disruption.<br />
<h3>Coopting disruption</h3>
The authors imagine themselves as a tech giant, asking what else they would do to prevent disruption, and coming up with <a href="https://dx.doi.org/10.2139/ssrn.4713845">four new techniques</a>:<br />
<ul>
<li>"First, you would learn as much as you can about which companies had the capability to develop disruptive innovations and try to steer them away from competing with you – perhaps by partnering with them, or perhaps by investing in them."</li>
<li>"Second, you would make sure that those companies could not access the critical resources they would need to transform their innovation into a disruptive product."</li>
<li>"Third, you would tell your government relations team to seek regulation that would build a competitive moat around your position and keep disruption out."</li>
<li>"Fourth, if one of the companies you were tracking nevertheless did start to develop a disruptive product, you would want extract that innovation—and choke off the potential competition—in an acquisition."</li>
</ul>
These are the techniques they call "coopting disruption", pointing out that the <a href="https://dx.doi.org/10.2139/ssrn.4713845">tech giants have</a>:<br />
<ul>
<li>"built a powerful reconnaissance network covering emerging competitive threats by investing in startups as corporate VCs and by cultivating relationships with financial VCs."</li>
<li>"accumulated massive quantities of data that are essential for many software and AI innovations, and they dole out access to this data and to their networks selectively."</li>
<li>"asked legislators to regulate the tech industry—in a way that will buttress incumbents."</li>
<li>"repeatedly bought potentially competitive startups in a way that has flown—until a few years ago—below the antitrust radar."</li>
</ul>
The authors detail many examples of each of these techniques, for example Facebook conditioning access to user data on the purchase of advertising, and Google's purchase of DoubleClick and YouTube. Interestingly, they contrast the recent purchasing of the tech giants with Cisco's famously successful purchases in the 90s:
<blockquote>
The Cisco story exemplifies how the venture capital market, as a market, is better at exploring a series of risky ideas than a firm with a single risk-averse gatekeeper. It also illustrates how the advantages of a large incumbent—in this case access to markets and existing customer relationships—can sometimes extract more market value out of a technology than a new entrant.
</blockquote>
The rapid evoluution of networking technology at the time meant that even Cisco, the largest company in the market, didn't have the R&D resources to explore all the opportunities. They depended upon VCs to fund the initial explorations, rewarding them by paying over the odds for the successes. Their market power then got the successes deployed much faster than a startup could.<br />
<h3>Why Is Cooption Bad?</h3>
The authors explain the <a href="https://dx.doi.org/10.2139/ssrn.4713845">harms of cooption</a>:<br />
<blockquote>
Our claim here is that the same dynamics that inhibit disruptive innovation by longstanding employees of large incumbents inhibit disruptive innovation by new employees from acquired startups.<br />
...<br />
The tech giants win from coopting disruption even though it destroys social value. In fact, they benefit in two ways. They make faster incremental progress on the sustaining innovations that they want. They get the new code, the valuable intellectual property, and the fresh ideas of the startup. And, critically, they also kill off a competitor. They no longer have to worry about the startup actually developing the more disruptive innovation and leapfrogging them or other tech giants acquiring the startup and using its assets to compete with them.
</blockquote>
And, by making the innovators from the startup rich, the acquirer greatly reduces their incentives for future innovation. <a href="https://blog.dshr.org/2024/02/the-stanford-digital-library-project.html">Andy Bechtolsheim</a> is an outlier.<br />
<h3>Remedies?</h3>
Lemley and Wansley, who seem to think in fours, make a set of four proposals for how these harms might be reduced:<br />
<ul>
<li><b>Unlocking Directorates</b> — under the Clayton Act "interlocking officers and directors between companies that compete, even in part, are illegal <i>per se</i> – without any inquiry into whether the companies in fact restrained competition because of their overlapping interests or whether the conduct offered procompetitive benefits." Companies with less than $4.1M in revenue are exampt, which excludes most startups; this should be revised.</li>
<li><b>Limiting Leveraging of Data and Networks</b> — "we would impose on incumbent tech monopolists a presumptive duty of nondiscrimination in access where the defendant (1) provides or sells data or network access to at least some unaffiliated companies and (2) refuses to provide or sell the same data or network access to the plaintiff company on comparable terms, but (3) the plaintiff does not operate a competing network or otherwise compete with the defendant in the market from which it collected the relevant data."</li>
<li><b>Regulating Regulation</b> — "Done right, regulation of technology can be beneficial and even necessary to the development of that technology, minimizing the risk of harm to third parties and ensuring that the world views the technology as safe and trustworthy. But all too often regulation has become a way to insulate incumbents from competition, with predictable results." The authors' suggestions exemplify this difficulty, being rather vague and aspirational.</li>
<li><b>Blocking Cooptive Acquisitions</b> — this is the most complex of the four proposals, and builds on <a href="https://www.jstor.org/stable/45467503"><i>Nascent Competitors</i></a> by C. Scott Hemphill & Tim Wu, who write:<br />
<blockquote>
We favor an enforcement policy that prohibits anticompetitive conduct that is reasonably capable of contributing significantly to the maintenance of the incumbent s market power. That approach implies enforcement even where the competitive significance of the nascent competitor is uncertain.
</blockquote>
Justifying blocking mergers because of a nascent threat that might never materialize is problematic. But it is only more so than the current way anti-trust works, by projecting likely harm to consumer welfare, which also might never materialize (although it almost always does). Lemley and Wansley explain <a href="https://dx.doi.org/10.2139/ssrn.4713845">the dilemma</a>:<br />
<blockquote>
antitrust enforcers need a strategy for blocking cooptive acquisitions that works within existing case law (or plausible improvements to that law) and is surgical enough to avoid chilling investment.
</blockquote>
Some cases are <a href="https://dx.doi.org/10.2139/ssrn.4713845">obvious</a>:<br />
<blockquote>
For cooptive acquisitions like Facebook/Instagram deal, we think Hemphill and Wu’s strategy makes sense. Zuckerberg’s email arguing for acquiring startups like Instagram because “they could be very disruptive to us” is a smoking gun of anticompetitive intent.
</blockquote>
But Lemley and Wansley go further, arguing for blocking megers based the startup's ability to <a href="https://dx.doi.org/10.2139/ssrn.4713845">innovate distruptive technology</a>:<br />
<blockquote>
Of course, an approach to policing startup acquisitions based on innovation capabilities need limits. Many startups have some innovation capabilities that could have a significant effect on competition. We can cabin enforcement in three ways—by focusing on specific technologies and specific firms and by looking at the cumulative effects of multiple acquisitions.
</blockquote>
Their examples of technologies include generative AI and virtual and augmented reality, both cases where it is already too late. The companies they identify "Alphabet, Amazon, Apple, Microsoft, and Meta" are all veterans of multiple acquisitions in these areas. But they argue that committing to
<a href="https://dx.doi.org/10.2139/ssrn.4713845">challenge fuure mergers</a>:<br />
<blockquote>
would create socially desirable incentives for startups. A startup developing one of the listed technologies would gain stronger incentives to turn its innovations into the products that its management team believed would garner the highest value on the open market—rather than the one most valuable to the tech giants. They would also gain stronger incentives to build a truly independent business and go public since an acquisition by the tech giants would be a less likely exit.
</blockquote>
</li>
</ul>
I think these would all be worthwhile steps, and I'm all in favor of updating anti-trust law and, even better, actually enforcing the laws on the books. But I am skeptical that the government can spot potentially disruptive technologies before the tech giants spot and acquire them. Especially since the government can't be embedded in the VC industry the way the tech giants are. Note that many of the harms Lemley and Wansley identify happen shortly after the acquisition. Would forcing Meta to divest Instagram at this late date restore the innovations the acquisition killed off?<br />
David. (noreply@blogger.com)
https://blog.dshr.org/
In the Library, With the Lead Pipe: Forming and Sustaining a Community of Practice for Volunteer-Based EDI Work
https://www.inthelibrarywiththeleadpipe.org/?p=11407
2024-02-21T18:08:21+00:00
<h2 class="wp-block-heading">In Brief</h2>
<blockquote class="wp-block-quote">
<blockquote class="wp-block-quote">
<p><em>Equity, Diversity, and Inclusion (EDI) are essential to the preservation of intellectual freedom (American Library Association). Yet some Library and Information Science scholars argue that EDI work within libraries is not evolving significantly or rapidly enough. Using our work in building the Diverse BookFinder Community of Practice as an example, we highlight overarching principles that can guide EDI professional development towards greater effectiveness and sustainability. Sharing concrete strategies and examples of how to keep community and the community’s shared purpose at the forefront of a volunteer-based EDI program, we posit that with only minimal adjustments, our model can be adapted to fit other EDI work, whether it’s focused on sustaining the efforts of an external volunteer group or on supporting and sustaining the crucial and everyday EDI work of librarians in collection development, programming, and committee building. We end the manuscript with a checklist designed to support the development of EDI training. </em></p>
</blockquote>
</blockquote>
<p>At the <a href="https://diversebookfinder.org/">Diverse BookFinder</a> (DBF), we work to move the diverse books discussion beyond increasing the number of books (see Aronson et al.) to a deeper consideration of <em>how</em> Black and Indigenous people and People of Color (BIPOC) are represented <em>within </em>diverse books. To accomplish this change, we’ve cataloged and analyzed thousands of trade picture books published or distributed in the United States (including various Canadian publishers) since 2002 to surface and create a one-of-a-kind resource. </p>
<p>In 2020, the DBF received funding from the Institute of Museum and Library Services (IMLS) to “reach up” and include all of children’s literature in our work: picture books (generally ages 3-8), early readers (5-9), middle grade (7-12), and young adult books (12-18). As we worked to expand, it became clear that we would require an exponentially larger group of people to read and analyze texts. Practically, this meant that the DBF needed to establish a volunteer-based community of learners who could be trained in the specific methods of DBF book analysis and who would be invested enough in the project to provide continued participation. In response, we created and sustained a Community of Practice (CoP). </p>
<p>As we worked through the process of creating, training, and sustaining a diverse, volunteer-based community of learners, we discovered that we were creating a model for sustainable Equity, Diversity, and Inclusion (EDI) work that is missing in the field of librarianship: a model that chips away at traditional systemic and institutional barriers to create truly inclusive and collaborative working partnerships. Using our work in building the DBF CoP as an example and drawing together interdisciplinary research and practice, we highlight overarching principles that can guide EDI professional development towards greater effectiveness and sustainability. We share concrete strategies and examples of how to keep community and the community’s shared purpose at the forefront of a volunteer-based EDI program. Furthermore, we posit that with only minimal adjustments, our model can be adapted to fit other EDI work, whether it’s focused on sustaining the efforts of an external volunteer group or on supporting and sustaining the crucial and everyday EDI work of library professionals in collection development, programming, and committee building.</p>
<h2 class="wp-block-heading"><strong>Why is the DBF Work Important to Libraries? </strong></h2>
<p>Providing patrons with access to diverse books is central to librarianship and the role of library professionals who have an obligation to maintain collections that represent the experiences, interests, and needs of historically marginalized communities (“Diverse Collections”). Furthermore, longstanding disparities in publishing have created a need for library professionals to be more intentional about their selection process (see Cummins). Library professionals must identify and select books that provide visual and textual representations of diverse characters across various forms and genres. They must also identify and select books that depict diverse characters in culturally relevant ways. This task is further complicated by recent attempts to ban library books that highlight the unique experiences of BIPOC and LGBTQIA+ communities. In 2022, the American Library Association (ALA) tracked 1,269 book challenges, the highest number yet, mostly aimed at removing diverse books (“American Library”). These challenges are harmful to EDI work in libraries because they can exacerbate existing inequities present within library collections. </p>
<p>In light of these challenges, ALA leaders have taken the position that equity, diversity, and inclusion are essential to the preservation of intellectual freedom. Despite this position, some Library and Information Science scholars argue that EDI work within libraries is not evolving significantly or rapidly enough, due to the lack of diversity that’s prevalent within the library workforce and<em> </em>library collections (Dali and Caidi). Additionally, libraries articulate diversity as a core value but have not developed methodologies that would align practice with professional values (see Espinosa de los Monteros and Enimil; Dali and Caidi).</p>
<p>We therefore argue that in order to maintain the cadence of EDI work, library professionals must be intentional about their approach to collection development and management. Without intentionality, we fear that EDI work may continue to evolve slowly. According to Dr. Martin Luther King Jr., “Justice too long delayed is justice denied” (839). </p>
<p>The work of the DBF is beneficial to libraries in two ways. At a granular level, the DBF can support selection decisions related to diverse content. Through the <a href="https://cat.diversebookfinder.org/?_ga=2.89939931.1314262628.1695995481-407329941.1695995479">DBF Collection Analysis Tool (CAT)</a>, libraries can access a snapshot of their collections in order to determine where diversity gaps exist in terms of the representation and presentation of BIPOC communities. Also, the metadata undergirding the DBF work provides a shared EDI language specific to children’s literature, potentially making it easier for all library professionals, BIPOC community members, and allies to talk about EDI in children’s literature writ large.</p>
<p>Second, the DBF CoP serves as a model for recruiting, engaging, and sustaining large groups of library professionals in diverse collection development practices and other EDI activities. Our model achieves measurable outcomes and encourages collaboration among library workers and/or volunteers from different cultures, ethnicities, and backgrounds, with varying levels of professional experience and types of expertise. </p>
<h2 class="wp-block-heading"><strong>Literature Review and Theoretical Framework for Our Work</strong></h2>
<p>Our leadership group grounded the design and implementation of the CoP in feminist pedagogical theories that have been developing for over 25 years. In a feminist classroom, students and teachers work together to achieve mutual goals through “collaboration, community building and validating knowledge based on experience” (McCusker 445). In developing our training plan and throughout the training program, we placed significant emphasis on personal lived experiences and translating those experiences into learning opportunities to effect social change. Rather than insisting on a traditional academic model that centers expertise with a clear head of the classroom, we chose to create an environment with a shared responsibility for learning between facilitators and learners since we had much to learn from one another’s unique positionalities (Tedesco-Schneck 267, Grissom-Broughton 166). Having multiple voices involved in planning, during the training program, and in interpreting DBF’s metadata allowed us to decenter authority and power, a necessary condition for EDI work. We also provided ample opportunities for continual reflexive practices in order to analyze intersections of oppression and how these intersections play out in our reading of texts (Grissom-Broughton 171, McCusker 456). </p>
<p>Another crucial aspect to our model was a pedagogy of care, which involves “an approach based on an ethic of care as both a moral imperative and pedagogical necessity (Gay, 2018)” (Barek et al). Pedagogy of care theories stress the relationship between teacher and student with an emphasis on mutual respect and authentic dialogue with compassion, reciprocity, and positionality. Focusing on our learners from a perspective of radical compassion, in which we try to relieve causes of distress and discomfort, allowed all of us, facilitators and learners alike, to center radical self-care (Ravitch 6). In doing so, we could “lovingly revise parts of ourselves as a necessary dimension of our work to re-envision and reconstruct the world from a perspective of equity, social identity, and liberation” (Ravitch 6). </p>
<p>Upon reflection after the initial training program was complete, we saw clear connections between meaningful learning and our initial roots, intellectual partners, and intentions. After all, meaningful learning occurs when “learners are active, constructive, intentional, cooperative, and working on authentic tasks” (Jonassen 49). In particular, our focus was intentional and goal-oriented with an authentic task of coding approximately 2,380 books within a year, so that library professionals and readers could make more informed decisions about book selection. In order to support meaningful learning, our training program was inherently cooperative to help all of us, facilitators and learners, solve problems and generate new knowledge. Like a feminist classroom, “these characteristics are interrelated, interactive, and interdependent” (Jonassen 51). </p>
<p>However, we also felt that meaningful learning and some of the other frameworks we drew upon were more limited in their depictions of the relational aspects of learning and teaching than what we were striving for in our model. Living up to the expectations of facilitators and learners and developing authentic, honest, and caring relationships are essential to the reparative work often involved in EDI projects and partnerships. Traditionally, student-teacher relationships have been viewed from a binary positive-negative emotional response, but studies relying on this binary “do not consider the interplay between the emotions of student-teacher relationship and the cultural and social organization of interaction” (Tormey 994). Our relationships to others are influenced by the implicit biases we all carry with us and bring to our perceptions of others. Thus, in preparing an EDI focused training program, it’s necessary to fully understand the relational aspects of how people learn and how people teach. Relationally, there are multiple levels of engagement in a training program: between facilitators; between learners; between learners and facilitators; and between learners, facilitators, and the materials under analysis.</p>
<p>Each of these relational levels requires attention not only to others but also to ourselves. Kathleen M. Quinlan outlined different relational levels in the classroom and noted that “education is relational, and emotions are central to relationships. … how we feel <em>with </em>and <em>about </em>others are central to the quality of our relationships” (102). In maintaining expectations and authentic, caring relationships, we create a relational third space for action, thought partnership, and empathy (Ravich 4).</p>
<h2 class="wp-block-heading"><strong>Planning and Recruitment</strong></h2>
<p><strong>Leadership Team </strong></p>
<p>When developing an EDI project in librarianship, the leadership team is responsible for the planning, promotion, and implementation of the project’s objectives, so determining the members of this team is crucial. While the DBF has multiple teams working on various aspects of the project, the CoP Advisory Group (CoP AG) is a team of seven (originally eight) members from a variety of professional backgrounds, lived experiences, and positionalities. Two of the original DBF founders and a former DBF project manager are a part of our group, each bringing significant experience in applying the specific methods of DBF book analysis to picture books. Four new members joined the group as part of the expansion into early readers, middle grade, and young adult literature. This combination not only allowed for cohesion between the two phases of the database but also for flexibility in considering new interpretations of the DBF’s metadata and diverse children’s literature and audiences. </p>
<p>Academically, our group members are experts in psychology, librarianship, children’s literature, and gender and sexuality studies, and one is an award-winning author-illustrator of children’s books. Most importantly, however, each member brought experience in and with EDI work from various vantage points, whether as BIPOC and/or with expertise in working with minoritized populations. Collectively, we were grounded in interdisciplinary feminist, critical race, anti-racist, gender, and sexuality theories. Since our individual thought partners and lived experiences varied, each member brought an invaluable, unique perspective to EDI work and a shared respect for discussion, collaboration, and willingness to learn and work by consensus. </p>
<p><strong>Planning for a Virtual Experience</strong></p>
<p>After creating the leadership team, an integral part of the planning process included preparing a program that would function well as an entirely virtual experience. We knew we would be recruiting volunteers from across the United States and Canada and that our volunteers would be coming from a variety of backgrounds with a wide array of scheduling needs. This meant that we wanted to be as intentional as possible in creating a training structure that would cater to the greatest number of participants. Planning for this reality involved three major components: creating a flexible training structure; encouraging consistency and active learning; and creating and providing easily accessible training materials. </p>
<p>To allow for a variety of schedules and time zones, we focused on building a training course structure that was intentionally flexible. Dividing the training sessions into Large Core Classes (facilitator-led instruction) and Small Group Sessions (group-led discussion) allowed us to provide a training that could be both asynchronous (the recorded Large Core Classes) and synchronous (Small Group Sessions). In addition to providing flexibility for scheduling, dividing the training into these two types of experiences also furthered our goal of following a feminist pedagogical model that decenters expertise by allowing us to place equal importance on both facilitator-imparted knowledge (Large Core Classes) and group learning through more personal interactions and discussions (Small Group Sessions).<sup><a class="footnote-link footnote-identifier-link" href="https://www.inthelibrarywiththeleadpipe.org/2024/volunteer-edi-work/#footnote_0_11407" id="identifier_0_11407" title="During the seven-week training program, learners and facilitators participated in weekly Large Core Classes and Small Group Sessions. After the training program ended, we continued to hold monthly Small Group Sessions to discuss new coding questions and share insights and approaches. The monthly Small Group Sessions provided ongoing consistency and community and continued our collaborative approach to learning from one another.">1</a></sup></p>
<p>Furthermore, the synchronous sessions were offered on various days and times throughout the week, and participants were invited to select which session would work best for them rather than being assigned a session. This attention to flexible scheduling encouraged engagement and prevented scheduling conflicts from prohibiting participation, which made the involvement of such a large group of volunteers sustainable. In October 2022, we started our first CoP with 76 volunteers and by August 2023, we retained 63 volunteers, with the departing volunteers leaving for various personal reasons rather than for reasons related to our shared work. Of the remaining 63 volunteers, 37 expressed interest in continuing work with the DBF even after their initial one-year term was completed. </p>
<p>Throughout this process, it was necessary for the CoP facilitators to support the work of incoming coders and to ensure that the overall scope of our work was meaningful and produced tangible results. One way in which we fostered these goals was through an emphasis on consistency. With such a large group of learners being split among various Small Groups centered on discussion, we wanted to make sure that each learner received consistent messaging and training. To this end, facilitators met weekly. During these weekly meetings, facilitators reviewed how the prior week’s class and discussion sessions had gone and considered how we could best create ongoing active learning opportunities that would enrich and reinforce the knowledge constructed during the Large Core Classes. This constant loop of feedback between the facilitators, as well as between the facilitators and learners, helped us to create spaces where participants had consistent structure and support and thus felt comfortable engaging in sensitive dialogue around topics of race, culture, and identity.</p>
<p>Once the course structure and schedule were finalized, it was also important to provide training materials and instructions in such a way that they would be available to all participants, regardless of when they were working on their assigned tasks. Thus, we developed our training materials using the Google Suite of products which suited our need for software that provides user friendly and accessible collaboration at no cost. With Google Sites, we created both an online instructional manual and a “Training Base Camp,” which served as a resource center for learners and facilitators. The base camp stored all the important documents, links, and forms that a learner or facilitator might need to access during the training program and their year of coding. We also used Google Forms to create “Question Submission Forms” so that participants had the opportunity to submit questions on a rolling basis without having to wait for the next session. This created a loop of constant feedback between the learners and facilitators that was both cooperative and flexible. </p>
<p><strong>Recruitment</strong></p>
<p>Once a flexible and adaptive course structure was established, we turned our attention to volunteer recruitment. Given our focus on diversity and inclusion and our goal of creating a widely diverse CoP, we guided our recruitment efforts towards library organizations that included participants who already had some experience with diversity in children’s literature and with metadata. We also focused our efforts on recruiting volunteers from professional organizations that already had a stated diversity focus.<sup><a class="footnote-link footnote-identifier-link" href="https://www.inthelibrarywiththeleadpipe.org/2024/volunteer-edi-work/#footnote_1_11407" id="identifier_1_11407" title="In our recruitment efforts, we promoted the DBF CoP to members of the following library organizations: the American Library Association’s Ethnic and Multicultural Information Exchange Round Table (EMIERT); the Black Caucus of ALA (BCALA); the Association for Library Service to Children’s Equity, Diversity, and Inclusion Implementation Task Force; the American Association of School Librarians’ Diversity, Equity and Inclusion Community of Practice; REFORMA: The National Association to Promote Library and Information Services to Latinos and the Spanish Speaking; the Asian Pacific American Library Association (APALA); the American Indian Library Association (AILA); the Rainbow Roundtable (RRT); and the Association of Jewish Libraries (AJL). We also shared the opportunity with the Association for Library Service to Children (ALSC), the Young Adult Library Services Association (YALSA), and divisions of the National Council of Teachers of English (NCTE).">2</a></sup></p>
<p>Our strategic goal of recruiting diversity-minded individuals was reflected in the creation of our application materials, as well as where we shared them. The short-answer questions on the application asked potential participants to reflect on the importance of diversity in children’s literature and on how their own identities and positionalities might influence how they interact with literature and other participants. By intentionally directing our recruitment strategy and materials towards library professionals already engaged in EDI work, we aimed to create a group that was both diverse and already familiar with some of the concepts addressed in our training. We asked applicants to provide a resume, and the application included an optional <a href="https://drive.google.com/file/d/1U3B3Cx7yxYu2wFnjSwFDvwpy-1eIm2zv/view?usp=sharing">Lived Experience survey</a> through which they could disclose their races and ethnicities, as well as a number of other identities, such as gender, sexuality, religion, and ability status. </p>
<p>The success of our recruitment efforts can be seen through both the professional and demographic diversity that was achieved within our participant cohort. Our first cohort from 2022-2023 included a wide array of academic, public, and school librarians, as well as students working towards a Master’s degree in Library and Information Sciences, hailing from 31 states and Canada. Of our 63 remaining CoP members, 48 completed the Lived Experience survey. Of these individuals, 26 self-identified as BIPOC (41%), whereas among US credentialed librarians as a whole, the percentage identifying as BIPOC is only 12% (“Diversity Counts”). </p>
<h2 class="wp-block-heading"><strong>Implementation and Responsiveness</strong></h2>
<p><strong>Facilitators and Learners</strong></p>
<p>We intentionally created a community of learning in which learning was an authentic, active, social process for all of us. As members of the CoP AG, we chose to call ourselves facilitators because we wanted to emphasize that none of us are – or can be – experts in all aspects of EDI work, particularly as EDI work is constantly evolving. Just as members of our advisory group learned continually from one another, we knew we would learn from the cohort members, too, particularly if we invited them to bring their whole selves to the program and share their insights and expertise. We referred to the new cohort members as learners, but they quickly became co-facilitators, helping to shape the training program and some of the coding work itself.</p>
<p>From our own experiences, we knew that learning all the DBF terminology and identifying the categories and tags in books takes time and can initially feel overwhelming, so we scaffolded the training, with the overall coding process broken down into smaller, more manageable sections. Learners developed their coding skills and knowledge over multiple weeks, with new sections of coding added in consecutive weeks and plenty of time for questions and connection building between sections. We also practiced coding with multiple books of varying genres, formats, and intended audiences. We provided the reading assignments in advance, so learners could incorporate the work into their already busy schedules.</p>
<p>Within this carefully designed training structure, we also incorporated flexibility. As anticipated, the work evolved based on our learning community’s feedback and needs. We provided written and oral feedback opportunities, both through formal surveys and informal discussions in Small Group Sessions, and we quickly responded to feedback. For example, several weeks into the first training, we centered the Small Group Sessions even more fully on discussions of book coding and specific questions that arose rather than reviewing material from the Large Core Classes, and we lengthened the training program for the second cohort based on input from the first cohort. </p>
<p>As learners expressed their fears of coding “incorrectly,” we increased our refrain about there being no one “right” way to approach EDI issues through book coding; we all code based on the evidence we find in the books and our lived experiences. Our training focused on providing consistent information, guidance, and messaging, and part of our recurrent messaging was that our diversity of experiences and lenses would lead to some valid, different interpretations of material. Through discussions about how and why we coded a book, we learned from one another’s positionalities and perspectives and were able to perceive new ways of interpreting the books and the DBF terminology.</p>
<p><strong>Responsivity</strong></p>
<p>At the start, we sought to create an inclusive, compassionate, affirming, and humanizing learning environment. Initiating our work with a growth mindset, we discussed the difference between safe spaces and brave spaces (see Arao and Clemens), inviting everyone to embrace the challenges inherent to EDI work and consider what they needed to do so. As a first step, we introduced the community agreement created by the <a href="https://www.ala.org/alsc/">Association for Library Service to Children</a> (ALSC) in the first Large Core Class, asking learners to consider its elements and propose revisions, additions, and/or amendments that would allow them to step into a brave space. The ALSC agreement and revision suggestions were reviewed in Small Group Sessions the following week. One suggestion was to clarify how we would identify and manage observations of something oppressive being said or done in the group. We discussed this revision as a leadership team, suggested language, then brought it back to the small groups for further discussion and elaboration before sharing it again in the Large Core Class. Once affirmed in the large group, the agreement was finalized and posted on our shared online training base camp. (See Appendix for the agreement.) The whole process was completed in two weeks. This important community building exercise let our learners understand that we saw them as partners, experts capable of contributing to our shared work. It also provided support for our shared goal of authentic, active participation. It is interesting to note that we never had to return to or invoke the community statement, even though we engaged in numerous conversations about hot topics in children’s books, diversity, and librarianship together.</p>
<p>We also received feedback from those coding stories featuring Indigenous characters, including from those who identified as Indigenous, reporting that they weren’t able to generate what felt like complete and accurate summaries of books featuring Indigenous people. For instance, the metadata meant to capture religious or spiritual experiences was lacking. Using this feedback, we engaged in a larger project, inviting tribal librarians and others with knowledge, skill, and/or lived experience to participate in metadata revisions. The result was the conceptualization and vetting of multiple new tags in collaboration with CoP members and other experts nationally who were part of their networks. Moreover, the experience further conveyed our commitment to shared expertise, authenticity, and active participation, deepening our knowledge and relationships within and beyond the DBF group, leading to one CoP learner agreeing to become a co-facilitator in the 2023-2024 training program. </p>
<h2 class="wp-block-heading"><strong>Conclusion</strong></h2>
<p>Developing a sustained volunteer-based EDI program or sustained committee work related to EDI requires a multi-theoretical and dimensional approach that questions and begins to erode systemic and institutional barriers to integrative and collaborative working partnerships. In our approach, we used pedagogies of feminism and care and meaningful learning that allowed us to translate theory into practical application and move beyond the conversational and performative aspect of EDI work often seen in libraries. The success of our program, as well as the theories through which we formulated our training goals and structure, is exemplified through the continued involvement and commitment of our first volunteer cohort and their expressed comfort in communicating and learning with our facilitators.</p>
<p>As we build on our success and begin our training program with a new cohort, we continue to add greater structure to our CoP AG conversations and practices and focus on the most essential elements of our program.</p>
<ul>
<li>Integrate personal lived experiences into the learning environment and consider all the relational levels present to avoid imposing an artificial boundary between professional and self-knowledge.</li>
<li>When learners and facilitators express their needs, listen and respond carefully, trusting that people have good intentions and know what will most benefit and support them.</li>
<li>Allow flexibility for shifts in response to the cohort’s needs and use the ongoing reflection to intentionally and steadily move from contemplation to action.</li>
</ul>
<p>We hope that reflecting on the following questions will guide and enhance your work as you consider your next steps in creating and/or sustaining intentional and authentic EDI programs that challenge the status quo.</p>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<div id="guiding-questions" style="border: 2px solid #424242;">
<h2 class="wp-block-heading"><strong>Guiding Questions for a Community of Practice for Volunteer-Based EDI Work</strong></h2>
<p><strong>Forming and Supporting Your Leadership Team</strong></p>
<p>What knowledge and positionalities do members of your leadership team have? What knowledge and positionalities are lacking? Are you being honest about what you, as individuals and a group, know and what you don’t know? How will you fill in any identified gaps?</p>
<p>Have you built in time for facilitators to reflect individually and check in with each other throughout the program to ensure connection, alignment, and consistency?</p>
<p><strong>Recruiting Participants </strong></p>
<p>How will you recruit participants? How will your methods ensure a diverse group?</p>
<p>How will you invite program participants to engage fully and authentically?</p>
<p>What special considerations are needed to sustain participants from marginalized communities?</p>
<p><strong>Creating Your Program Structure </strong></p>
<p>How will you design your program to avoid life/logistical barriers to participation? </p>
<p>What methods for decentering authority and power between facilitators and learners and between learners are you utilizing? How are you making your intentions around this practice clear? </p>
<p>Who are your thought partners? What guiding frameworks will you use to inform program development?</p>
<p>How will you celebrate different positionalities, which are key components of a successful program?</p>
<p>How will you build in the flexibility to readily adjust your program, based on your learners’ and facilitators’ needs?</p>
<p><strong>Implementing and Evaluating Your Program</strong></p>
<p>How are you listening and responding to feedback? Are your learners able to see how you are listening and responding? Can they see their real-time impact on the work you are doing together?</p>
<p>What kinds of collection methods will inspire the most honest and complete feedback from participants to allow a full assessment of your program? How and when will you collect this feedback?</p>
<p>The Guiding Questions section of this article may be reused under a Creative Commons License:<br />
Guiding Questions for a Community of Practice for Volunteer-Based EDI Work © 2023 by Alteri, S., Aronson, K., Caponegro, R., Jamison, A., Laboy, L. is licensed under <a href="http://creativecommons.org/licenses/by-nc-sa/4.0/" rel="license">CC BY-NC-SA 4.0</a>.</p>
</div>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<h2 class="wp-block-heading"><strong>Acknowledgements</strong></h2>
<p><em>We would like to thank the other current members of the Diverse BookFinder Community of Practice Advisory Group, Anne Sibley O’Brien and Andrea Breau, as well as past member Marianne Williams, for envisioning and building this community of practice with us. We would also like to thank the other DBF team members and Community of Practice cohort members for all the work they do to make the DBF possible and accessible to users. Finally, many thanks to our reviewers Ikumi Crocoll and LaKeshia Darden and our editor Jaena Rae Cabrera for helping to shape our writing. We appreciate the time and labor that went into improving this article and connecting it with readers. </em></p>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<h2 class="wp-block-heading"><strong>References</strong></h2>
<p>“ALSC Community Agreements.” Association for Library Service to Children, 2020, </p>
<p><a href="https://www.ala.org/alsc/sites/ala.org.alsc/files/content/aboutalsc/governance/hndbk/ALS%20Community%20Agreements%2011.2020.pdf">https://www.ala.org/alsc/sites/ala.org.alsc/files/content/aboutalsc/governance/hndbk/ALS%20Community%20Agreements%2011.2020.pdf</a>. Accessed 13 Aug. 2023.</p>
<p>“American Library Association reports record number of demands to censor library books and materials in 2022. ” <em>American Library Association</em>, March 22, 2023, <a href="http://www.ala.org/news/press-releases/2023/03/record-book-bans-2022">www.ala.org/news/press-releases/2023/03/record-book-bans-2022</a>. Accessed 8 Sept. 2023.</p>
<p>Arao, Brian and Kristi Clemens. “From Safe Spaces to Brave Spaces: A New Way to Frame Dialogue around Diversity and Social Justice.” <em>The Art of Effective Facilitation: Reflections from Social Justice Educators</em>, edited by Lisa M, Landreman, Stylus, 2013, pp. 135-150.</p>
<p>Aronson, Krista Maywalt, Breanna D. Callahan, and Anne Sibley O’Brien. “Messages Matter: Investigating the Thematic Content of Picture Books Portraying Underrepresented Racial and Cultural Groups,” <em>Sociological Forum</em>, vol. 33, no. 1, 2018, pp. 165-185, <a href="https://onlinelibrary.wiley.com/doi/10.1111/socf.12404">https://onlinelibrary.wiley.com/doi/10.1111/socf.12404</a>. Accessed 8 Jan. 2024.</p>
<p>Barek, Hiba et al. “Pedagogies of Care in Precarity.” <em>MethodSpace</em>, <a href="https://www.methodspace.com/blog/pedagogies-of-care-in-precarity">www.methodspace.com/blog/pedagogies-of-care-in-precarity</a>. Accessed 8 Sept. 2023.</p>
<p>Cummins, June. “The Still Almost All-White World of Children’s Literature: Theory, Practice, and Identity-Based Children’s Book Awards.” <em>Prizing Children’s Literature: The Cultural Politics of Children’s Book Awards</em>, edited by Kenneth B. Kidd and Joseph T. Thomas, Jr., Routledge, 2017, pp. 87-103.</p>
<p>Dali, Keren and Nadia Caidi. “Diversity by Design,” <em>The Library Quarterly</em>, vol.<em> </em>87, no. 2, 2017, pp. 88-98.</p>
<p>Davis, Angela et al. <em>Abolition. Feminism. Now.</em> Chicago: Haymarket, 2022. </p>
<p>“Diverse Collections: An Interpretation of the Library Bill of Rights.” <em>American Library Association</em>, July 26, 2006, <a href="http://www.ala.org/advocacy/intfreedom/librarybill/interpretations/diversecollections">www.ala.org/advocacy/intfreedom/librarybill/interpretations/diversecollections</a>. Accessed 8 Sept. 2023.</p>
<p>“Diversity Counts.” American Library Association, <a href="https://www.ala.org/aboutala/offices/diversity/diversitycounts/divcounts">www.ala.org/aboutala/offices/diversity/diversitycounts/divcounts</a>. Accessed 20 Aug. 2023.</p>
<p>Espinosa de los Monteros, Pamela and Sandra Enimil. “Diversity, Equity, and Inclusion in Action: Designing a Collective DEI Strategy with Library Staff.” <em>Diversity, Equity, and Inclusion in Action: Planning, Leadership, and Programming</em>, edited by Christine Bombaro, ALA Editions, 2020, pp. 13-27. </p>
<p>Grissom-Broughton, Paula A. “A Matter of Race and Gender: An Examination of an Undergraduate Music Program Through the Lens of Feminist Pedagogy and Black Feminist Pedagogy.” <em>Research</em> <em>Studies</em> <em>in</em> <em>Music</em> <em>Education</em>, vol. 42, no. 2, 2020, pp. 160-176.</p>
<p>Jonassen, D. H. “Externally Modeling Mental Models.” <em>Learning</em> <em>and</em> <em>Instructional</em> <em>Technologies</em> <em>for</em> <em>the</em> <em>21st</em> <em>Century</em>: <em>Visions</em> <em>for</em> <em>the</em> <em>Future</em>, edited by Leslie Moller, Jason Bond Huett, and Douglas M. Harvey, Springer, 2009, pp. 49-74.</p>
<p>King, Jr., Martin Luther. “Letter from Birmingham Jail.” 16 April 1963. Reprinted in <em>UC Davis Law Review</em>, vol. 26, no. 4, 2023, pp. 835-851, <a href="https://lawreview.law.ucdavis.edu/archives/26/4/letter-birmingham-jail">https://lawreview.law.ucdavis.edu/archives/26/4/letter-birmingham-jail</a>. Accessed 8 January 2024. </p>
<p>McCusker, Geraldine. “A Feminist Teacher’s Account of Her Attempts to Achieve the Goals of Feminist Pedagogy.” <em>Gender</em> <em>and</em> <em>Education</em>, vol. 29, no. 4, 2017, pp. 445-460.</p>
<p>Quinlan, Kathleen M. “How Emotion Matters in Four Key Relationships in Teaching and Learning in Higher Education.” <em>College Teaching</em>, vol. 64, no. 3, 2016, pp. 101-111, DOI: 10.1080/87567555.2015.1088818. </p>
<p>Ravitch, Sharon M. “FLUX Pedagogy: Transforming Teaching and Learning During Coronavirus.” <em>Penn GSE Perspectives on Urban Education</em>, vol. 17, 2020, pp. 1-15, <a href="https://urbanedjournal.gse.upenn.edu/volume-17-spring-2020/flux-pedagogy-transforming-teaching-and-leading-during-coronavirus">https://urbanedjournal.gse.upenn.edu/volume-17-spring-2020/flux-pedagogy-transforming-teaching-and-leading-during-coronavirus</a>. Accessed 22 Sept. 2023.</p>
<p>“Sample Group Agreements.” GSAFE, <a href="https://www.gsafewi.org/wp-content/uploads/Sample-Group-Agreements.pdf">https://www.gsafewi.org/wp-content/uploads/Sample-Group-Agreements.pdf</a>. Accessed 13 Aug. 2023. </p>
<p>Tedesco-Schneck, Mary. “Classroom Participation: A Model of Feminist Pedagogy.” <em>Nurse</em> <em>Educator</em>, vol. 43, no. 5, 2018, pp. 267-271.</p>
<p>Tormey, Roland. “Rethinking Student-Teacher Relationships in Higher Education: A Multidimensional Approach.” <em>Higher Education</em>, vol. 82, 2021, pp. 993-1011, DOI: 10.1007/s10734-021-00711-w.</p>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<h2 class="wp-block-heading"><strong>Appendix</strong></h2>
<div id="community-agreements" style="border: 2px solid #424242;">
<p><strong>Community Agreements for the Diverse BookFinder Community of Practice</strong></p>
<p><em>Thank you to the Association for Library Service to Children (ALSC) </em><em>for widely sharing their community agreements, which we have drawn on heavily in this document.</em></p>
<p><strong>Diverse BookFinder Community Agreements:</strong></p>
<p>These community agreements were developed so that all meetings/classes convened by Community of Practice members/facilitators of the Diverse BookFinder (DBF) are spaces where meaningful and respectful conversations are held. The agreements outline best practices to ensure that everyone has an opportunity for expression, accountability, and growth. </p>
<p>They provide a guide to how topics are discussed, the language used, and how our different experiences, identities, and knowledge are reflected in our thought processes, discussions, and decisions. As you participate in discussions, meetings, presentations, etc. please use these guidelines as a starting point and as a group; add additional agreements if necessary.</p>
<p>● <strong>Speak for yourself</strong>. Use “I” and be aware that your perspective is not everyone’s perspective or the “normal” perspective.</p>
<p>● <strong>Embrace multiple perspectives to engage in curiosity-driven dialogue (not debate or argument)</strong>. Have compassion for and honor people’s varied journeys while respecting their humanity. The goal of dialogue should not be to change anyone’s mind but to offer and receive a perspective for consideration and curiosity. Even if your every cell feels in disagreement with someone’s perspective, right and wrong binaries rarely build connection and understanding. Do note that racism, bigotry, and all other forms of oppression are not a difference of opinion and will not be tolerated.</p>
<p>● <strong>Be aware of the privilege, oppressions, and life experiences </strong>you carry and how they might impact your discussion process.</p>
<p>● <strong>Listen to and use people’s correct names and pronouns</strong>. Let people know how you would like to be addressed during introductions and include pronouns if you would like. If pronouns are not shared or if you are unsure of someone’s pronouns, refer to the person by their name.</p>
<p>●<strong> Share the air</strong>. Be aware of how much you are talking versus listening. Challenge yourself to invite others into the conversation and “step up” if you are prone to not participating. We all have something to bring to the discussion.</p>
<p>● <strong>Interrupt attempts to derail</strong>. Oftentimes, discomfort is so great that we immediately attempt to change the conversation to something that feels more comfortable. Before you know it, the conversation is about the weather when we were talking about equity. Work to stay engaged when you feel uncomfortable and make mistakes (this is when learning happens).</p>
<p>● <strong>Acknowledge intent while addressing impact</strong>. Work to not personalize the responses of others while taking care to be mindful of the impact of our words and our actions on others. Understand that intent does not equal impact and acknowledge the impact of something that was said or done during the conversation (or break) by criticizing ideas and not individuals.</p>
<p>● <strong>Interrupt bias and take feedback</strong>. We want to cultivate a space for everyone to learn, to be wrong and unlearn, to be accountable and change. We recognize that this process always happens in relation to each other and so can and will be hard. It’s also important to us that the necessary labor of creating this space does not fall on the same bodies. In order to hold the systems and structures of power that create harmful ways of relating to each other accountable, this work requires careful intention, thoughtfulness, creativity, and experimentation. We will not always get it right, but if we do this work collectively, we can move forward together.**</p>
<p>Self and community/collective accountability are essential to our work together. If you observe something oppressive being said or done (by yourself or others), please acknowledge it. [For example, “ouch” and “oops” and “oh” are words that can be used to acknowledge moments when you recognize something oppressive is said (“ouch,” “oh,” or another term) or you notice a mistake that you’ve made (“oops,” “oh,” or another term). Of course, you can raise a topic for discussion without using these terms as well.]</p>
<ul>
<li>During Large Group Classes: A facilitator will be identified as “Chat Moderator” for the evening. If you would like to bring an experience or example of bias to the attention of the group and have it addressed in some way, please use the chat to privately message the Chat Moderator and let them know.</li>
<li>During Small Group Discussions: If you would like to bring an experience or example of bias to the attention of the group and have it addressed in some way, please use the chat to privately message your facilitator and let them know.</li>
</ul>
<p>In either case, facilitators may address the moment immediately or they may ask for some grace and the opportunity to further reflect (and receive guidance) on how to best address the situation.</p>
<p>Please remember that everyone (your facilitator included) is human. As we experience feedback about bias, it is our personal responsibility to keep learning. However, that learning may require deeper dialog, reflection, and/or time.</p>
<p> **Ideas adapted from Angela Davis et al’s <em>Abolition. Feminism. Now.</em> (which centers the tools/strategies of transformative justice and community accountability).</p>
<p>● <strong>Remember that we all have opportunities to grow</strong>. Feedback is a gift of experience and expertise, and it acknowledges that learning is complex and never-ending. Receive it and consider systems of dominance and power at play in community conversations and interactions. Be aware of the lenses you do and do not have as a result of your identities and experiences.</p>
<p>●<strong> What’s said here, stays here</strong>. What’s learned here, leaves here. DBF meetings should be a safe place where people can feel free to be vulnerable and share things about their identities. No one should have to worry about these things being discussed outside of DBF. But take other DBF knowledge and learning with you!</p>
<p>● <strong>On Cameras: Connection is crucial</strong>. As we move through the process of training and discussion, we will be interrogating challenging and sensitive topics. We aim to provide the most open, productive, and engaged spaces in which to do this while still considering the flexibilities often required by real life. We believe that being on camera allows us to best build the connections and trust required to fully engage in these conversations. While we suggest that your camera remain on during all of your DBF sessions, cameras will be required during small group sessions.</p>
<ul>
<li>If you need to turn your camera off temporarily, please turn it on as soon as possible.</li>
<li>If you are having technical difficulties or need to leave your camera off for an entire small group session, please communicate that with your facilitator.</li>
<li>Occasional interruptions from guest stars such as dogs, cats, other furry/feathered/scaly friends, children, roommates, partners, parents, coworkers, doorbells, and food deliveries are a normal part of virtual meetings and working from home and will be expected.</li>
</ul>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<p>● <strong>Reach out before conflicts get worse</strong>. All of our facilitators are skilled educators, whether in higher education or as professional speakers, with particular expertise with facilitating conversations about race, ethnicity, and culture. Every effort has been made to create a solid footing for this work, with the goal of creating a brave space where we can all learn and grow together. Even so, conflicts may emerge during the course of our work. When that happens we would first like you to reach out to your small group facilitator. Our hope is that together you can discuss the matter and work towards resolution.</p>
<p>If you experience conflict with your small group leader, please reach out to Krista A. or Lisely L. for assistance.</p>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<p><strong>Sources:</strong></p>
<p>Much of the language above is borrowed from the following organizations/documents:</p>
<ol>
<li>“ALSC Community Agreements.” ALSC. Retrieved from <a href="https://www.ala.org/alsc/sites/ala.org.alsc/files/content/aboutalsc/governance/hndbk/ALS%20Community%20Agreements%2011.2020.pdf">https://www.ala.org/alsc/sites/ala.org.alsc/files/content/aboutalsc/governance/hndbk/ALS%20Community%20Agreements%2011.2020.pdf</a> </li>
</ol>
<ol start="2">
<li>“Sample Group Agreements.” GSAFE. Retrieved from<a href="https://www.gsafewi.org/wp-content/uploads/Sample-Group-Agreements.pdf"> https://www.gsafewi.org/wp-content/uploads/Sample-Group-Agreements.pdf</a> </li>
</ol>
<p style="font-size: 90%;">Community Agreements for the Diverse BookFinder Community of Practice © 2022 by Diverse BookFinder is licensed under <a href="http://creativecommons.org/licenses/by-nc-sa/4.0/" rel="license">CC BY-NC-SA 4.0</a>.</p>
</div>
<hr class="wp-block-separator has-alpha-channel-opacity" />
<p>Footnotes:</p>
<ol class="footnotes"><li class="footnote" id="footnote_0_11407">During the seven-week training program, learners and facilitators participated in weekly Large Core Classes and Small Group Sessions. After the training program ended, we continued to hold monthly Small Group Sessions to discuss new coding questions and share insights and approaches. The monthly Small Group Sessions provided ongoing consistency and community and continued our collaborative approach to learning from one another.</li><li class="footnote" id="footnote_1_11407">In our recruitment efforts, we promoted the DBF CoP to members of the following library organizations: the American Library Association’s Ethnic and Multicultural Information Exchange Round Table (EMIERT); the Black Caucus of ALA (BCALA); the Association for Library Service to Children’s Equity, Diversity, and Inclusion Implementation Task Force; the American Association of School Librarians’ Diversity, Equity and Inclusion Community of Practice; REFORMA: The National Association to Promote Library and Information Services to Latinos and the Spanish Speaking; the Asian Pacific American Library Association (APALA); the American Indian Library Association (AILA); the Rainbow Roundtable (RRT); and the Association of Jewish Libraries (AJL). We also shared the opportunity with the Association for Library Service to Children (ALSC), the Young Adult Library Services Association (YALSA), and divisions of the National Council of Teachers of English (NCTE).</li></ol>
Ramona Caponegro
https://www.inthelibrarywiththeleadpipe.org
HangingTogether: Libraries support data-driven decision making
https://hangingtogether.org/?p=13876
2024-02-21T08:31:00+00:00
<p class="has-small-font-size"><em>The following post is part of an ongoing <a href="https://hangingtogether.org/tag/building-for-the-future" rel="noreferrer noopener" target="_blank">series</a> about the OCLC-LIBER “Building for the future” program.</em> A Dutch version of this blog post <a href="https://hangingtogether.org/bibliotheken-ondersteunen-datagedreven-besluitvorming/">is also available</a>. </p>
<p>The <a href="https://www.oclc.org/research/partnership.html" rel="noreferrer noopener" target="_blank">OCLC Research Library Partnership</a> (RLP) and <a href="https://libereurope.eu/" rel="noreferrer noopener" target="_blank">LIBER</a> (Association of European Research Libraries) hosted a facilitated discussion on the topic of <strong>data-driven decision making</strong> on 7 February 2024. This event was a component of the ongoing <a href="https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023">Building for the future</a> series exploring how libraries are working to provide state-of-the-art services, as described in <a href="https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023" rel="noreferrer noopener" target="_blank">LIBER’s 2023-2027 strategy</a>. </p>
<figure class="wp-block-image alignleft size-large is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/s-o-c-i-a-l-c-u-t-r0saAQNjEjQ-unsplash-scaled.jpg"><img alt="This image shows three women seated at a table working at computers." class="wp-image-13881" height="683" src="https://hangingtogether.org/wp-content/uploads/2024/02/s-o-c-i-a-l-c-u-t-r0saAQNjEjQ-unsplash-1024x683.jpg" style="width: 460px; height: auto;" width="1024" /></a>Photo by <a href="https://unsplash.com/@socialcut?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">S O C I A L . C U T</a> on <a href="https://unsplash.com/photos/3-women-sitting-on-chair-in-front-of-table-with-laptop-computers-r0saAQNjEjQ?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Unsplash</a>
</figure>
<p>The OCLC RLP team worked collaboratively with members of the LIBER <a href="https://libereurope.eu/working-group/research-data-management/" rel="noreferrer noopener" target="_blank">Research Data Management </a> and <a href="https://libereurope.eu/working-group/liber-data-science-in-libraries-working-group/" rel="noreferrer noopener" target="_blank">Data Science in Libraries</a> working groups to develop the discussion questions. Like our earlier discussion on research data management, we tried to keep things practical, asking participants to share about current and future efforts, and to contribute their thoughts on the role and value of the library in supporting data-driven decision making. Small group discussions were facilitated by generous volunteers from <a href="https://libereurope.eu/working-groups/" rel="noreferrer noopener" target="_blank">LIBER working groups</a> and OCLC.</p>
<p>The virtual event was attended by participants from 35 institutions across 15 countries from Europe, North America, and Asia. Despite many regional and national differences, there were several key themes that surfaced across the seven breakout discussion groups, which is synthesized below. </p>
<h4 class="wp-block-heading">What does “data-driven decision making” mean for libraries?</h4>
<p>We asked participants this question in a virtual poll, and we reached fairly strong consensus that data-driven decision making means “using evidence to inform decisions and evaluate their outcomes.” While we framed this discussion using the phrase “data-driven,” we recognize that others prefer “data-informed” or “data-conscious.”</p>
<p>Indeed, while the conversations recognized the value of using quality data to inform decisions, we also heard cautionary comments that data should be considered as a decision support tool. Data should be used within context, and users should not use data to the exclusion of other qualitative ways of knowing. </p>
<figure class="wp-block-image size-full is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/dddm.jpg"><img alt="" class="wp-image-13954" height="445" src="https://hangingtogether.org/wp-content/uploads/2024/02/dddm.jpg" style="width: 840px; height: auto;" width="921" /></a><em>Online poll responses to question about the meaning of “data-driven” decision making</em></figure>
<h4 class="wp-block-heading">How are libraries supporting data-driven decision making?</h4>
<p><strong>There are dozens of ways that libraries are supporting data-driven decision making. </strong>We heard from participants who described <a href="https://hangingtogether.org/category/collective-collections/" rel="noreferrer noopener" target="_blank">collective collections</a> efforts, where a group of libraries is working together to manage their combined holdings, to support collection retention decisions, and more. Additionally, borrowing statistics can be used to inform both collection development and weeding decisions. </p>
<p>Beyond collections, participants described analyzing library building usage data (such as gate traffic and wifi usage) to measure the busyness of spaces, to inform space management decisions. </p>
<p>Participants also described the growing role of the library in research analytics, in support of institutional goals. In the UK, the library is usually responsible for managing data about the institutional scholarly record, for reporting to the national Research Excellent Framework (REF) assessment exercise. Elsewhere, library workers are supporting institutional efforts to understand research productivity, progress toward open research goals, and identify potential collaborations. And, of course, libraries are creating specific roles to manage a wide variety of data and make it available for reuse, the topic of a recent <a href="https://libereurope.eu/article/data-curation-an-interview-with-matthias-towe/" rel="noreferrer noopener" target="_blank">LIBER interview with Matthias Töwe</a>, Data Curator at ETH Zurich Library. </p>
<h4 class="wp-block-heading">Supporting data-driven decision making is challenging</h4>
<p><strong>Libraries are awash in data.</strong> Several participants described the feeling of being overwhelmed by all the data available, with the sheer volume making it challenging to manage, clean, and use effectively. At the same time, it can be difficult to even know what data is available, because it is spread across many silos within the organization. Greater organization and transparency are necessary. </p>
<p><strong>Collaboration is required</strong>, regardless of scale. Multi-institutional <a href="https://crl.acrl.org/index.php/crl/article/view/24618/32438" rel="noreferrer noopener" target="_blank">collective collections analyses </a>demand significant investment and commitment from a wide variety of stakeholders across many institutions and library units. Even when seeking an answer to local operational questions, where , as one participant noted, “we need certain bits of data from other people,” library workers must apply <a href="https://hangingtogether.org/social-interoperability-getting-to-know-all-about-you/" rel="noreferrer noopener" target="_blank">social interoperability</a> to get work done. </p>
<p><strong>Users asking for data and reports are often unable to clearly articulate what they need.</strong> This is apparently such a widely felt pain point that it was the #1 response to our online poll about the tensions and challenges of collaboration around data-driven decision making. One small group discussed the need to repurpose “reference interview” skills to interview data consumers in order to clarify the questions they are seeking to answer. </p>
<h4 class="wp-block-heading">What is the value proposition of the library for data-driven decision making?</h4>
<p>We asked the small groups to discuss the overarching value proposition of the library in supporting data-informed decisions, and several themes emerged across the group discussions:</p>
<p><strong>Libraries know metadata</strong>. The skills and knowledge that metadata librarians hold about library data is invaluable for managing collections. . . and more. This metadata expertise is clearly a strength, but one that may be easily overlooked, requiring improved messaging to non-library audiences. One participant expressed concern that library expertise is too easily dismissed because it was seen as “just books,” without recognizing the transferability and value of these skills, such as experience with complex enterprise systems, proficiency with data management, and the consistent application of rules, standards, and policies. </p>
<p><strong>Libraries use data to responsibly steward resources</strong>. Shared print and collective collections activities rely upon aggregated library holdings data to make decisions about collections development, retention, and long term and cost effective stewardship of the scholarly record. Several participants also described how data about both collections and library building usage has been leveraged to make decisions about future space utilization. Libraries also need “to show that we are making good use of [campus] resources, so that they will continue to fund us.” </p>
<p><strong>Research support services that extend beyond the library are highly visible to other campus stakeholders</strong>. Library support in areas like research data management, research intelligence, and managing data for national reporting requirements, in alignment with campus strategic priorities, often offer the greatest visibility to non-library stakeholders. For example, participants from the UK and Hong Kong described the central role of the library in collecting the scholarly record of the institution, to support national reporting requirements and provide analysis of the output and impact of institutional scholarship. A Canadian participant described their creation of a bibliometrics librarian who now leads an informal network of business intelligence officers across the university, providing decision support about compliance, assessment, and funding. Libraries are also exploring how they can define a set of indicators that will provide insights into open research activities, as described in a <a href="https://hangingtogether.org/supporting-open-research-at-the-university-of-manchester-libraries/" rel="noreferrer noopener" target="_blank">recent RLP webinar presentation</a> by Scott Taylor at the University of Manchester. </p>
<h4 class="wp-block-heading">What are some strategies libraries can use to demonstrate this value proposition? </h4>
<p><strong>Library leaders must advocate for the library’s role</strong>. We heard many examples of libraries providing institutional decision support. However, it can still be a challenge for non-library stakeholders to recognize the library as strong contributor, and participants echoed a concern we heard in the <a href="https://hangingtogether.org/exploring-the-challenges-and-opportunities-of-research-data-management-rdm/" rel="noreferrer noopener" target="_blank">previous facilitated discussion on research data management</a>: “People don’t think of the library.” Library leaders should be relentless in advocating for the knowledge and skills of library workers, guiding campus partners to conceptualize the library in new and modern ways. </p>
<p><strong>Use a “purpose tree” to codify value and communicate internally and externally</strong>. A UK participant in one small group discussion shared how she had created a purpose tree for her metadata team, which included a high level vision and strategy statement about their activities and how they contribute to library and university strategy. The document helped demonstrate that catalogers weren’t just “sitting in the corner and going through books,” but that they played a vital role in stewarding quality metadata, supporting an array of business needs. The other small group participants expressed sincere enthusiasm for this idea, and it seems to offer a framework for team building and strategic goal alignment.</p>
<p><strong>Visualizations and data storytelling is required.</strong> A strong theme throughout the small group discussions was that the data itself is not enough. Librarians must also develop data storytelling skills and leverage visualizations in order to effectively communicate and create enthusiasm for the data findings. </p>
<p><strong>Library workers must upskill, both individually and in teams</strong>. Library workers bring significant skills to managing data, but they often lack training in data analysis, including tools like PowerBI and Tableau. Participants shared many stories of how they are acquiring these skills. For example, a Hong Kong participant described how her institution formed an interest group to explore data analysis and build skills, enabling participants to learn from each other in a supportive environment. Another participant from the Netherlands described a similar effort, where their local working group is learning data visualization skills and building a broader community of practice. In general, participants expressed the need not only for the upskilling of existing staff, but the future onboarding of staff members with mature technical data analysis skills. </p>
<div class="wp-block-image">
<figure class="alignright size-full is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/Feel.jpg"><img alt="" class="wp-image-13922" height="378" src="https://hangingtogether.org/wp-content/uploads/2024/02/Feel.jpg" style="width: 508px; height: auto;" width="621" /></a><em>Word cloud summary from event polling</em></figure></div>
<p>We concluded the event by inviting participants to share one word about how they felt, and they reported feeling inspired, informed, and encouraged. </p>
<h4 class="wp-block-heading">Join us for the upcoming facilitated discussion on AI, Machine Learning, and Data Science</h4>
<p>The next discussion in this multi-part series on state-of-the-art services will take place on 17 April, where we will collectively explore the challenges and opportunities of AI, machine learning, and data science. The session will focus on the ways that research libraries are using (or want to use) advancing technologies to improve library workflows, metadata, and more. By facilitating structured small group discussions, we are inviting participants to ideate and share about their future visions for AI and data science, while also purposefully exploring the challenges libraries face in leveraging emerging technologies <a href="https://www.oclc.org/research/publications/2019/oclcresearch-responsible-operations-data-science-machine-learning-ai.html" rel="noreferrer noopener" target="_blank">responsibly</a>. <a href="https://www.oclc.org/oclc-forms/en/events/2024/liber-webinar-ai-machine-learning-data-science.html?_gl=1*2k27mp*_gcl_au*NTg2NDU2Ny4xNjk5ODg1MjA1" rel="noreferrer noopener" target="_blank">Register today</a> to save your spot. </p>
<p>The post <a href="https://hangingtogether.org/libraries-support-data-driven-decision-making/">Libraries support data-driven decision making</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Rebecca Bryant
https://hangingtogether.org/
Ed Summers: Uh oh it's Magic
https://inkdroid.org/2024/02/21/magic/
2024-02-21T05:00:00+00:00
<p>
The <a href="https://opensource.googleblog.com/2024/02/magika-ai-powered-fast-and-efficient-file-type-identification.html">news</a>
about Google open sourcing its new “AI” driven file format
identification tool <a href="https://github.com/google/magika">magika</a> made a splash in the
usual tech places recently. This post provides a very quick look at just
one file through the lens of three file format identification tools, and
gestures a bit about what we are giving up when we give in to big tech’s
machine learning models.
</p>
<p>
<a href="https://digipres.club/@anj/111966343349680486">Andy Jackson</a>
from the Digital Preservation Coalition has a <a href="https://anjackson.net/2024/02/20/a-first-look-at-magika/">good
post</a> about how <em>magika</em> is quite limited in terms of the
formats it identifies, and the types of information it reports. He also
points out that it’s important to remember that Google created magika to
help route files to specialized security scanners in Gmail and Google
Drive, which is quite different from digital preservation use cases. In
digital preservation the concern is usually around mitigating perceived
obsolescence of file formats, and also determining what applications can
be used to render the file, both of which require knowledge not just of
the format but also its version.
</p>
<p>
So, here’s a quick comparison of looking at <em>one TIFF file</em> using
<em>magika</em>, the venerable <a href="https://en.wikipedia.org/wiki/File_(command)">file</a> Unix
command, and <a href="https://www.itforarchivists.com/siegfried">siegfried</a>, which is
a specialized tool developed by and for the digital preservation
community. Think of this as a <a href="https://en.wikipedia.org/wiki/Close_reading">close reading</a> of
tools for file format identification, to try to discover or illustrate
something significant in the details of their output, rather than a
statistical overview of what the tools do more generally.
</p>
<pre class="text"><code>$ magika MCE_AF2G_2010.tif
MCE_AF2G_2010.tif: TIFF image data</code></pre>
<p>
Good job <em>magika</em>, it <em>is</em> a TIFF file.
</p>
<pre class="text"><code>$ file MCE_AF2G_2010.tif
MCE_AF2G_2010.tif: TIFF image data, little-endian, direntries=18, height=3724, bps=230, compression=LZW, PhotometricIntepretation=RGB, width=7460</code></pre>
<p>
Awesome <em>file</em>, thanks for the extra information about the
dimensions, compression, and colors.
</p>
<pre class="text"><code>$ sf MCE_AF2G_2010.tif
---
siegfried : 1.11.0
scandate : 2024-02-20T18:01:54-05:00
signature : default.sig
created : 2023-12-17T15:54:41+01:00
identifiers :
- name : 'pronom'
details : 'DROID_SignatureFile_V116.xml; container-signature-20231127.xml'
---
filename : 'MCE_AF2G_2010.tif'
filesize : 1484121
modified : 2024-02-08T10:07:46-05:00
errors :
matches :
- ns : 'pronom'
id : 'fmt/155'
format : 'Geographic Tagged Image File Format (GeoTIFF)'
version :
mime : 'image/tiff'
class : 'GIS, Image (Raster)'
basis : 'extension match tif; byte match at 0, 186 (signature 1/4)'
warning :</code></pre>
<p>
Niiice <em>siegfried</em>, this is important! The TIFF file isn’t just
an any image file, it’s a <a href="https://en.wikipedia.org/wiki/GeoTIFF">GeoTIFF</a> file. If we
were to open the TIFF file in a regular image viewer like Preview on
MacOS we’d see this:
</p>
<p>
<a href="https://inkdroid.org/images/geotiff-preview.png"><img class="img-fluid" src="https://inkdroid.org/images/geotiff-preview.png" /></a>
</p>
<p>
But since we know it’s a GeoTIFF we can also view it in GIS software
like <a href="https://qgis.org/en/site/">QGIS</a>:
</p>
<p>
<a href="https://inkdroid.org/images/geotiff-qgis.png"><img class="img-fluid" src="https://inkdroid.org/images/geotiff-qgis.png" title="2G Networks in Afghanistan" /></a>
</p>
<p>
And we can use other tools like <a href="https://gdal.org/programs/gdalinfo.html">gdalinfo</a> to look at
metadata in the file:
</p>
<pre class="text"><code>➜ tmp gdalinfo x.tif
Driver: GTiff/GeoTIFF
Files: x.tif
Size is 7460, 3724
Coordinate System is:
GEOGCRS["WGS 84",
ENSEMBLE["World Geodetic System 1984 ensemble",
MEMBER["World Geodetic System 1984 (Transit)"],
MEMBER["World Geodetic System 1984 (G730)"],
MEMBER["World Geodetic System 1984 (G873)"],
MEMBER["World Geodetic System 1984 (G1150)"],
MEMBER["World Geodetic System 1984 (G1674)"],
MEMBER["World Geodetic System 1984 (G1762)"],
MEMBER["World Geodetic System 1984 (G2139)"],
ELLIPSOID["WGS 84",6378137,298.257223563,
LENGTHUNIT["metre",1]],
ENSEMBLEACCURACY[2.0]],
PRIMEM["Greenwich",0,
ANGLEUNIT["degree",0.0174532925199433]],
CS[ellipsoidal,2],
AXIS["geodetic latitude (Lat)",north,
ORDER[1],
ANGLEUNIT["degree",0.0174532925199433]],
AXIS["geodetic longitude (Lon)",east,
ORDER[2],
ANGLEUNIT["degree",0.0174532925199433]],
USAGE[
SCOPE["Horizontal component of 3D system."],
AREA["World."],
BBOX[-90,-180,90,180]],
ID["EPSG",4326]]
Data axis to CRS axis mapping: 2,1
Origin = (56.249996410171626,38.166360030600401)
Pixel Size = (0.002160683455290,-0.002160683455290)
Metadata:
AREA_OR_POINT=Area
DataType=Thematic
Image Structure Metadata:
INTERLEAVE=PIXEL
Corner Coordinates:
Upper Left ( 56.2499964, 38.1663600) ( 56d14'59.99"E, 38d 9'58.90"N)
Lower Left ( 56.2499964, 30.1199748) ( 56d14'59.99"E, 30d 7'11.91"N)
Upper Right ( 72.3686950, 38.1663600) ( 72d22' 7.30"E, 38d 9'58.90"N)
Lower Right ( 72.3686950, 30.1199748) ( 72d22' 7.30"E, 30d 7'11.91"N)
Center ( 64.3093457, 34.1431674) ( 64d18'33.64"E, 34d 8'35.40"N)
Band 1 Block=7460x1 Type=Byte, ColorInterp=Red
Mask Flags: PER_DATASET ALPHA
Band 2 Block=7460x1 Type=Byte, ColorInterp=Green
Mask Flags: PER_DATASET ALPHA
Band 3 Block=7460x1 Type=Byte, ColorInterp=Blue
Mask Flags: PER_DATASET ALPHA
Band 4 Block=7460x1 Type=Byte, ColorInterp=Alpha</code></pre>
<p>
So if we used <em>magika</em> we never would have known we could put the
image on a map.
</p>
<p>
<em>… dramatic pause …</em>
</p>
<p>
But perhaps even more important is what we are <em>giving up</em> when
we rely entirely on a machine learning model, like what comes with
<em>magika</em>, instead of the hand crafted rules used by <em>file</em>
and <em>siegfried</em>.
</p>
<ol type="1">
<li>
We lose the ability to <em>reason</em> about the output. Why was one
format chosen and not the other?
</li>
<li>
We lose the ability to update the tool to recognize new formats or to
correctly choose other ones.
</li>
</ol>
<p>
When you install the Python version of <em>magika</em> you get a few
files added to your Python environment:
</p>
<pre><code>├── __init__.py
├── cli
│ └── magika.py
├── colors.py
├── config
│ ├── content_types_config.json
│ └── magika_config.json
├── content_types.py
├── logger.py
├── magika.py
├── models
│ └── standard_v1
│ ├── model.onnx
│ ├── model_config.json
│ ├── model_output_overwrite_map.json
│ └── thresholds.json
├── prediction_mode.py
├── strenum.py
└── types.py</code></pre>
<p>
The <code>model.onnx</code> file is what is used to determine what
format a file is in. It was generated by developers at Google who used
large amounts of data that they have, because they’re Google, and
(presumably) a compute cluster, because they’re Google. It’s a binary
blob that you can’t edit, or read as a human.
</p>
<p>
<em>file</em> and <em>siegfried</em> on the other hand use hand crafted
databases of “magic codes” to look for in order to determine what format
is likely for a file. Lots of time and effort have gone into creating
and maintaining them. The rules aren’t perfect, or complete, but we do
know how to update and fix them.
</p>
<p>
We don’t know all the details yet about how Google built their model,
but it’s quite unlikely that the dataset they used is going to be made
publicly available. I guess it’s possible (go on Google I dare you), but
even if they did you will need (potentially a lot) of compute resources
to be able to run the modeling itself. This means that not anyone can
build this “opensource” tool. The model data is available, but the way
to create it is not. It’s basically like making binary executables
available without the source code.
</p>
<p>
If you find a new file format, or notice that a file format isn’t being
recognized correctly you won’t be able to fix it, because fixing it
involves tuning the machine learning algorithm that was used, and
running it on an augmented dataset, which you don’t have access to.
</p>
<p>
Funnily enough, <em>siegfried</em> and <em>file</em> have no idea what
the file format for <em>model.onxx</em> is. But <em>magika</em> says
it’s a “Python compiled bytecode (executable)”. After experimenting a
little bit it does seem like <em>magika</em> is able to distinguish
programming language source code and executables quite a bit better than
traditional tools.
</p>
<p>
So perhaps the reality we are in is that it <em>might</em> be useful to
have multiple perspectives on file formats, and that running multiple
tools could have uses. However the digital preservation community should
be careful not to throw the baby out with the bath water. It’s important
that we are able to maintain our tools, and be able to understand why
they behave the way they do.
</p>
<hr />
<p>
<em>Update 2024-02-22: V <a href="https://merveilles.town/@v/111974900572616043">pointed out</a> to
me that it should be possible to <a href="https://mxnet.apache.org/versions/1.6/api/python/docs/tutorials/packages/onnx/fine_tuning_gluon.html">fine
tune</a> to the magika model, without requiring access to the original
corpus that it was trained on, and the compute infrastructure that was
used. This sounds promising, and I would actually really like to be
proven wrong here. But I remain concerned that while fine tuning might
be achievable, adding <strong>new</strong> file formats could prove
difficult, or at least beyond the ken of digital preservationists. I’m
not against learning new things (this old dog can still be taught new
tricks), but replacing domain expertise in people’s brains for what’s in
ML engineer’s brains is a real transfer of power that is underway at the
moment. Perhaps it has been underway for decades under the guise of
automation more generally (not just machine learning), but that is a
topic for another post…</em>
</p>
Ed Summers
https://inkdroid.org/
David Rosenthal: Clouds Over The Mines
tag:blogger.com,1999:blog-4503292949532760618.post-8253065947681816428
2024-02-20T18:28:19+00:00
In early December 2022 when I wrote skeptically about the economics of Bitcoin mining in <a href="https://blog.dshr.org/2022/12/foolish-lenders.html"><i>Foolish Lenders</i></a> the Bitcoin "price" was around $17K. It has now climbed 153% to around $43K and, below the fold, I am still posting skeptically about the economics of mining.<br />
<span><a name="more"></a></span>
<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgYIjjIeeerm-Zzz0z4uKi7-O4gmkaveLT2Wgg0hpDpm6yK_sxoG7fdLWniOb84aINDI2WDOegrlfrI8MB_YE56oCM5FqLWRU6aoaaGuHLuaqZBABOiNXW0Q4S3VmiT8vePMIfwIpAFiE1Fk4unMFV8ANOnw3N9cuxA94DxXVyJLsg1pNKexP88G5IhvZi7/s932/MinersHODLings.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="115" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgYIjjIeeerm-Zzz0z4uKi7-O4gmkaveLT2Wgg0hpDpm6yK_sxoG7fdLWniOb84aINDI2WDOegrlfrI8MB_YE56oCM5FqLWRU6aoaaGuHLuaqZBABOiNXW0Q4S3VmiT8vePMIfwIpAFiE1Fk4unMFV8ANOnw3N9cuxA94DxXVyJLsg1pNKexP88G5IhvZi7/w200-h115/MinersHODLings.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.bloomberg.com/news/articles/2024-02-07/bitcoin-btc-price-outlook-clouded-by-falling-miner-reserves-before-halving">Source</a></td></tr></tbody></table>
The first clue that the future for miners is clouded comes in <a href="https://www.bloomberg.com/news/articles/2024-02-07/bitcoin-btc-price-outlook-clouded-by-falling-miner-reserves-before-halving"><i>Bitcoin Outlook Clouded by Falling Miner Reserves Ahead of April’s Halving</i></a> by Sidhartha Shukla and David Pan:<br />
<blockquote>
Bitcoin miners are getting a jump on an anticipated decline in revenue from the so-called halving in April, when the blockchain’s network protocol will reduce rewards for verifying transactions by half.<br />
<br />
Miner reserves — unsold Bitcoin held in digital wallets associated with the companies — have dropped by 8,400 tokens since the start of 2024 to 1.8 million, a level last seen in June 2021, according to data compiled by CryptoQuant. Analysts said the decrease indicates miners are selling tokens.
</blockquote>
The somewhat misleading graph of the miners HODL-ings actually shows only a 2% drop in the number of BTC from the peak in August 2022. But the "value" of those HODL-ings has risen 75% from $44.8B to $78.5B.
One way of looking at it is that the mining industry started August 2022 with 1.865M BTC and, in the 18 months since mined 821,250 BTC for a total of 2,686,250 but they now have only 1.825M BTC so they must have sold 861,250, or about 5% more than they mined.
<a href="https://www.bloomberg.com/news/articles/2024-02-07/bitcoin-btc-price-outlook-clouded-by-falling-miner-reserves-before-halving">Nevertheless</a>:<br />
<blockquote>
“Miners have begun to sell more of their coins to bolster balance sheets and fund growth capex ahead of tougher times for margins when block rewards are halved in April,” said Matthew Sigel, head of digital-asset research at VanEck. “After the halving, scale will matter even more.”
</blockquote>
Before the great EFT pump, it was generally believed that the frantic efforts to pump BTC over $30K suggested that the mining industry's break-even point was around $30K. On 6<sup>th</sup> October BTC was $26K and the hash rate was around 412M TH/s. Lets assume that the industry was breaking even at BTC=$30K and a hash rate of 412MTH/s, so the industry's costs were covered by 45K BTC/month or $1.35B/month. Assuming the increase in efficiency is roughly cancelled by less efficient operators entering, the hash rate is now 527 TH/s and so costs are around $1.73B. But income is around $1.94B/month so margins are around 12%.<br />
<br />
After the halvening, income is 22.5K BTC/month. At BTC=$43K, this is $968M/month or 56% of current costs. To maintain a 12% margin, costs need to be cut to $852M/month, or 49% of current costs. Alternatively, if costs stay the same, the Bitcoin "price" needs to increase to $86K in April.<br />
<br />
The industry isn't going to halve its costs in the next three months, and even massive printing of unbacked Tether isn't going to double the BTC "price", so "tougher times for margins" are certainly among the clouds overhead.<br />
<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiMQw8ldiqYcm2zhKUDU2Bv3vYJSD9ml5KKG5R4RzSdtzRcoFsDMk-NpVq8GR9hz0K6lumcY3KwoIIwf5MwfWMpEVFhB7G1AH9C8enXxZHtynNj74CYlzk2LYsRgx_TG3i0D3ve2PLD-CeZok5qQWFKvULWFZVEddIxGc_s6edjCaYpIKfh8JlidgTt9UsS/s957/Hut8.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="109" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiMQw8ldiqYcm2zhKUDU2Bv3vYJSD9ml5KKG5R4RzSdtzRcoFsDMk-NpVq8GR9hz0K6lumcY3KwoIIwf5MwfWMpEVFhB7G1AH9C8enXxZHtynNj74CYlzk2LYsRgx_TG3i0D3ve2PLD-CeZok5qQWFKvULWFZVEddIxGc_s6edjCaYpIKfh8JlidgTt9UsS/w200-h109/Hut8.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;">Source</td></tr></tbody></table>
I am not the only skeptic. Short-seller J Capital Research investigated Hut 8, a large public miner, and on January 18<sup>th</sup> published <a href="https://www.jcapitalresearch.com/uploads/2/0/0/3/20032477/2024_01_18_hut.pdf"><i>The Coming HUT Pump and Dump</i></a>. As the chart shows, it was effective. The report featured a list of 22 different red flags, focused mainly on the recent merger between Hut 8 (HUT) and U.S. Bitcoin Corp. (USBTC) and asking:<br />
<blockquote>
Why then did HUT pay $745 mln to acquire this company and its planned payments?
</blockquote>
This is a good question, given <a href="https://www.jcapitalresearch.com/uploads/2/0/0/3/20032477/2024_01_18_hut.pdf">that</a>:<br />
<blockquote>
One person highly familiar with USBTC told us, “without the merger, [USBTC] would have done a structured bankruptcy.”
</blockquote>
It wasn't as if Hut 8 didn't have <a href="https://www.jcapitalresearch.com/uploads/2/0/0/3/20032477/2024_01_18_hut.pdf">problems of its own</a>:<br />
<blockquote>
Hut 8’s North Bay mining facility has been non-operational for an extended period of time, and problems at its Drumheller facility “have been causing miners to fail.”
</blockquote>
And the result of the merger is a <a href="https://www.jcapitalresearch.com/uploads/2/0/0/3/20032477/2024_01_18_hut.pdf">company that</a>:<br />
<blockquote>
has an industry-low efficiency rate and, post halving, will produce Bitcoin at a loss of close to $20,000 per Bitcoin at current spot prices.
</blockquote>
In other words, the merged company can barely make money now and cannot survive when the industry's income is halved in less than 3 months. This all looks like rats leaving the sinking ship with whatever they can carry. An impression reinforced by David Pan in <a href="https://www.bloomberg.com/news/articles/2024-02-07/bitcoin-miner-hut-8-hut-ceo-exits-three-weeks-after-short-seller-allegations"><i>Bitcoin Miner Hut 8 CEO Exits Three Weeks After Short-Seller Allegations </i></a>:<br />
<blockquote>
Hut 8 Corp., one of the largest publicly traded Bitcoin mining companies, named Asher Genoot to succeed Jaime Leverton as chief executive officer, three weeks after a short-seller released a report critical of its recent merger.<br />
<br />
The transition is effective immediately. Genoot served as the chief operating officer and the president of US Bitcoin Corp. Miami-based US Bitcoin, which has large-scale mining facilities across the US including Texas, completed its merger with then-Canadian miner Hut 8 in late 2023.<br />
<br />
The leadership transition comes amid increasing competition among the miners, a Bitcoin code update set to drastically reduce mining revenue in two months as well as the Jan. 18 report from short-seller J Capital Research alleging the merged company was a “pump and dump” waiting to happen. Hut 8 has disputed the claim.
</blockquote>
Genoot has <a href="https://www.jcapitalresearch.com/uploads/2/0/0/3/20032477/2024_01_18_hut.pdf">allegedly</a>:<br />
<blockquote>
abandoned several failed start-ups.
</blockquote>
His co-founder was USBTC's CEO and is now HUT's CSO/director <a href="https://www.jcapitalresearch.com/uploads/2/0/0/3/20032477/2024_01_18_hut.pdf">and</a>:<br />
<blockquote>
is a 30-year-old used-car salesman from Vancouver whose history is littered with involvement in SEC-defined pump-and-dumps, sporting share-price declines of 83%,
</blockquote>
The key inputs for profitable mining are low-cost power and state-of-the-art chips to use it as effficently as possible. Clouds are gathering over both of them.<br />
<br />
As regards access to cheap electricity, utilities are increasingly reluctant to provide it. Take, for example,
<a href="https://www.msn.com/en-ca/news/canada/crypto-mining-company-loses-bid-to-force-bc-hydro-to-provide-power/ar-BB1hPaZE"><i>Crypto mining company loses bid to force BC Hydro to provide power
</i></a> by Darryl Greer of The Canadian Press:<br />
<blockquote>
A cryptocurrency firm has lost a bid to force BC Hydro to provide the vast amounts of power needed for its operations, upholding the provincial government's right to pause power connections for new crypto miners. <br />
<br />
Conifex Timber Inc., a forestry firm that branched out into cryptocurrency mining, had gone to the B.C. Supreme Court to have the policy declared invalid.<br />
<br />
But Justice Michael Tammen ruled Friday that the government's move in December 2022 to pause new connections for cryptocurrency mining for 18 months was "reasonable" and not "unduly discriminatory."<br />
<br />
BC Hydro CEO Christopher O'Riley had told the court in an affidavit that the data centres proposed by Conifex would have consumed 2.5 million megawatt-hours of electricity each year.
</blockquote>
In the US, David Pan's <a href="https://www.bloomberg.com/news/articles/2024-02-01/bitcoin-miners-in-us-consume-up-to-2-3-of-nation-s-electricity"><i>US Bitcoin Miners Use as Much Electricity as Everyone in Utah </i></a> shows that regulators are starting to worry:<br />
<blockquote>
Bitcoin miners in the US are consuming the same amount of electricity as the entire state of Utah, among others, according to a new analysis by the US Energy Information Administration. And that’s considered the low end of the range of use.<br />
<br />
Electricity usage from mining operations represents 0.6% to 2.3% of all the country’s demand in 2023, according to the report released Thursday. It is the first time EIA has shared an estimate. The mining activity has generated mounting concerns from policymakers and electric grid planners about straining the grid during periods of peak demand, energy costs and energy-related carbon dioxide emissions.<br />
<br />
“This estimate of U.S. electricity demand supporting cryptocurrency mining would equal annual demand ranging from more than three million to more than six million homes,” the report said.
</blockquote>
Globally, this increase in demand is part of a bigger picture that is concerning, as Eamon Farhat reports in <a href="https://www.bloomberg.com/news/articles/2024-01-24/cryptocurrency-ai-electricity-demand-seen-doubling-in-three-years"><i>Electricity Demand at Data Centers Seen Doubling in Three Years</i></a>:<br />
<blockquote>
Global electricity demand from data centers, cryptocurrencies and artificial intelligence could more than double over the next three years, adding the equivalent of Germany’s entire power needs, the International Energy Agency forecasts in its latest report.<br />
<br />
There are more than 8,000 data centers globally, with about 33% in the US, 16% in Europe and close to 10% in China, with more planned. In Ireland, where data centers are developing rapidly, the IEA expects the sector to consume 32% of the country’s total electricity by 2026 compared to 17% in 2022. Ireland currently has 82 centers; 14 are under construction and 40 more are approved.<br />
<br />
Overall global electricity demand is expected to see a 3.4% increase until 2026, the report found. The increase, however, will be more than covered by renewables, such as wind, solar and hydro, and all-time high nuclear power.
</blockquote>
The first step towards stricter regulation is information collection which, as Kristoffer Tigue reports in <a href="https://arstechnica.com/tech-policy/2024/02/large-cryptocurrency-miners-in-us-now-have-to-report-energy-use-to-government/"><i>Large cryptocurrency miners in US now have to report energy use to government</i></a> has started:<br />
<blockquote>
The Biden administration is now requiring some cryptocurrency producers to report their energy use following rising concerns that the growing industry could pose a threat to the nation’s electricity grids and exacerbate climate change.<br />
<br />
The Energy Information Administration announced last week that it would start collecting energy use data from more than 130 “identified commercial cryptocurrency miners” operating in the US. The survey, which started this week, aims to get a sense of how the industry’s energy demand is evolving and where in the country cryptocurrency operations are growing fastest.<br />
<br />
“As cryptocurrency mining has increased in the United States, concerns have grown about the energy-intensive nature of the business and its effects on the US electric power industry,” the EIA said in a new report, following the announcement. “Concerns expressed to EIA include strains to the electricity grid during periods of peak demand, the potential for higher electricity prices, as well as effects on energy-related carbon dioxide emissions.”
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj14LmRDj5Qv_KdDA7OamCTjicibUyIkDrn8j5PcuhsVd6QEKnqomD_78uV1bvx_fnLtE9iBQzr2MRLvHqUugPXsxhwkM-n13jURWC9c9Na_6c6Aq0mlnYt0IAJjO_Uo34GO5aOpN7vq7KklNB85wr-5JenwvUxepMcR6IfW94LjmdIpVJSisgsy8X1UFr4/s1200/hash-rate.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="100" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj14LmRDj5Qv_KdDA7OamCTjicibUyIkDrn8j5PcuhsVd6QEKnqomD_78uV1bvx_fnLtE9iBQzr2MRLvHqUugPXsxhwkM-n13jURWC9c9Na_6c6Aq0mlnYt0IAJjO_Uo34GO5aOpN7vq7KklNB85wr-5JenwvUxepMcR6IfW94LjmdIpVJSisgsy8X1UFr4/w200-h100/hash-rate.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.blockchain.com/explorer/charts/hash-rate">Source</a></td></tr></tbody></table>
Bitcoin is not nearly as decentralized as advocates claim. It appears a third of the industry is in Texas and thus subject to both US regulators and concerns about <a href="https://www.nytimes.com/2023/12/25/technology/bitrush-bitcoin-cryptocurrency-china.html">Chinese</a> <a href="https://www.nytimes.com/2023/10/13/us/bitcoin-mines-china-united-states.html">ownership</a>. Turner Wright reports that <a href="https://cointelegraph.com/news/bitcoin-hash-rate-drops-freezing-texas"><i>Bitcoin hash rate drops by 34% amid freezing temperatures in Texas</i></a>:
<blockquote>
A sudden freeze in Texas may have contributed to a 34% drop in the Bitcoin hash rate, as some miners were forced to curtail operations amid demand on the state’s energy grid.<br />
<br />
Beginning on Jan. 14, temperatures in many parts of Texas dropped below freezing for one of the first times since a massive ice storm in February 2023. According to data from YCharts, the total Bitcoin network hash rate fell from more than 629 exahashes per second (EH/s) on Jan. 11 to roughly 415 EH/s on Jan. 15 — a 34% drop. The analytics site reported the hash rate increased to more than 454 EH/s on Jan. 16 as temperatures in Austin briefly rose above freezing during the day.
</blockquote>
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: right;"><tbody><tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiphQCQ1Q-bLryzRusVK3OLU3VGNCs_uzgYqgRgNnTdV_hcntuILWhBoVGCjpDVHx8ies7Zr_HSw1p2OYkyZb6eDA4N9bEY7m8Y52aELEjyoiT6tEe_VG1fJOWtBPYkHe40a-an0DJ2-mQ8oDUlE4VKZ2QlCmjqI8_iFoSFRIaNyxUDKLzj73P3ACVYs1Yn/s969/RigShipments.png" style="clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;"><img border="0" height="126" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiphQCQ1Q-bLryzRusVK3OLU3VGNCs_uzgYqgRgNnTdV_hcntuILWhBoVGCjpDVHx8ies7Zr_HSw1p2OYkyZb6eDA4N9bEY7m8Y52aELEjyoiT6tEe_VG1fJOWtBPYkHe40a-an0DJ2-mQ8oDUlE4VKZ2QlCmjqI8_iFoSFRIaNyxUDKLzj73P3ACVYs1Yn/w200-h126/RigShipments.png" width="200" /></a></td></tr><tr><td class="tr-caption" style="text-align: center;"><a href="https://www.bloomberg.com/news/articles/2024-02-08/bitcoin-mining-why-ethiopia-is-attracting-chinese-crypto-miners">Source</a></td></tr></tbody></table>
It isn't just that Texas has maybe a third of the industry, it is also that the US industry is growing much faster than anywhere else. Figures from Luxor Logistics show that in 2023 the US consumed <a href="https://www.bloomberg.com/news/articles/2024-02-08/bitcoin-mining-why-ethiopia-is-attracting-chinese-crypto-miners">nearly two-thirds of all new mining rigs</a>. The figures come from <a href="https://www.bloomberg.com/news/articles/2024-02-08/bitcoin-mining-why-ethiopia-is-attracting-chinese-crypto-miners"><i>Chinese Bitcoin Miners Find a New Crypto Haven in Ethiopia</i></a> by David Pan and Fasika Tadesse:<br />
<blockquote>
Ethiopia has emerged as a rare opportunity for all firms that mine the original cryptocurrency, as climate change and power scarcity fuel a backlash against the $16 billion-a-year industry (at Bitcoin’s current price) elsewhere. But it holds special appeal for Chinese companies, which once dominated Bitcoin mining but have struggled to compete with local rivals in Texas, the current hub.<br />
</blockquote>
The haven isn't without its own <a href="https://www.bloomberg.com/news/articles/2024-02-08/bitcoin-mining-why-ethiopia-is-attracting-chinese-crypto-miners">clouds</a>:<br />
<blockquote>
It is also a risky gamble, for the companies and Ethiopia alike. A succession of developing countries like Kazakhstan and Iran initially embraced Bitcoin mining, only to turn on the sector when its energy use threatened to fuel domestic discontent. China’s reign as the epicenter of Bitcoin mining came to an abrupt end in 2021, when the government banned it. Dozens of companies were forced to leave.<br />
<br />
Ethiopian officials are wary of the controversy that accompanies Bitcoin mining, according to industry executives who spoke on condition of anonymity to avoid jeopardizing government relations. Even after new generation capacity came online, almost half the population live without access to electricity, making mining a delicate topic. At the same time, it represents a potentially lucrative source of foreign-exchange earnings.<br />
...<br />
The reliance on abundant power is also a major vulnerability because it can put miners in competition for electricity with factories and households, exposing them to political backlash.<br />
<br />
When Kazakzstan imposed fresh curbs and taxes on miners, “it basically killed the industry,” said Hashlabs co-founder Alen Makhmetov. Two years after the clampdown, his 10-megawatt facility there is still sitting idle.<br />
<br />
And in an era when rising temperatures wreak havoc around the world, Bitcoin mining is increasingly seen as a contributor to global warming that doesn’t serve any productive purpose — even though miners have claimed they’re increasingly tapping clean energy. A study by United Nations University published in October estimated that two-thirds of the electricity used for Bitcoin mining in 2020 and 2021 was generated using fossil fuels.
</blockquote>
Developing countries aren't the only ones where miners face "domestic discontent". <a href="https://www.nytimes.com/2024/02/03/us/bitcoin-arkansas-noise-pollution.html"><i>Anxiety, Mood Swings and Sleepless Nights: Life Near a Bitcoin Mine</i></a> by Gabriel J.X. Dance reports on an example in Arkansas:<br />
<blockquote>
The Arkansas Data Centers Act, popularly called the Right to Mine law, offers Bitcoin miners legal protections from communities that may not want them operating nearby. Passed just eight days after it was introduced, the law was written in part by the Satoshi Action Fund, a nonprofit advocacy group based in Mississippi whose co-founder worked in the Trump administration rolling back Obama-era climate policies.
</blockquote>
The law <a href="https://www.nytimes.com/2024/02/03/us/bitcoin-arkansas-noise-pollution.html">ins't popular</a>:<br />
<blockquote>
A furious backlash has some lawmakers considering a statewide ban.
</blockquote>
The Satoshi Action Fund <a href="https://www.nytimes.com/2024/02/03/us/bitcoin-arkansas-noise-pollution.html">over-reached</a>:<br />
<blockquote>
Despite efforts to build bipartisan support, the Satoshi fund has succeeded predominantly in red states. But in Arkansas, where the state legislature is dominated by Republicans, it is conservatives who have led calls to repeal the law, including Senator Bryan King, a poultry farmer whose district includes a property purchased by one of the companies tied to the Chinese government. He said it was not fair that the Bitcoin operators received special protections under the law, which shields them from “<a href="https://www.arkleg.state.ar.us/Home/FTPDocument?path=%2FACTS%2F2023R%2FPublic%2FACT851.pdf">discriminatory industry specific regulations and taxes</a>,” including noise ordinances and zoning restrictions.
</blockquote>
At least the Ethiopian mines don't emit much CO2, they run on <a href="https://www.bloomberg.com/news/articles/2024-02-08/bitcoin-mining-why-ethiopia-is-attracting-chinese-crypto-miners">hydropower</a>:<br />
<blockquote>
The opening of the GERD project increased Ethiopia’s installed generation capacity to 5.3 gigawatt, 92% of which comes from hydropower, a renewable energy source.<br />
<br />
Once GERD is fully completed, Ethiopia’s generation capacity will double, according to Ethiopian Electric Power. It charges Bitcoin miners a fixed rate of 3.14 US cents per kilowatt hour for electricity drawn from substations, Marketing and Business Development Director Hiwot Eshetu said in an interview.<br />
<br />
While that’s similar to the average in Texas, rates in the Lone Star State can swing wildly, Luxor’s Vera said, making profits there less predictable. In Ethiopia, the price will fall once miners connect directly to power plants, according to Hiwot.
</blockquote>
But if the utility can make money selling power to the mines right by the dam they have little incentive to build out the grid than could get the power to the unserved half of the population.<br />
<br />
As regards the longer-term issue of access to state-of-the-art chips, it is important to note that the best mining chips are sold by Bitmain, a Chinese company, but manufactured at TSMC in Taiwan. There are two major risks here. The first is that the US appears determined to prevent China <a href="https://www.cnn.com/2023/10/18/tech/us-china-chip-export-curbs-intl-hnk/index.html">importing leading-edge chips</a> and the <a href="https://www.theguardian.com/technology/2024/jan/02/asml-halts-hi-tech-chip-making-exports-to-china-reportedly-after-us-request">equipment to make them</a>. China remains at least a <a href="https://arstechnica.com/gadgets/2024/02/china-close-to-shipping-5nm-chips-despite-western-curbs/">generation and a half</a> behind TSMC and Samsung, and reportedly has poor yields on its leading-edge process. These restrictions could well prevent Chinese mining companies acquiring leading-edge rigs, and might cause TSMC problems in fab-ing Bitmain products.<br />
<br />
Second, there is the looming threat of a Chinese blockade or even invasion of Taiwan. Of course, difficulties for Bitcoin miners are hardly the major impact if these threats are made good. One might think that, even if supplies of new mining chips were cut off, existing rigs would continue working. In the short term they would, but there is a long history of <a href="https://blog.dshr.org/2022/05/generally-accepted-accounting-principles.html">rigs being obsolete</a> after <a href="https://doi.org/10.1016/j.resconrec.2021.105901">about 18 months</a>. So they aren't designed or operated for longevity.<br />
David. (noreply@blogger.com)
https://blog.dshr.org/
HangingTogether: Advancing IDEAs: Inclusion, Diversity, Equity, Accessibility, 20 February 2024
https://hangingtogether.org/?p=13992
2024-02-20T16:45:19+00:00
<p class="has-small-font-size"><em>The following post is one in a regular <a href="https://hangingtogether.org/tag/IDEA/" rel="noreferrer noopener" target="_blank">series</a> on issues of Inclusion, Diversity, Equity, and Accessibility, compiled by a team of OCLC contributors.</em></p>
<div class="wp-block-image">
<figure class="alignleft size-large is-resized"><a href="https://hangingtogether.org/wp-content/uploads/2024/02/romain-vignes-ywqa9IZB-dU-unsplash-scaled.jpg"><img alt="Close of up on a dictionary page -- all text is out of focus except for the first few lines of the definition for the word "focus"" class="wp-image-13991" height="680" src="https://hangingtogether.org/wp-content/uploads/2024/02/romain-vignes-ywqa9IZB-dU-unsplash-1024x680.jpg" style="width: 508px; height: auto;" width="1024" /></a><sup>Photo by <a href="https://unsplash.com/@rvignes?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Romain Vignes</a> on <a href="https://unsplash.com/photos/focus-dictionary-index-page-ywqa9IZB-dU?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash">Unsplash</a> </sup></figure></div>
<h2 class="wp-block-heading"><a href="https://search.worldcat.org/en/title/1350082275" rel="noreferrer noopener" target="_blank">Country of the blind: A memoir at the end of sight</a> </h2>
<p>Andrew Leland’s memoir documents his journey dealing with a genetic disorder that is slowly diminishing his sight. As a voracious reader, writer, and editor, Leland discusses the challenges he has faced in adapting to assistive technologies. His journey isn’t just about learning new technologies but about dealing with intersectional identities as a sighted person learning to integrate into the culture of people with low vision. If you want a quick hit, check out <a href="https://99percentinvisible.org/episode/the-country-of-the-blind/" rel="noreferrer noopener" target="_blank">Roman Mars’ 99% Invisible podcast interview</a> with Leland. </p>
<p style="padding-left: 40px;">Listening to Leland’s interview, I was struck when he asked many of the questions I’ve asked myself. As my vision diminishes, should I learn assistive technologies while I’m still sighted? Despite a life of wearing corrective glasses, I have not necessarily identified myself as a member of the low-vision community – although it animates my interest in accessibility. Leland’s discussion of his own journey here was particularly poignant to me. <em>Contributed by </em><a href="https://www.oclc.org/research/people/urban-richard.html" rel="noreferrer noopener" target="_blank"><em>Richard J. Urban.</em></a> </p>
<h2 class="wp-block-heading">Intellectual Freedom Round Table virtual book club </h2>
<p>The <a href="https://www.ala.org/rt/ifrt/ifrtreads" rel="noreferrer noopener" target="_blank">IFRT Reads</a> discussion group of ALA’s Intellectual Freedom Round Table will host the second installment of its 2024 series in the form of a free hour-long webinar on 27 February 4:00 p.m. Eastern Time. During the first session held on January 24, Chapter 2 (“Understanding the Library Bill of Rights and its Significance to Diversity in Collection Development”) of <em>Decentering Whiteness in Libraries: A Framework for Inclusive Collection Management Practices</em>, by Dr. Andrea Jamison, assistant professor of school librarianship at Illinois State University (OCLC Symbol: IAI), was discussed. Dr. Jamison will be present and answering questions for the February 27 installment, <a href="https://ala-events.zoom.us/webinar/register/WN_L3WicGHuS6uYA4UZNq5_Qw#/registration" rel="noreferrer noopener" target="_blank">registration</a> for which is now open. </p>
<p style="padding-left: 40px;">As chair of the working group responsible for the 2019 “<a href="https://www.ala.org/advocacy/intfreedom/librarybill/interpretations/diversecollections" rel="noreferrer noopener" target="_blank">Diverse Collections: An Interpretation of the Library Bill of Rights</a>,” Dr. Jamison stands in a unique position to talk authoritatively about building diverse collections according to the core principles of intellectual freedom. Currently, Dr. Jamison serves as Chair of ALA’s <a href="https://www.ala.org/rt/emiert" rel="noreferrer noopener" target="_blank">Ethnic and Multicultural Information Exchange Round Table</a> (EMIERT), which is specifically charged with promoting services “for all ethnolinguistic and multicultural communities in general.” <em>Contributed by Jay Weitz.</em> </p>
<h2 class="wp-block-heading">Academic libraries leading the way in access and diversity </h2>
<p><em>Insight Into Diversity</em> magazine, the largest and oldest diversity and inclusion publication in higher education, has awarded 56 academic libraries the inaugural <a href="https://www.insightintodiversity.com/lead-award/" rel="noreferrer noopener" target="_blank">2024 Library Excellence in Access and Diversity (LEAD) Award</a> for their outstanding programs and initiatives promoting diversity, equity, and inclusion (DEI). The LEAD Award highlights initiatives in areas such as research, technology, accessibility, exhibitions, and community outreach. “As higher education institutions provide more than just legally required accessibility and disability services, they could find guidance from their own academic libraries, who are often at the forefront of this field. From digital resources and sensory spaces to personalized assistance, many academic libraries prioritize creating an environment where all members of the academic community can thrive, ensuring equal access to information.” Out of nearly 150 applicants, the 56 winners will be featured in the March 2024 issue of <em>Insight Into Diversity, </em>with the outstanding work of <a href="https://www.insightintodiversity.com/academic-libraries-leading-the-way-in-accessibility/" rel="noreferrer noopener" target="_blank">ten libraries featured in a preview</a>. </p>
<p style="padding-left: 40px;">This welcome recognition of libraries leading the way highlights the range of initiatives and programs implemented across libraries. From prioritizing diverse hiring practices to an embedded <em>E</em><em>quity and Engagement Librarian, </em>and from fellowship programs for underrepresented groups, to providing sensory spaces and adaptive computing labs, their successes provide the field with models and inspiration for libraries to prioritize this work locally. <em>Contributed by </em><a href="https://www.webjunction.org/about-us/our-team/peterson-jennifer.html" rel="noreferrer noopener" target="_blank"><em>Jennifer Peterson</em></a><em>.</em> </p>
<h2 class="wp-block-heading">University of North Texas libraries advised to suspend Pride Week events </h2>
<p>On 15 February 2024, the KERA News website <a href="https://www.keranews.org/news/2024-02-15/pride-gets-canceled-unt-libraries-cant-plan-lgbtq-events-and-comply-with-dei-ban" rel="noreferrer noopener" target="_blank">reported</a> that the University of North Texas legal counsel advised UNT Libraries (OCLC Symbol: INT) to suspend planned events for Pride Week. In an email sent to library employees on 9 February, university administration stated that “using staff and faculty time on the activities we were planning around Pride Week would be in violation of SB17,” a bill passed by the Texas State Legislature and signed by Governor Greg Abbott on 14 June 2023 that prohibits publicly funded colleges and universities from conducting “trainings, programs, or activities that advocate for or give preferential treatment on the basis of race, sex, color, ethnicity, gender identity, or sexual orientation.” Melisa Brown, senior director of UNT Relations, stated that the “recognition of commemorative months [such as Black History Month, Pride Month, International Women’s Month, Asian American and Pacific Islander Heritage Month, Disability Pride Month and the like] is something the university has celebrated for years, and UNT plans to continue this. What is changing in the university’s recognitions is that any event the university funds must focus on the history of the culture being celebrated in order to be compliant with the law.” </p>
<p style="padding-left: 40px;">Recognizing diverse groups through various library events is a common method of promoting inclusion on college campuses. Although the article does not describe the events that had to be suspended, it is unfortunate that Texas librarians find themselves under threat of losing funding for developing programming that celebrates diversity<em>. Contributed by </em><a href="https://www.oclc.org/research/people/levy-morris.html" rel="noreferrer noopener" target="_blank"><em>Morris Levy</em></a></p><p>The post <a href="https://hangingtogether.org/advancing-ideas-inclusion-diversity-equity-accessibility-20-february-2024/">Advancing IDEAs: Inclusion, Diversity, Equity, Accessibility, 20 February 2024</a> appeared first on <a href="https://hangingtogether.org">Hanging Together</a>.</p>
Merrilee Proffitt
https://hangingtogether.org/
Lucidworks: What Is B2B Site Search?
https://lucidworks.com/?p=28434
2024-02-20T14:03:47+00:00
<p>The efficiency and effectiveness of a website’s search function are pivotal in today’s competitive world. This is particularly true for...</p>
<p>The post <a href="https://lucidworks.com/post/what-is-b2b-site-search/">What Is B2B Site Search?</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Lucidworks
https://lucidworks.com/
Digital Library Federation: Forum Feedback: Insights from the DLF Community on Event Sessions
https://www.diglib.org/?p=30661
2024-02-20T13:00:10+00:00
<div class="wp-block-image"><img alt="Forum Feedback logo." class="wpa-warning wpa-suspicious-alt wp-image-27192 aligncenter" height="246" src="https://www.diglib.org/wp-content/uploads/sites/3/2023/11/smForum-Feedback_landscape-transparent-1.png" width="730" /></div>
<p> </p>
<p><i><span style="font-weight: 400;">Welcome to our </span></i><span style="font-weight: 400;">Forum Feedback</span><i><span style="font-weight: 400;"> series, a space dedicated to gathering insights from our vibrant community. Here we delve into the ever-changing conference landscape, exploring themes such as health, safety, accessibility, affordability, and sustainability. </span></i><a href="https://www.diglib.org/category/forum-feedback/"><i><span style="font-weight: 400;">Follow along</span></i></a><i><span style="font-weight: 400;"> as we share data, insights, and thought-provoking discussions aimed at shaping the future of gatherings with inclusivity at the forefront. </span></i><a href="https://docs.google.com/forms/d/e/1FAIpQLScRSr1hSuUoaJTEGRwx-NeMRaEVlfu31F3d_gseQdCkbQknLQ/viewform"><i><span style="font-weight: 400;">We encourage you to actively participate by sharing your own valuable feedback</span></i></a><i><span style="font-weight: 400;">. Together, let’s shape the landscape of conferences for the better. </span></i></p>
<p><span style="font-weight: 400;">Team DLF has been looking forward to 2024 for a couple of years because we knew it would be the best time to start our journey of evaluating how we gather for the DLF Forum. In preparation, we asked the registrants of the </span><a href="https://forum2023.diglib.org/"><span style="font-weight: 400;">2023 CLIR Events (DLF Forum, Learn@DLF, and NDSA’s Digital Preservation)</span></a><span style="font-weight: 400;"> the question, “When attending a conference, what is most important to you?” While we make it a regular part of our workflow to send surveys to attendees after an event, we wanted to take advantage of the opportunity to ask every person who would be joining us in St. Louis this question. It’s important to note the language of this question and those that followed (reviewed below). Our question asks what folks are looking for from a conference, not for feedback on past DLF Forum events. This was purposeful as we wanted the opportunity to think beyond what has already been done. This is an opportunity to talk about the larger context of the academic conference.</span></p>
<p><span style="font-weight: 400;">We received over 400 responses and categorized them into four major categories: networking, event venue, event sessions, and opportunities to present. </span></p>
<figure class="wp-caption aligncenter" id="attachment_30662" style="width: 834px;"><img alt="A word cloud showing 10 words in different colors, the biggest being "netowkring," "session" and "work."" class="size-full wp-image-30662" height="372" src="https://www.diglib.org/wp-content/uploads/sites/3/2024/02/wordcloud.png" width="834" />Word cloud showing major themes of responses to the question, “When attending a conference, what is most important to you?”</figure>
<p><span style="font-weight: 400;">After collecting this data from registrants, we turned what we learned into a participatory closing plenary session in St. Louis. In this session, we shared the four major categories we saw in the registrant responses and asked participants to quietly reflect on prompts related to each category. For this update, we’re sharing and reflecting on what folks had to say about the event sessions category. </span></p>
<p><span style="font-weight: 400;">The questions developed for the event sessions category included: </span></p>
<ol>
<li style="font-weight: 400;"><span style="font-weight: 400;">What determines a quality conference session or workshop? </span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">What determines a quality featured speaker (keynote) session? </span></li>
<li style="font-weight: 400;"><span style="font-weight: 400;">Are there any specific session types you particularly like? </span></li>
</ol>
<figure class="wp-caption alignnone" id="attachment_30673" style="width: 2560px;"><img alt="Tight view of someone holding a half sheet of paper with questions." class="wp-image-30673 size-full" height="1707" src="https://www.diglib.org/wp-content/uploads/sites/3/2024/02/CLIRW-38-scaled.jpg" width="2560" />We provided the prompts on the digital screens as well as half sheets of printed paper on each table. Photo by Tyler Small.</figure>
<p><span style="font-weight: 400;">We provided these questions as a way to get started, but folks were also empowered to freestyle. The questions were developed out of the desire for folks to elaborate on their responses to the 2023 CLIR Events registration question. Many people responded to the question “When attending a conference, what is most important to you?” by saying “quality conference sessions.” “Quality” can mean different things to different people, so we wanted to give folks a chance to reflect on what this means. </span></p>
<p><span style="font-weight: 400;">After quiet reflection, another pivotal aspect of the plenary unfolded: the sharing and exchange among community members. Participants were seated at round tables, equipped with large sticky pads, smaller sticky pads, pens, and markers, facilitating discussions and the recording of their responses and reflections. Providing a platform for our individual experiences to be heard is invaluable. Engaging with fellow community members serves as a reminder of the diverse perspectives present at the conference, enriching our collective understanding. Following the small group discussions, we opened the floor for anyone wishing to share insights or resonant thoughts with the larger group. This structured approach, akin to the </span><a href="https://www.theteachertoolkit.com/index.php/tool/think-pair-share"><span style="font-weight: 400;">think, pair, share method</span></a><span style="font-weight: 400;">, evoked nostalgic memories of my days in library instruction and provided an enjoyable conclusion to the 2023 DLF Forum. </span></p>
<p><span style="font-weight: 400;">In an effort to include folks who may not have been able to attend the in-person plenary session, we offered a virtual session with the same guidelines and utilized Padlet for folks to record group discussions. </span></p>
<figure class="wp-caption alignnone" id="attachment_30663" style="width: 2560px;"><img alt="Birds eye view of a table with sticky notes, showing people reaching for things." class="wp-image-30663 size-full" height="1707" src="https://www.diglib.org/wp-content/uploads/sites/3/2024/02/CLIRW-41-scaled.jpg" width="2560" />After independent reflection, folks participated in small group discussions of their responses. Photo by Tyler Small.</figure>
<h2><b><br />
Insights and Desires for Conference Sessions</b></h2>
<p><span style="font-weight: 400;">The word “sessions” was used 369 times in the 2023 CLIR Events registration responses, and other words such as “quality,” “practical,” “diverse,” “engaging,” and “relevant” were used as descriptors of what folks are looking for. </span></p>
<p><span style="font-weight: 400;">At the in-person closing plenary session, we heard responses such as requests for more working sessions, combination sessions (for DLF this typically includes 2-3 15-minute presentations organized around similar topics) and roundtable discussions. Traditionally, working sessions have included time for </span><a href="https://www.diglib.org/groups/"><span style="font-weight: 400;">DLF working groups</span></a><span style="font-weight: 400;"> that already exist to meet or for new ones to kick off around a topic of interest. Other folks want to see accessibility emphasized, including the use of microphones, coaching on how to project one’s voice, and requiring accessible slides. In virtual trainings leading up to our events, in our email communications, and in our opening plenary, Team DLF does make sure to emphasize the importance of microphone use by all presenters and attendees, and </span><a href="https://www.diglib.org/creating-accessible-presentations/"><span style="font-weight: 400;">we offer resources for creating accessible presentations</span></a><span style="font-weight: 400;">, thanks to the work of Debbie Krahmer and DLF’s Committee for Equity and Inclusion. However, this doesn’t mean we can’t expand our emphasis on accessibility for in-person and virtual events and meetings. </span></p>
<p><span style="font-weight: 400;">Folks also want to see a good balance between the technical and the practical aspects of digital library work. We heard feedback from participants from all Forum Feedback modalities that they are seeking practical and applicable tips and workflows to incorporate at their home institutions. Folks also want to see diverse, friendly and engaging presenters on the conference program. </span></p>
<figure class="wp-caption alignnone" id="attachment_30664" style="width: 2560px;"><img alt="People seated at a roundtable with papers and markers. One person reaching for a marker." class="wp-image-30664 size-full" height="1707" src="https://www.diglib.org/wp-content/uploads/sites/3/2024/02/CLIRW-51-scaled.jpg" width="2560" />After independent reflection, folks participated in small group discussions of their responses. Photo by Tyler Small.</figure>
<h2><b><br />
Navigating Diverse Perspectives</b></h2>
<p><span style="font-weight: 400;">As expected, individuals offered varying perspectives throughout all modalities of Forum Feedback. Some prefer detailed, granular sessions, while others seek broader discussions. Preferences also diverge regarding session lengths, with some advocating for longer sessions and others for shorter ones. Navigating through this qualitative data, albeit occasionally contradictory, can be enlightening and worthwhile. The discernible patterns we’ve identified are invaluable to us as conference organizers, aiding in the deliberation of an optimal conference format for both in-person and virtual events. It’s essential to recognize that an in-person conference doesn’t automatically translate to a virtual one simply because it’s livestreamed online. </span></p>
<p><span style="font-weight: 400;">With these considerations in mind, we invite continued engagement and feedback from our community as we collectively shape the future of gatherings. Thank you so much to those who have made invaluable contributions to Forum Feedback! </span></p>
<p>The post <a href="https://www.diglib.org/forum-feedback-event-sessions/" rel="nofollow">Forum Feedback: Insights from the DLF Community on Event Sessions</a> appeared first on <a href="https://www.diglib.org" rel="nofollow">DLF</a>.</p>
Jennifer Ferretti
https://www.diglib.org
Lucidworks: 3 Issues with Current Ecommerce Product Discovery
https://lucidworks.com/?p=28046
2024-02-15T19:50:13+00:00
<p>Enhancing the customer experience is the heart of product discovery. Here's the strategic approach brands should take and the technological prowess that can drive success.</p>
<p>The post <a href="https://lucidworks.com/post/3-issues-with-current-ecommerce-product-discovery/">3 Issues with Current Ecommerce Product Discovery</a> appeared first on <a href="https://lucidworks.com">Lucidworks</a>.</p>
Lila Schoenfield
https://lucidworks.com/