Digital Archivists

From Noisebridge
(Difference between revisions)
Jump to: navigation, search
(HARDWARE)
 
(41 intermediate revisions by 9 users not shown)
Line 1: Line 1:
The Noisebridge Digital Archivists group has one active project:
+
===INTRODUCTION===
  
[[Bookscanner]]
+
The Noisebridge Digital Archivists group is for anyone interested in a range of topics including book scanning, digitizing video/audio, open access, creative commons, and more. We usually meet on '''Wednesdays at 7pm'''. Check the mailing list to see if there's a meeting this week.
  
 +
The Noisebridge book scanner was demonstrated at the 2015 Maker Faire in San Mateo
  
Noisebridge Digital Archivists mailing list: [http://www.noisebridge.net/mailman/listinfo/digitalarchivists digitalarchivists]
+
[[file:ScannerMF.jpg|800px]]
  
Meeting notes:
+
===MAILING LIST===
 +
----
 +
Many weekly updates are made exclusively on our mailing list. Please join to keep in touch and ask questions:
  
[[Digital Archivists 2013-04-21]]
+
Join the mailing list here: [http://www.noisebridge.net/mailman/listinfo/digitalarchivists digitalarchivists]
  
[[Digital Archivists 2013-07-08]]
+
A summary of topics in our mailing list will be posted soon.
  
 +
===HARDWARE===
 +
----
 +
Our first project was to build a book scanner at Noisebridge based on the open-source [http://diybookscanner.myshopify.com/products/diy-book-scanner-kit DIY Book Scanner] design. Check out the schematic, below, to see all the components. 
  
Some links to other digital archive projects:
+
[[File:Reetz_Scanner_Schematic.png|800px]]
  
The [http://dp.la/ The Digital Public Library of America (DPLA)]
+
This model of book scanner is intended to preserve the book being digitized, therefore all page turning is done manually (by hand).
 +
 
 +
[[File:BookScanHand_Quick.gif|Page Turning]]
 +
 
 +
===SOFTWARE===
 +
----
 +
The device captures from both cameras simultaneously by using USB drivers provide by gphoto2 wrapped into a nice python script [https://github.com/danyq/diybookscanner here]. Check out gphoto2 [http://gphoto.org/ here] and the list of supported cameras [http://gphoto.org/proj/libgphoto2/support.php here]. If you have linux you can just type: sudo apt-get install gphoto2
 +
 
 +
 
 +
Currently we are making simple scripts to perform Optical Character Recognition (OCR) and produce searchable full text as PDF documents [https://github.com/andrewdefries/TesseractOCR Github Code Repository]. Check it out and fork our repos.
 +
 
 +
 
 +
[[file:ReetzWorkFlow.png|800px]]
 +
 
 +
---
 +
 
 +
===Related links===
 +
----
 +
diy book scanner
 +
http://diybookscanner.org
 +
http://vimeo.com/29184137
 +
 
 +
linear book scanner project
 +
http://linearbookscanner.org
 +
 
 +
the internet archive
 +
http://archive.org/about/
 +
 
 +
prelinger archives
 +
http://archive.org/details/prelinger
 +
 
 +
new alexandria archive
 +
http://www.newalexandria.org/archive/
 +
 
 +
using archival information in interesting ways
 +
http://maptcha.org/
 +
 
 +
a film about nitrile film decomposition
 +
http://www.youtube.com/watch?v=r-FJyJjH6IE
 +
 
 +
The Digital Public Library of America (DPLA)
 +
http://dp.la/
 +
 
 +
 
 +
For details about the book scanner, see: '''[[Bookscanner]]'''
 +
 
 +
 
 +
===Meeting notes===
 +
----
 +
2015-02-25 - training session for scanner, looked at linear book scanner prototype
 +
 
 +
2015-02-11 - scanner demo, discussion
 +
 
 +
2015-01-25 - discussed Andrew Defries' scanning booth and post-processing station, experimented with Vinux
 +
 
 +
2014-09-24 - reassembled and demoed the scanner to visitors, some linear book scanner updates, discussed funding and directions for the group
 +
 
 +
2014-08-28 - social meeting at Open Drinks
 +
 
 +
2014-07-17 - discussed scanning stand design at Muddy Waters
 +
 
 +
2014-06-19 - short meeting and 5mof
 +
 
 +
2014-06-12 - discussed plans for a portable phone-based scanner, assembled cardboard prototype of prism scanner
 +
 
 +
2014-06-05 - discussed improvements to the scanner and ocr pipeline
 +
 
 +
2014-05-29 - main topics were portable scanners for the blind and OCR comparison algorithms. full notes: [[Digital Archivists 2014-05-29]]
 +
 
 +
2014-05-22 - discussed lessons learned from DIY Scanner 1.0 and a range of new initiatives. full notes: [[Digital Archivists 2014-05-22]]
 +
 
 +
2014-01-09 - scanned books, investigated glare and alignment issues
 +
 
 +
2013-12-19 - bug fixes in the code, reviewed linear scanners from Michigan, improved documentation, discussed larger book scanning network
 +
 
 +
2013-12-12 - replaced LED shields with black cloth, more scanning
 +
 
 +
2013-12-05 - scanned books
 +
 
 +
2013-11-21 - scanned a book, wrote tesseract ocr script, looked at old computers, 5mof presentation
 +
 
 +
2013-11-14 - improved scanning gui, added LED shields
 +
 
 +
2013-11-07 - replaced bunjees, mounted LEDs, tried to run spreads
 +
 
 +
2013-10-31 - reassembled scanner, looked at 'spreads' software, screen readers
 +
 
 +
2013-10-24 - disassembled and painted scanner parts
 +
 
 +
2013-10-17 - investigated glare and LED positioning, plan to paint black, 5mof presentation
 +
 
 +
2013-10-10 - mounted cameras, built trigger mechanism, wrote capture script
 +
 
 +
2013-10-03 - picked up scanner and equipment from internet archive
 +
 
 +
2013-07-27 - internet archive borrows the book scanner
 +
 
 +
2013-07-18 - 5mof presentation on book scanning
 +
 
 +
2013-07-08 - discussed scanner location, security, transportation, reimbursement, promotion. full notes: [[Digital Archivists 2013-07-08]]
 +
 
 +
2013-05-23 - scavenged and cut glass from flatbed scanners, plan to attend internet archive friday lunch
 +
 
 +
2013-04-30 - assembled scanner kit. needs cameras, glass, trigger mechanism, software
 +
 
 +
2013-04-21 - looked at reetz scanner and prism scanner, discussed content and licensing. will order reetz kit. full notes: [[Digital Archivists 2013-04-21]]

Latest revision as of 01:10, 19 May 2015

Contents

[edit] INTRODUCTION

The Noisebridge Digital Archivists group is for anyone interested in a range of topics including book scanning, digitizing video/audio, open access, creative commons, and more. We usually meet on Wednesdays at 7pm. Check the mailing list to see if there's a meeting this week.

The Noisebridge book scanner was demonstrated at the 2015 Maker Faire in San Mateo

ScannerMF.jpg

[edit] MAILING LIST


Many weekly updates are made exclusively on our mailing list. Please join to keep in touch and ask questions:

Join the mailing list here: digitalarchivists

A summary of topics in our mailing list will be posted soon.

[edit] HARDWARE


Our first project was to build a book scanner at Noisebridge based on the open-source DIY Book Scanner design. Check out the schematic, below, to see all the components.

Reetz Scanner Schematic.png

This model of book scanner is intended to preserve the book being digitized, therefore all page turning is done manually (by hand).

Page Turning

[edit] SOFTWARE


The device captures from both cameras simultaneously by using USB drivers provide by gphoto2 wrapped into a nice python script here. Check out gphoto2 here and the list of supported cameras here. If you have linux you can just type: sudo apt-get install gphoto2


Currently we are making simple scripts to perform Optical Character Recognition (OCR) and produce searchable full text as PDF documents Github Code Repository. Check it out and fork our repos.


ReetzWorkFlow.png

---

[edit] Related links


diy book scanner http://diybookscanner.org http://vimeo.com/29184137

linear book scanner project http://linearbookscanner.org

the internet archive http://archive.org/about/

prelinger archives http://archive.org/details/prelinger

new alexandria archive http://www.newalexandria.org/archive/

using archival information in interesting ways http://maptcha.org/

a film about nitrile film decomposition http://www.youtube.com/watch?v=r-FJyJjH6IE

The Digital Public Library of America (DPLA) http://dp.la/


For details about the book scanner, see: Bookscanner


[edit] Meeting notes


2015-02-25 - training session for scanner, looked at linear book scanner prototype

2015-02-11 - scanner demo, discussion

2015-01-25 - discussed Andrew Defries' scanning booth and post-processing station, experimented with Vinux

2014-09-24 - reassembled and demoed the scanner to visitors, some linear book scanner updates, discussed funding and directions for the group

2014-08-28 - social meeting at Open Drinks

2014-07-17 - discussed scanning stand design at Muddy Waters

2014-06-19 - short meeting and 5mof

2014-06-12 - discussed plans for a portable phone-based scanner, assembled cardboard prototype of prism scanner

2014-06-05 - discussed improvements to the scanner and ocr pipeline

2014-05-29 - main topics were portable scanners for the blind and OCR comparison algorithms. full notes: Digital Archivists 2014-05-29

2014-05-22 - discussed lessons learned from DIY Scanner 1.0 and a range of new initiatives. full notes: Digital Archivists 2014-05-22

2014-01-09 - scanned books, investigated glare and alignment issues

2013-12-19 - bug fixes in the code, reviewed linear scanners from Michigan, improved documentation, discussed larger book scanning network

2013-12-12 - replaced LED shields with black cloth, more scanning

2013-12-05 - scanned books

2013-11-21 - scanned a book, wrote tesseract ocr script, looked at old computers, 5mof presentation

2013-11-14 - improved scanning gui, added LED shields

2013-11-07 - replaced bunjees, mounted LEDs, tried to run spreads

2013-10-31 - reassembled scanner, looked at 'spreads' software, screen readers

2013-10-24 - disassembled and painted scanner parts

2013-10-17 - investigated glare and LED positioning, plan to paint black, 5mof presentation

2013-10-10 - mounted cameras, built trigger mechanism, wrote capture script

2013-10-03 - picked up scanner and equipment from internet archive

2013-07-27 - internet archive borrows the book scanner

2013-07-18 - 5mof presentation on book scanning

2013-07-08 - discussed scanner location, security, transportation, reimbursement, promotion. full notes: Digital Archivists 2013-07-08

2013-05-23 - scavenged and cut glass from flatbed scanners, plan to attend internet archive friday lunch

2013-04-30 - assembled scanner kit. needs cameras, glass, trigger mechanism, software

2013-04-21 - looked at reetz scanner and prism scanner, discussed content and licensing. will order reetz kit. full notes: Digital Archivists 2013-04-21

Personal tools