06. Data
December 14, 2020
Agenda
Meeting notes
Note-taker: Zach
Business
- Workshop ideas & dates
- Zach - QGIS workshop, what GIS?
- for first workshop show projects across disciplines. Does not need to be specific to Arch, Classics, charset
- Kristen - html Workshop
- Security- community.reclaimhosting.com/installing-free-ssl-certificates/325
http vs https: the “s” means the site is certified as protecting data through encryption.
- in domain of ones own go to “Let’s Encrypt SSL”
- Security- community.reclaimhosting.com/installing-free-ssl-certificates/325
http vs https: the “s” means the site is certified as protecting data through encryption.
- Thu - workshop on introduction the digitization of papyri in papyrology
- OCR optimizing? photo processing steps to optimize? See Tesseract OCR.
- Talked about ways to make this more of a DS session.
- Workshops on Zoom. Will be recorded and can be put on Panopto
- Zach - QGIS workshop, what GIS?
- Roles for Spring work
- Questions and updates
- Examples of professional websites: added to Resources for Session 04
- examples on the DSGF syllabus page.
- question about why you would share a github profile: sharing code, collaborative writing, data scraping, etc.
- unsplash.com - photos for everyone.
- Kristen question: creative commons photos. Some CC request a citation. How do you cite?
- AM response: Add caption or photo credit in a footer. Unsplash has code for embedding a badge or text to a photo.
-
Metadata and images: Omeka and Scalar show metadata. Scalar especially using javascript and to a lesser extent css. Wax.
- Accessibility
- Kristen, site was pinged for text size and photo alt text . Language needs to be declared
- web accessibility evaluation tool
- Skip tp main content link should be present.
- Security - instructions on issuing ssl certificate for your domain
- Examples of professional websites: added to Resources for Session 04
Data
- Data Cleaning with OpenRefine tutorial
- OpenRefine
- Data cleaning with open refine. See link on DSGF syllabus.
- Download and play around with it.
- Data cleaning with open refine. See link on DSGF syllabus.
- Tidy data
- specific approach common to people using R. How to turn a dataset into something that is compatible with R’s tools.
- Data provides. Structure and standardization.
- What is data? Think about how data is defined and managed. Standardization for machine learning vs. digital exhibition vs. spatial analysis.
- Data is specifically rhetorical. It is not data unless you are looking at it “as data”. Likewise cleaning data requires interpretation and a priori structural ideas.
- Tabular vs. graph data (hierarchical, rhyzomatic/networked, or polarized)
Resources
For next time
Friday, January 22
- Choose a date and time for your workshop and write a short description
- Propose a lightning-round topic for the Grad Showcase in February
- Email me and let me know:
- How many hours you’d like to work next semester
- Your preferences between: running a community of learning, teaching additional workshops, helping develop and run events, supervising undergraduates
- Finish OpenRefine tutorial
«< Previous | Next »> |