Talk:Data curation
This is the talk page for discussing improvements to the Data curation article. This is not a forum for general discussion of the article's subject.
This article is rated Start-class on Wikipedia's content assessment scale.
Untitled
This topic (Data Curation) should probably be merged with the topic on "Digital Curation". They seem to be the same thing.
- I disagree. There's no curational aspect to data wrangling, which is all about using data that happens to be in an inconvenient format. Data curation is about preserving data for the future. Curation may make future munging easier (or less necessary), but that is not its primary goal, per se. People who do data curation almost of necessity must wrangle with data formats, but people who wrangle with data need not be concerned with its curation. There are a lot of web apps out there that screen-scrape data or repurpose it in unexpected ways, but I hardly think their authors are too concerned with long-term preservation. It's the difference between maintenance and application. Phette23 (talk) 06:08, 8 February 2013 (UTC)
- I disagree as well. Data wrangling/munging is but one possible activity of data curation. Rcamilled (talk) 17:01, 11 February 2013 (UTC)
- Something is amiss here. I also disagree that data wrangling and data curation should be merged into one topic. So I agree with the above two posters, and disagree with the statement that appears on the main article page. However, just above, it says that this article on data curation should be merged with an article in library science titled "Digital Curation". That does make sense to me. I think this is best kept in the area of library science. Can the note on the main page be altered to match the note at the top of this talk page? Or do we have to wait longer for others to respond to the suggestion regarding data wrangling? MarkGoldfain (talk) 15:48, 13 March 2013 (UTC)
Wiki Education Foundation-supported course assignment
This article is or was the subject of a Wiki Education Foundation-supported course assignment. Further details are available on the course page. Student editor(s): Jenny 1990, Lwarres.
Above undated message substituted from Template:Dashboard.wikiedu.org assignment by PrimeBOT (talk) 19:49, 17 January 2022 (UTC)
External links modified
Hello fellow Wikipedians,
I have just modified one external link on Data curation. Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:
- Added archive https://web.archive.org/web/20120123161104/http://3roundstones.com/led_book/led-curry-et-al.html to http://3roundstones.com/led_book/led-curry-et-al.html
When you have finished reviewing my changes, you may follow the instructions on the template below to fix any issues with the URLs.
This message was posted before February 2018. After February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{source check}} (last update: 5 June 2024).
- If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
- If you found an error with any archives or the URLs themselves, you can fix them with this tool.
Cheers.—InternetArchiveBot (Report bug) 06:15, 5 September 2017 (UTC)
Class Edit
For a class I have been assigned this Data Curation article, which has been rated as Start-Class on the quality scale (a fair rating in my opinion), and Mid-importance on the importance scale (I think it could gain importance if it were of a higher quality). There are several potentially confusing aspects to this article.
For example, it does not link to the Digital Preservation or Digital Curation pages anywhere (this is a content gap), although both of those pages link to this one. While these three concepts are not interchangeable, they are related, so I do think they should all link to each other.
This Data Curation page is less library-specific, and its opening definition much broader, than the other two pages. While this Data Curation page is listed under the Information Science category, it is also listed as within the scope of the WikiProject Computational Biology. Unlike the Digital Curation page, this Data Curation page is mainly about data in non-library contexts, but does go on to cite a definition from the University of Illinois’ Graduate School of Library and Information Science.
The sentence “The exact curation process undertaken within any organization depends on . . . how much noise the data contains . . .” is not clear about what it means by “noise”. Does it mean superfluous data, data that can be discarded? More precision would be helpful here.
There are also some positives about this article, such as the number of links to other Wikipedia articles. The link to the Data page is especially important because, to learn about Data Curation, it is essential to first understand the definition of data itself. However, the broad definition of curation does not link to the Wikipedia page on Curation (another content gap), though the Curation page does link to both the Data Curation and Digital Curation pages.
Minor grammar edit
I fixed some of the grammar in this Data Curation article, but I’m still reading through all of the related linked pages, continuing to identify content gaps or overlap, and making notes of references to add to cover these gaps. I reworded the opening sentence of the Definitions and Practice section to improve grammar. Lwarres (talk) 20:50, 15 February 2018 (UTC)
History
I think the history of data curation began much earlier than the 1982 date cited:
The Inter-university Consortium for Political and Social Research (ICPSR) dates to the 1960s.[1]
For example, census data has been available in tabulated punch card form since the early 20th century. It has been electronic since the 1960s.[2]
I may also add something about the crisis in space data, which led to the creation of the OAIS model,[3][4] and about which institutions have shaped data curation's history, development, and intellectual foundations.[5]
Lwarres (talk) 21:03, 1 March 2018 (UTC)
References
- ^ ICPSR history page.
- ^ Preserving Digital Information (PDI) report, 1996, pp. 2-3.
- ^ "The Hackers Who Recovered NASA's Lost Lunar Photos", Wired, April 23, 2014.
- ^ "Lost on Earth", NYTimes, March 20, 1990.
- ^ Borgman, C. (2015). Big data, little data, no data : Scholarship in the networked world. Cambridge, Massachusetts: MIT Press.
Wiki Education assignment: IFS213-Hacking and Open Source Culture
This article was the subject of a Wiki Education Foundation-supported course assignment, between 30 January 2024 and 10 May 2024. Further details are available on the course page. Student editor(s): SuziSigler (article contribs).
— Assignment last updated by KAN2035117 (talk) 17:48, 7 March 2024 (UTC)