Time |
S |
Nick |
Message |
16:46 |
|
pdurbin |
cool, the subnetwork mockups are on a blog post now: http://thedata.org/blog/upcoming-changes-version-35-dataverse-network-subnetworks |
17:15 |
|
pdurbin |
interesting... GitHub: A Tool for Social Data Set Development and Verification in the Cloud by Christopher Gandrud :: SSRN - http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2199367 |
17:15 |
|
pdurbin |
"In this brief article I show that GitHub offers a comprehensive data storage service for social scientists creating and using original data sets. It has unique tools for social data set development and accuracy verification. Furthermore, GitHub fits directly into an active research workflow, particularly one that also includes R." |
17:17 |
|
pdurbin |
heh. at least dvn is mentioned under "status quo" for data storage :) |
17:18 |
|
pdurbin |
should be an interesting read |
17:19 |
|
pdurbin |
stupid PDFs |
17:19 |
|
* pdurbin |
prints it out |
18:57 |
|
* pdurbin |
follows https://github.com/christophergandrud |
18:59 |
|
pdurbin |
right. so here's the gh-pages version of his data: http://christophergandrud.github.io/Disproportionality_Data/ |
18:59 |
|
pdurbin |
and the data itself is here: https://github.com/christophergandrud/Disproportionality_Data |
19:01 |
|
pdurbin |
I really don't have time to say much about the paper |
19:01 |
|
pdurbin |
it's good. I think the whole dvn team should read it |
19:01 |
|
pdurbin |
he's right that dvn isn't very commandline friendly |
19:03 |
|
pdurbin |
we do have a read-only API, and someone just asked at http://irclog.iq.harvard.edu/dvn/2013-05-02 about a write API. I replied and let him know we have plans for this: http://thedata.org/book/upcoming-releases |
19:06 |
|
pdurbin |
I love github... "TL;DR: GitHub is the largest public repository of the everyday experience of work. Ever. If you’re a scholar or journalist interested in collaboration, this is perhaps the most important archive you will find regarding what actually happens as people work together." -- http://7fff.com/2012/07/14/the-most-important-social-network-github/ |
19:07 |
|
pdurbin |
and I suppose it's working well enough for the author, which is fine |
19:07 |
|
pdurbin |
I certainly use the heck out of github: https://github.com/pdurbin |
19:08 |
|
pdurbin |
but I still think dvn is compelling for a variety of reasons |
19:08 |
|
pdurbin |
it's open source, unlike github |
19:08 |
|
pdurbin |
it can visualize your data |
19:08 |
|
pdurbin |
you can subset your data |
19:09 |
|
pdurbin |
harvest data over standard protocols (i.e. OAI-PMH) |
19:10 |
|
pdurbin |
etc, etc. I'm not really an expert on dvn |
19:10 |
|
pdurbin |
I try to give my take on dvn here: http://people.iq.harvard.edu/~pdurbin |
19:10 |
|
pdurbin |
why it's useful, etc |
19:11 |
|
pdurbin |
I do wonder how we could make dvn more commandline friendly |
19:11 |
|
pdurbin |
a write API would help, of course |
19:11 |
|
pdurbin |
maybe some language bindings for the API? a dvn module on CRAN? I dunno |
19:13 |
|
pdurbin |
oh, I love how he wrote his paper in rstudio :) |
19:15 |
|
pdurbin |
hmm, I don't see his paper at https://github.com/christophergandrud?tab=repositories ... he has a couple typos... Handel instead of Handle, it's instead of its ... guess I won't be sending him a pull request :) |
19:16 |
|
pdurbin |
oh, he mentions how you can follow a repo on github but he he didn't mention RSS. i.e. https://github.com/christophergandrud/Disproportionality_Data/commits/master.atom for his data set |
19:19 |
|
pdurbin |
ok, tweeted at the author: https://twitter.com/philipdurbin/status/331487282815201280 |
19:19 |
|
pdurbin |
now! back to faceted search... https://redmine.hmdc.harvard.edu/issues/2656 :) |
19:37 |
|
* pdurbin |
tries to convince my R buddy to write a wrapper for DVN: http://irclog.greptilian.com/sourcefu/2013-05-06#i_5930 :) |