IQSS logo

IRC log for #dataverse, 2018-02-14

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
00:18 pdurbin yeah, it does look odd
08:11 jri joined #dataverse
11:20 jri joined #dataverse
12:32 danmcp joined #dataverse
13:58 danmcp joined #dataverse
16:05 danmcp joined #dataverse
16:05 danmcp pdurbin: ping
16:05 danmcp pdurbin: sorry was disconnected, not sure if you responded yet
16:26 pameyer joined #dataverse
16:29 pdurbin danmcp: mornin. What's up
16:30 danmcp pdurbin: question for you related to the group building an open service broker to connect to dataverse
16:30 danmcp pdurbin: as someone consuming data through the dataverse api, would I want to connect to the whole dataverse deployment, or a single dataverse.
16:31 danmcp pdurbin: this corresponds to how many choices a user will get in the OpenShift UI of what to connect to
16:32 danmcp pdurbin: my thought was that it should show an option for each dataverse
16:35 pdurbin danmcp: sigh. The term "dataverse" is overloaded. I stubbed out a glossary ( https://docs.google.com/spreadsheets/d/1xje8FADNjCKNBNExXjKQXrmRgc-WQxEv7jKJrDOuuLA/edit?usp=sharing ) but I think you get that Dataverse is what we call the software and within an installation there can be a hierarchy of dataverses.
16:36 danmcp pdurbin: yeah, I am asking about the hierarchy
16:36 danmcp pdurbin: does it make sense that a user would want to connect to just one of those for pulling data?
16:37 danmcp pdurbin: or multiple is fine as well.  but is it reasonable for them to connect to each one they want to pull data from?
16:37 danmcp pdurbin: with an explicit choice
16:37 pdurbin Sure. If you're an astronomer, perhaps you're interested in the hierarchy of data under the Center for Astrophysics dataverse: https://dataverse.harvard.edu/dataverse/cfa
16:38 pdurbin If you're a social scientist, perhaps you're interested in the hierarchy of data under this social science dataverse (MRA): https://dataverse.harvard.edu/dataverse/mra
16:39 danmcp pdurbin: cool, I might ask you the question next time we are on the phone together to make sure we are totally on the same page.  But I am going to assume we understand each other for now :)
16:39 pdurbin danmcp: sounds good. Do you need any guidance on which APIs to use?
16:39 danmcp pdurbin: we will need the list dataverses api
16:39 danmcp pdurbin: a pointer would be great
16:41 pdurbin I think you should start with the "contents" API. It looks like this in the API Guide: GET http://$SERVER/api/dataverses/$id/contents
16:41 pdurbin http://guides.dataverse.org/en/4.8.5/api/native-api.html#dataverses
16:42 danmcp pdurbin: thx
16:42 pdurbin sure. like this: https://dataverse.harvard.edu/api/dataverses/cfa/contents
16:45 danmcp pdurbin: thx!
16:46 pdurbin It's weird to me that these datasets about Roman warships are under the Center for Astrophysics dataverse but what do I know: http://dx.doi.org/10.7910/DVN/23994 :)
16:47 danmcp pdurbin: actually, where is the top level list of dataverses?
16:50 pdurbin danmcp: it's a little weird but it depends on the "alias" of the root dataverse. By default, this the alias is "root" but even on the demo site the alias of the root dataverse is "harvard". You can tell what the alias is by clicking on "Advanced Search" on the home page and inspecting the URL, such as https://demo.dataverse.org/dataverse/harvard/search
16:51 pdurbin Here's one with the out-of-the box alias of "root" for the root dataverse: http://phoenix.dataverse.org/api/dataverses/root/contents
16:51 danmcp pdurbin: cool thx
16:53 pdurbin danmcp: please be advised, however, that there is a known performance problem with this "contents" API endpoint ( https://github.com/IQSS/dataverse/issues/2122 ) so I guess I'd suggest the Search API if it's too slow.
16:55 danmcp pdurbin: cool thx
16:56 pdurbin danmcp: also, if the use case is pulling data, like you said, you may want to use a standard that Dataverse supports called OAI-PMH. We also call it "harvesting". Harvesting "sets" can be defined on the server for various subject of interest and the client can harvest just that data. This is metadata only that's "harvested", not the files themselves.
16:57 pdurbin But once you have the database IDs for the files, you can download them using the "access" API.
16:57 danmcp pdurbin: cool, that will be the next step
16:58 pdurbin Ok, so you do want to download the files themselves. That's fine.
16:59 pdurbin Feedback on these APIs is great because I don't really use them myself except for testing.
16:59 danmcp pdurbin: cool, we'll let you know how it goes
17:00 pdurbin thanks. I'm still a little confused about what you're building but there's no rush
17:00 danmcp pdurbin: yeah let's chat about it next meeting
17:00 pameyer I'm happy to see another user interested in APIs
17:00 pdurbin sounds good
17:01 pdurbin pameyer: please keep the feedback coming :)
17:01 pdurbin I see OpenShift at https://www.openservicebrokerapi.org so maybe that's what he was talking about when he said "open service broker".
17:10 danmcp joined #dataverse
17:42 pameyer joined #dataverse
18:31 pdurbin ah, more on what Roman warships have to do with astrophyics: http://space.mit.edu/home/guenther/roman.html :)
18:32 pameyer joined #dataverse
18:39 pameyer joined #dataverse
18:45 pameyer without reading either of those links; back when maritime navigation involved astronomy, I could image astrophysics being something the navy would be interested in (and possibly interested in funding research / keeping it secret)
18:51 pdurbin yeah
19:02 pameyer joined #dataverse
21:20 pameyer joined #dataverse
22:14 pameyer pdurbin: question for you about sysadmin types and dependencies

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.