IQSS logo

IRC log for #dataverse, 2018-10-11

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
01:40 andrewSC joined #dataverse
01:44 jri joined #dataverse
07:22 jri joined #dataverse
08:22 jri_ joined #dataverse
08:23 jri joined #dataverse
08:59 poikilotherm joined #dataverse
11:36 donsizemore joined #dataverse
11:38 donsizemore @pdurbin i've decided it's better to keep the suggested branch and repo in the script, allow them to be overridden with runtime flags, and store only the datacite stuff in the env file. seems like the easiest workflow for developers
11:38 donsizemore @pdurbin so lemme work on that (assuming the building doesn't wash away today)
11:48 pdurbin donsizemore: ok, that sounds good. I think the most common use case around here is to pass in the branch with -b. Pull requests from outside IQSS are getting more common, though, so continuing to support -r to pass in the repo url will be nice, I think.
11:48 pdurbin Plus, that way we don't have to mess with the docs as much. :)
11:48 pdurbin Though we should explain in the docs the new "env" file.
11:48 pdurbin If you need a hand with that, I'm happy to add it.
11:53 pdurbin donsizemore: did you see that Odum is mentioned in the "Open Science Infrastructure WG 10/24 Call Agenda" doc? Please see https://docs.google.com/document/d/1bOVWBfhOiKGU2dYoHN_Pkpv5zPI6y819UISfkTqYpoQ/edit?usp=sharing
11:54 donsizemore @pdurbin jon was asking if i was on the e-mail chain about that call. he's going to ask craig to add me to the fray
11:55 pdurbin donsizemore: ah, to get the emails, you should click "subscribe" at https://github.com/whole-tale/whole-tale/issues/43 (or leave a comment)
11:56 donsizemore done
11:57 pdurbin cool
11:57 donsizemore @pdurbin jon says there is also a mailing list
11:57 pdurbin Is there? I'm not on it. Do they have public archives?
12:04 pdurbin donsizemore: I just opened this about the mailing list: https://github.com/whole-tale/whole-tale/issues/46
12:17 cdsp-rmo hey there
12:18 cdsp-rmo pdurbin: do you have any news about the questions I had on https://github.com/IQSS/dataverse/issues/4593 ? (about multiple depositors)
12:19 cdsp-rmo I fixed points 1 and 3 by adding "good" error messages about import
12:19 cdsp-rmo but I'm still stuck with point 2
12:30 donsizemore joined #dataverse
12:41 pdurbin_m joined #dataverse
12:41 pdurbin_m cdsp-rmo: you should go ahead and make a pull request
12:43 cdsp-rmo okay
12:44 pdurbin_m cdsp-rmo: thanks. Also, what is the status of https://github.com/IQSS/dataverse/issues/5090 ?
12:44 cdsp-rmo I still had the log problem
12:45 cdsp-rmo I had to comment out the login part
12:45 cdsp-rmo gonna commit my changes, merge with latest develop, and check again if the problem persists
12:49 pdurbin_m cdsp-rmo: perfect. Thank you! Julian and I agree that multiple depositors should be supported. Can you please open a new issue about this?
12:49 cdsp-rmo okay
12:52 pdurbin_m thanks!
12:53 cdsp-rmo https://github.com/IQSS/dataverse/issues/5164
12:56 pdurbin_m thanks! I'll add some details in a bit
13:01 jri joined #dataverse
13:03 donsizemore @pdurbin_m i think i'm beating up on AWS too hard. i stuck in a sleep 10 to give the instance time to spin up, and i'm still occasionally getting timeouts
13:05 isullivan joined #dataverse
13:08 pdurbin_m weird
13:09 pdurbin_m Whoops, I forgot to kill my instance yesterday. Amazon bill. :/
13:11 poikilotherm Mornin' guys :-) News from Europe, freshly backed from the videoconference with Slava ;-)
13:12 poikilotherm I talked to Slava about the PID plugin they develop and this might a very interesting thing for #5093, too :-)
13:13 poikilotherm They have a simple service implementation which is able to talk to the Dataverse API to collect the metadata, transform it to some kind of request a PID provider needs and put the resulting PID back into Dataverse
13:14 pdurbin_m huh
13:14 pdurbin_m is it open source?
13:14 poikilotherm This of course needs some minor code changes in Dataverse (ringing the service, ...), but this sounds like a good approach to get all the custom PID stuff outside of the upstream project and away from maintenance hell for you guys.
13:15 poikilotherm Yes, it is. Don't have a link yet, but Slava will send it to me tomorrow
13:16 poikilotherm The benefit from this is that Dataverse only needs to talk over a stable API to a evolving service. This is easier for maintenance and especially for testing.
13:17 poikilotherm And if I think about my plans on switching from DataCite MDS API to REST API, this would be a very welcome approach to have the functionality separated and guaranted not to blow up existing usage.
13:18 poikilotherm This separate project is testable without a working dataverse installation, which makes it also much lighter to test
13:28 pdurbin poikilotherm: sounds great. How would you like to track this exploration? Do you want to make a new issue and we'll put it in "Community Dev" and assign it to you?
13:29 poikilotherm Let me take a look at the code first... ;-) And Slava said they wanted to put their changes to Dataverse into upstream, so they most certainly will open an issue or just reuse the existing one.
13:29 pdurbin Ok. However you want to track it is fine. Thanks for working on this!
13:29 poikilotherm Maybe I can encourage him to be more responsive on GH... ;-)
13:36 poikilotherm Question from a colleague of mine: do you guys think it would be possible to get the slides or even a recording of https://hlrdm.library.harvard.edu/event/preparing-your-data-and-code-reproducible-publication-hands-workshop-researchers
13:36 pdurbin Meh. He's flown over for two community meetings. He calls in a lot. He tours Europe giving talks about Dataverse. He's great.
13:37 poikilotherm This could be very interesting for our own workshops we need to come up with for the time after day 0, the day our dataverse installation comes to life...
13:37 poikilotherm Yes he is :-D
13:40 pdurbin huh. "Co-sponsored with Harvard Dataverse"
13:40 poikilotherm That's why I'm asking here... ;-)
13:45 pdurbin poikilotherm: I think I learned about it from https://twitter.com/hlrdmp/status/1036982254436798465 ... Can you please reply to that tweet and ask about slides and recordings?
13:47 pdurbin donsizemore: "Code Ocean Workshop - Preparing your data and code for reproducible publication: A hands-on workshop for researchers" ^^ ... seems to relate to the Whole Tale meeting we were talking about earlier.
13:48 pdurbin isullivan: looks like you're the instructor for https://hlrdm.library.harvard.edu/event/open-science-framework-workshop-0
14:14 poikilotherm pdurbin: Done. :-)
14:17 pameyer joined #dataverse
14:21 pdurbin Huh. I don't see it.
14:21 pdurbin Ah, here: https://twitter.com/poi_ki_lo_therm/status/1050389163860787200
14:26 pdurbin poikilotherm: I just reached out to the person who tweeted this: https://twitter.com/jgolds2/status/1039927601479995396
14:38 jri joined #dataverse
14:39 pameyer poikilotherm - any ideas why they didn't try to do api-compatability w\ ezid or mds for their PID stuff?
14:39 pameyer ideas that can be publicly logged, I mean ;)
14:41 poikilotherm actually these PID providers are pretty different in what they need and can do for you. it seemed not to be a good idea to be limited to the data that the ezid/mds api can transport
14:42 poikilotherm da|ra might not be the best example, but e.g. ePIC allows you to have custom key-value-pairs directly attached to your metadata entry. this is not possible and not wanted by datacite
14:42 pameyer thanks; makes sense
14:42 poikilotherm and for a more concrete example
14:42 poikilotherm i could not use the mds api to map to the rest api
14:42 poikilotherm the mds api does not know about states
14:43 poikilotherm so i would need to get that information from the metadata in dataverse anyway
14:43 jri_ joined #dataverse
14:44 poikilotherm IMHO it really makes more sense to let people provide the mapping from the metadata in dataverse into the metadata schema from the pid provider themself and not fiddle around with possible limitations from the other apis
14:46 pdurbin poikilotherm: will I be able to add a "no-op" PID provider that I can use in development? A PID provider that doesn't actually reach out over the network?
14:47 poikilotherm why not?
14:47 pdurbin cool
14:47 poikilotherm what the provider thing is doing depends on what you develop for it. of course you could use a noop, which is simply skipping the communication with an upstream service
14:48 pameyer "provide a mapping" - agreed; but I think that sorting out how to do this might be somewhat messy
14:48 poikilotherm pameyer: sure. But that's the headache of the guy who needs to mapping and does not put load on you guys ;-)
14:48 poikilotherm (which imho would be a bad idea)
14:49 poikilotherm i really enjoy the idea of a separate web service, running independently
14:50 poikilotherm this could even be used to provide custom made PIDs without any upstream support, just for internal use
14:50 poikilotherm many usecases to think of
14:50 poikilotherm but not bloating the upstream of dataverse :-)
14:51 poikilotherm Alright guys, read you tomorrow :-)
14:51 pameyer would need some changes on the dataverse side too (urls for different types of PIDs) - but less so than others
14:51 poikilotherm Have a nice day
14:51 pdurbin you too poikilotherm
15:30 jri joined #dataverse
15:51 cdsp-rmo pdurbin: the pull request is finally there https://github.com/IQSS/dataverse/pull/5170
15:52 cdsp-rmo gonna add some questions to it
15:58 pdurbin cdsp-rmo: thanks! I just moved it over to code review in Waffle.
15:59 cdsp-rmo wouhou
16:02 pdurbin cdsp-rmo: since you're adding a new API endpoint, can you document it in the API Guide? The file is doc/sphinx-guides/source/api/native-api.rst
16:02 cdsp-rmo oh no ...
16:02 cdsp-rmo strange bug came again
16:02 jri joined #dataverse
16:02 cdsp-rmo just deployed new war file, everything is fine
16:02 cdsp-rmo stopped glassfish
16:02 cdsp-rmo I wanted to do a new test, so I started glassfish
16:02 cdsp-rmo can't log anymore
16:02 cdsp-rmo ><
16:03 pdurbin :(
16:03 cdsp-rmo it's like if stop then start glassfish breaks everything
16:04 cdsp-rmo (ok for doc, that's why I left the checkbox)
16:04 pdurbin It's no fun trying to get Glassfish to behave.
16:07 pdurbin cdsp-rmo: thanks. Without any docs it's hard for QA to know what to test.
16:08 pameyer I haven't gotten it to the point where I could try dataverse, but I was able to get a JSF hello world war to deploy on glassfish, payara and tomcat :(
16:09 pdurbin pameyer: cool. Here's my non-JSF hello world that I got working fine on wildfly and payara: https://github.com/pdurbin/javaee-docker
16:15 pameyer pdurbin: sounds like we're thinking of similar baby steps
16:15 cdsp-rmo well, at least now I know what to change in the code to make it works :D
16:16 cdsp-rmo so, even if I got the bug, I can still test my things
16:16 pdurbin cdsp-rmo: sorry it's so painful
16:16 pdurbin pameyer: I'd love to hear more about what you're thinking.
16:20 pameyer pdurbin: partly inspired by the user (who's name I've forgotten) talking about ~"why have an install script when there are standard ways to deploy war files".  figured the first step of that was to get war deployment working on a bunch of app servers
16:21 pameyer didn't get farther than that; but next steps would've been seeing how much effort it would take to do 0_predeploy.sh , 1_postdeploy.sh in a generic way
16:21 pdurbin pameyer: you might be thinking of https://github.com/IQSS/dataverse/issues/4523
16:21 pameyer pdurbin - exactly
16:23 pameyer also started wondering about pre/post deploy hooks in a war file (although that was more in the context of upgrades across versions).  *looks* theoretically possible
16:24 pdurbin It would be nice to have more open source models to look at. Jenkins is written in Java but I don't know if it's a jar or a war.
16:24 pameyer war
16:24 pdurbin Hmm. Maybe we can learn something from how Jenkins is deployed, then.
16:26 pameyer works with tomcat - but I have a tendency to get mixed up about the distinctions between various types of java app servers
16:26 pameyer but maybe
16:28 cdsp-rmo well, it's time for me to go for sport
16:29 pdurbin Because Dataverse uses Java EE APIs such as JMS a "full profile" Java application server is required. See "Full" vs "Web" profile at https://en.wikipedia.org/wiki/Java_Platform,_Enterprise_Edition
16:29 cdsp-rmo doc is coming, I assure ! ;)
16:29 pdurbin cdsp-rmo: bye! Thanks for the pull request!
16:29 cdsp-rmo you're welcome
16:29 cdsp-rmo bye
16:49 pdurbin pameyer: not sure if that helps or not (full vs web). If you click the link to Adam Bien's repo from mine, I think he's on the right track with his "thin war" concept. In his world, the "full" app server is just another docker layer and the war file you're deploying is small. This all assumes Docker though.
16:53 pameyer pdurbin: I think it does - but I probably need some time to digest it more
16:56 pdurbin pameyer: basically the "fat jar" (or "fat war", I guess) model is used in Spring and deployed to Tomcat but Adam like the "thin war" model where you deploy to payara or wildfly.
16:56 pdurbin likes*
17:09 donsizemore joined #dataverse
17:10 donsizemore @pdurbin one more fix and i think we're good
17:12 pdurbin donsizemore: yeah? Great!
18:24 jri joined #dataverse
18:26 jri joined #dataverse
19:14 jri joined #dataverse
20:24 jri joined #dataverse
21:13 jri joined #dataverse
21:25 jri joined #dataverse
21:41 donsizemore joined #dataverse
22:11 pameyer left #dataverse

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.