IQSS logo

IRC log for #dataverse, 2018-09-28

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
11:52 poikilotherm joined #dataverse
14:14 djbrooke joined #dataverse
14:44 djbrooke joined #dataverse
14:46 donsizemore joined #dataverse
14:50 Jim_ joined #dataverse
14:51 djbrooke joined #dataverse
14:54 Jim_ hi all - I was just going through a test 4.8.6 to 4.9.almost3 upgrade on TDL's dev machine (looks good so far) and wondered if anyone has thought about amending the 4.9 install instructions to point out you should not do registerDataFileAll if you're going to turn file PIDs off in 4.9.3?
14:55 pdurbin Jim_: oh, the release notes for 4.9?
14:55 Jim_ yes
14:56 Jim_ perhaps obvious, but a bad thing to do if you wrotely update version by version
14:56 Jim_ rotely
14:56 pdurbin yeah
14:57 pdurbin release notes aren't versioned the way guides are
14:57 poikilotherm joined #dataverse
15:00 pdurbin Jim_: I just added a warning. Please let me know what you think: https://github.com/IQSS/dataverse/releases/tag/v4.9
15:11 djbrooke joined #dataverse
15:11 djbrooke joined #dataverse
15:16 poikilotherm Hi pdurbin, could you give me hint where I find the code that runs tasks at startup of dataverse and where I could find something about the default workflows being initially deployed to a new install?
15:17 poikilotherm I'm a bit lost in the code base :-D
15:23 Jim_ pdurbin - looks good except for the type (what instead of want). I guess it could be even more explicit - if you are upgrading to 4.9.3 or beyond and intend to disable file DOI creation, do not do this step.
15:27 Jim_ As for workflows, the ones I know of are all just examples - in scripts/api/data/workflows and there are no workflows enabled by default. http://guides.dataverse.org/en/latest/developers/big-data-support.html has some info and the api to check on them is described on  http://guides.dataverse.org/en/latest/api/native-api.html.
15:27 Jim_ (just been working to add a post-publication workflow to submit archival copies of data to the Digital Preservation Network...)
15:36 djbrooke joined #dataverse
15:38 djbrooke joined #dataverse
15:42 djbrooke joined #dataverse
15:58 poikilotherm joined #dataverse
15:59 poikilotherm @Jim_ that is the intention I am thinking in - but for me this would be related to PID minting at different stages in a very configurable way.
16:00 poikilotherm I hoped the Workflows would be the thing to look at for the different stages of a dataset from creation over update to publication
16:14 pdurbin Jim_: thanks, I just changed it to this: **Please note: Do not run the `registerDataFileAll` command below if you do not plan to give your files persistent identifiers, which are no longer required in 4.9.3 or later (#4929).**
16:15 pdurbin poikilotherm: hi, so you have two questions? Tasks at startup? Docs on workflows?
16:15 poikilotherm I found the workflows and how they are triggered thank to Jim_
16:16 poikilotherm Currently digging into this...
16:16 poikilotherm Maybe this is a good place for the PID minting stuff
16:16 poikilotherm The other question is about startup tasks
16:17 poikilotherm Do you have such a construction in place?
16:17 poikilotherm E. g. to validate configs?
16:17 poikilotherm I'm asking because that could be usefull to load some kind of plugin for the PID providers
16:19 pdurbin poikilotherm: I would suggest looking at the @PostConstruct startup method that calls registerProviderFactory at https://github.com/IQSS/dataverse/blob/v4.9.2/src/main/java/edu/harvard/iq/dataverse/authorization/AuthenticationServiceBean.java#L111
16:20 pdurbin there's also something for starting the EJB timer stuff
16:22 poikilotherm Ah thx
16:22 poikilotherm Will look into that.
16:22 pdurbin sure
16:22 poikilotherm I forgot: an update about #4106, the pid plugin issue...
16:23 poikilotherm I have notice from Felix Bensmann (fbgesis) that he is not involved in this stuff anymore
16:23 poikilotherm There is another employee from GESIS following up to his work, but I hadn't a chance to talk to him yet.
16:23 poikilotherm Will keep you posted how it goes.
16:26 pdurbin huh, are you sure he isn't involved anymore?
16:29 poikilotherm Yes - he wrote me an email today :-D
16:29 poikilotherm I can forward this to you, but it is in german... ;-)
16:30 poikilotherm Ok, he said he is only doing "consulting" from now on - the actual implementation work seems to be done by a Mr. Klas
16:32 poikilotherm https://www.gesis.org/institut/mitarbeiterverzeichnis/person/?tx_gextstaffdir_staffdirectory%5Bemail%5D=claus-peter.klas%40gesis.org&tx_gextstaffdir_staffdirectory%5Baction%5D=details&tx_gextstaffdir_staffdirectory%5Bcontroller%5D=Index&cHash=2a391c38fc0fa940c580b70a939365f7
16:39 pdurbin ok, well, you're welcome to invite him to join us here
16:40 poikilotherm Yes Sir! ;-)
16:40 poikilotherm Hope to get in touch with him next week. We'll see.
16:42 pdurbin cool
16:42 pdurbin I'm glad you're thinking about the pluggable PID stuff.
17:16 poikilotherm joined #dataverse
17:16 poikilotherm Sure. This is pretty urgent for us, as we need to get our repo in place more sooner than later...
17:19 poikilotherm Do you think there is help wanted with the Java 11 stuff? This seems as it might blow up in the nearer future...
17:19 djbrooke joined #dataverse
17:22 pdurbin You're welcome to see if you can reproduce the errors I'm seeing: https://github.com/IQSS/dataverse/issues/4259#issuecomment-425453563
17:44 poikilotherm joined #dataverse
17:53 poikilotherm joined #dataverse
19:07 poikilotherm Hey @pdurbin, just updated #4259 with my experiences :-)
19:08 * pdurbin looks
19:30 poikilotherm While looking at the failing tests: have you guys ever thought about replacing jHove with something like Apache Tika?
19:30 poikilotherm As far as I can see, jHove is only used for file type detection, right?
19:31 poikilotherm Tika has quite a few more formats... ;-)
19:32 pdurbin poikilotherm: Jim_ and djbrooke mentioned Tika at https://github.com/IQSS/dataverse/issues/5030
19:32 pdurbin yes, I believe jhove is only used for file type detection
19:34 pdurbin poikilotherm: did you see my comments on https://github.com/IQSS/dataverse/pull/5059 ?
19:36 poikilotherm pdurbin: whooops - not yet. Thanks for the tip... *reading*
19:36 pdurbin sure
19:42 pdurbin poikilotherm: by the way, Jim_ is the one who wrote the google doc I was telling you about... his roadmap for stuff he's working on as a contributor
19:45 djbrooke joined #dataverse
19:45 poikilotherm pdurbin: I thought so... ;-)
19:45 poikilotherm pdurbin: Just commented your review :-)
19:46 pdurbin thanks, I'm commenting on your comments
19:47 pdurbin if it's easier, we can just talk here. I still have another 45 minutes before I need to pick up the kids
19:47 poikilotherm As you prefer :-)
19:47 pdurbin I like your proposed new names for the JVM options.
19:47 pdurbin I think they help reduce potential confusion.
19:47 poikilotherm Ah, the conflict results from a commit coming in after I rebased earlier this morning...
19:48 poikilotherm Whatever you guys prefer... Would this trigger reiteration of reviews?
19:48 pdurbin no, I don't think so
19:49 pdurbin I mean, QA might find a bug and send it back to you.
19:49 poikilotherm Ok, then I will change 'em...
19:49 djbrooke joined #dataverse
19:49 poikilotherm Will take a few moments :-D
19:49 pdurbin No rush. Thanks. Are you going to provide QA a Minio server to test with? That would be ideal.
19:50 poikilotherm They can simply use the play.minio.io:9000 ;-)
19:50 poikilotherm Credentials and stuff already in the docs :-D
19:51 djbrooke joined #dataverse
19:51 pdurbin Ok. Maybe you can put that in the guide. I do something similar for Test Shib in the shib section of the guides.
19:51 pdurbin "try out this test service to see if it works"
19:51 poikilotherm About your comment regarding the empty compilerArgument - just leave a comment above it in pom.xml?
19:51 pdurbin yes a <!-- comment -> please
19:51 poikilotherm Sure.
19:52 poikilotherm I will add the comment about the testing to the working combination report, ok?
19:52 pdurbin working combination report?
19:59 poikilotherm Yeah, new section
20:00 poikilotherm "
20:00 poikilotherm Reported Working S3-Compatible Storage
20:00 poikilotherm Minio v2018-09-12
20:00 poikilotherm Set dataverse.files.s3-path-style-access=true, as Minio works path-based. Works pretty smooth, easy to setup.
20:00 poikilotherm HINT: If you are successfully using an S3 storage implementation not yet listed above, please feel free to open an issue at Github and describe your setup. We will be glad to add it here.
20:02 pdurbin sure, sounds fine. and a link to the play thing
20:02 poikilotherm Of course :-)
20:02 pdurbin oh, you added it already
20:02 pdurbin great
20:25 poikilotherm DONE :-)
20:27 poikilotherm And I'm done for today, too ;-) Its 22:27 here, let's call it a day.
20:28 poikilotherm Have a nice weekend :-)
20:28 pdurbin you too!
20:30 pdurbin all of you
20:30 pdurbin left #dataverse
20:53 djbrooke joined #dataverse
20:58 djbrooke joined #dataverse
21:01 djbrooke_ joined #dataverse
21:30 djbrooke joined #dataverse

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.