IQSS logo

IRC log for #dataverse, 2019-01-04

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
08:38 poikilotherm joined #dataverse
09:06 jri joined #dataverse
11:32 pdurbin poikilotherm: thanks for sending me your work plan.
11:32 poikilotherm Morning pdurbin. Sure. You're welcome.
11:33 poikilotherm Do you think it is reasonable?
11:35 pdurbin Sure. It seems fine. Also, I'm glad to hear that you and Gustavo talked on the phone. I didn't know that.
11:36 pdurbin How did that conversation go?
11:39 poikilotherm That was last year... :-D It went pretty well and it brought up the idea of an internship ;-)
11:41 pdurbin :)
13:47 donsizemore joined #dataverse
13:53 dataverse-user joined #dataverse
13:55 dataverse-user left #dataverse
14:09 isullivan joined #dataverse
14:29 pdurbin Is anyone running OS X Mojave? I upgraded and now I can't start Glassfish 4.1 from Netbeans 8.2. I just opened this issue with details: https://github.com/IQSS/dataverse/issues/5431
14:49 drew-jhu joined #dataverse
15:02 Jim_ joined #dataverse
15:18 pameyer joined #dataverse
15:21 drew-jhu happy new year, folks. @pdurbin - you mentioned a fake pid provider for dev environments & documented configuring for it here: http://guides.dataverse.org/en/4.10/developers/dev-environment.html#configure-your-development-environment-for-publishing. what is the effect of doing so? does it simply disable pid minting, or does it do something fancier?
15:21 pameyer pdurbin: sounds like there's an issue with the credential file from `asadmin login` and netbeans
15:23 pameyer @drew-jhu pdurbin's authorative on that one ;) but I'm pretty sure it just disables pid minting while acting like all the calls to the pid provider behave as expected
15:27 pdurbin drew-jhu: the fake pid provider does as little as possible. As close to a no-op as possible. It doesn't reach out over the Internet. It's lazy. :)
15:30 donsizemore @pdurbin so the fake pid provider is in fact a provided pid faker? it needs a good catty name
15:30 pdurbin meow
15:30 drew-jhu but it does have a different effect than `curl http://localhost:8080/api/admin/settings/:DoiProvider -X PUT -d JANKY` would...
15:30 donsizemore @pdurbin the sean rowe virus comes to mind
15:31 drew-jhu https://github.com/IQSS/dataverse/blob/19f19c25a95b03a6ba5b0d7c0cc4232f7cac77c2/src/main/java/edu/harvard/iq/dataverse/GlobalIdServiceBean.java#L81
15:31 drew-jhu that's good to know
15:34 pdurbin drew-jhu: you found the right line. JANKY won't work. Only FAKE.
15:34 pdurbin ... and the real DOI providers, of course.
15:35 pdurbin (DataCite and EZID)
15:38 drew-jhu so it would be accurate to say it "safely disables DOI registration" and is only available in versions >= 4.10, correct?
15:39 pdurbin correct
15:39 pameyer depends what you mean by "safely".  if I'm remembering correctly, there's no visual indication to users that the PIDs are fake.
15:40 pdurbin ah, good point
15:41 drew-jhu i'm assuming that JANKY would eventually throw an error that FAKE wouldn't, so FAKE is safer in that sense
15:41 pdurbin I haven't tried JANKY. :)
15:42 pdurbin The fake PID provider is meant for developers. Please see "Do not require setup of PID credentials for development environments" https://github.com/IQSS/dataverse/issues/5327
15:44 pdurbin Since moving developers from EZID to DataCite, we've had a lot of pain. EZID provided shared test credetials. DataCite does not. If you look at https://github.com/IQSS/dataverse/pull/5399/files you can see that I was able to replace a long paragraph with a single curl command.
15:57 drew-jhu i think a bit more elaboration in the guide would be useful to make it clear that configuring :DoiProvider to FAKE (specifically) has the effect of safely disabling DOI registration, but that the UI does not currently provide any indication that the DOIs thus created are fake.
15:58 pdurbin No objections. Would you like to create a pull request?
15:58 drew-jhu was just about to offer.
15:59 pdurbin Great! Please create an issue first. It's our way.
15:59 drew-jhu wilco
15:59 pdurbin :)
16:00 drew-jhu i have to say, phil, you are a good community engagement pull-you-in person
16:02 pdurbin one of us, one of us
16:03 pdurbin I gave a somewhat recent talk on this stuff: http://wiki.greptilian.com/talks/2017/javaone-how-to-run-an-open-source-project/
16:43 drew-jhu i'd say you walk that talk, phil. #thumbsup
18:02 pdurbin :)
18:04 pdurbin pameyer: the thing is, I'm not putting a password on glassfish. I never do.
18:07 pameyer pdurbin: I probably got confused - somehow thinking new version of netbeans might be handling the glassfish interaction differently, and trying to do some kind of default login
18:07 pameyer but yeah, that shouldn't matter for os changes
18:51 leonid just to chime in the "fake" pid provider discussion. specifically re: "there's no visual indication to users that the PIDs are fake." I've said this before - they are not really "fake"; they are legitimate DOIs, issued in the name space (10.5072) that is specifically designated for assigning local, unregistered identifiers.
18:52 leonid there is nothing more fake about this setup than, say, if you were using EZID or DataCite, but were using the same test name space. In both cases you end up with dois that are not registered anywhere.
18:53 leonid the visual indication is the test name space. the analogy is the numeric notation for "localhost" -  127.0.0.1. It is a legitimate ip address; but you know that it's not registered in the DNS and only valid locally because it's purposefully reserved.
18:55 pdurbin leonid: yeah, and as you've said, similar to private IP addresses: 192.168.x.x, 172.16.x.x, 10.x.x.x. These IP addresses are not fake but they are not to be used on the public Internet. Similar concept to 10.5072.
18:55 pdurbin drew-jhu pameyer ^^
18:56 leonid So, in retrospect, maybe naming it "test" instead of "fake" would have made more sense... I wish I thought about it back then.
18:56 pameyer yeah; for devs I don't have any problems with that.  but my expectation is that the number of users who'd recognize a test DOI namespace is even less than the number that would recognize a localhost ipv4 address
18:58 leonid This is true. But, again, this is not unique to this new "fake" provider setup. Our default, out of the box setup for years was EZID, with a test account and this same test name space... same end result - ids that may (confusingly) look to some users as real, registered global ids.
18:58 leonid (i recognize that this may be a problem;)
18:58 pameyer very good point - this is not a new problem ;)
19:00 pameyer I might be remembering wrong, but I think that test ezid dois would resolve (until they expired)
19:00 pameyer don't think datacite test ones resolve, even if they're configured with a landing page
19:01 leonid they are actually reserved, temporarily, yes. so, strictly speaking, i meant "the end long-term result is the same".
19:03 leonid this means that using the test name space with EZID does have some value: you are testing your account credentials and/or the registration functionality in Dataverse.
19:06 pameyer authenticating to external services is definately someplace I'd rather get it working before switching to production ;)
19:54 drew-jhu not having an installation of 4.10 with which to test & develop a clearer understanding of how the new fake doi provider actually works is costing me here... i guess my thinking was that if i have configured my instance to use the fake provider, i'm probably not going to bother to also configure it to use the test namespace, in which case there would really be no visual indication that these DOIs won't behave like others do. TBH, this is no
19:57 pdurbin not...
19:57 pdurbin drew-jhu: I think you got cut off :)
19:58 drew-jhu i just wanted to hit return after typing so many words and see if more talking was really necessary :)
20:00 pdurbin TBH what though?
20:01 drew-jhu one could provide guidance on whether it should be used on a staging environment or just production & whether it should be used in conjunction with a test namespace or if it obviates that, along with the pros & cons of mixing and matching these approaches, but that would probably be a blog post, rather than a quick note in the docs & would really end phil's victory of replacing a paragraph with a curl statement
20:02 pdurbin heh
20:04 pameyer maybe add something to the going live section of the docs? "don't use the FAKE provider in production"
20:05 pdurbin I was feeling really bad for new contributors like Sheldon who were struggling: https://github.com/IQSS/dataverse/issues/5263
20:07 leonid joined #dataverse
20:08 drew-jhu sheldon may be hitting up against the issue with the test environment DataCite just reported, but i'm not sure. once they told us about it, i stopped persuing that test
20:09 leonid fwiw, the test name space ("authority
20:10 leonid fwiw, the test name space (10.5072) is always set up by default, by the installer; by this line in setup-all.sh:
20:11 leonid curl -X PUT -d 10.5072 "$SERVER/admin/settings/:Authority"
20:11 pdurbin drew-jhu: just reported? Today? Where?
20:15 drew-jhu day before yesterday, jon crabtree created test credentials for us under the GDCC account, but rteported he was unable to assign a prefix for it. yesterday, DataCite said, "We are aware of a bug that is affecting the assigning of prefixes to clients at the moment. We have a Github issue open and the technical team are working to resolve this as quickly as possible. I will let you know once it's fixed."
20:17 pdurbin Huh. Did they link to the GitHub issue?
20:17 drew-jhu keep in mind, i'm fairly new to dataverse myself & we only just migrated to datacite right before the holidays, so i'm not as familiar with how things are supposed to work as i'd like to be.
20:18 pdurbin maybe it's this one: https://github.com/datacite/bracco/issues/126
20:19 drew-jhu <shrugs>
20:21 pdurbin 90 repos under https://github.com/datacite so who knows
20:22 drew-jhu ...and most of them with codenames, to boot
20:30 drew-jhu heh. earlier, i said, "one could provide guidance on whether [FAKE] should be used on a staging environment or just production" i meant "...or just development"
20:31 pameyer gotcha :)
20:32 drew-jhu hopefully we don't need to tell people not to use FAKE in production
20:33 pameyer there have been a surprising number of times that folks have gone into production with the default robots.txt , and then had problems with getting their datasets indexed by google...
20:33 drew-jhu good point.
20:35 pameyer I'm glad you mentioned it though - because it would be a harder to fix failure mode than robots
20:37 drew-jhu so, once i have a more fully functional test environment and a fuller picture of how thigs can work with the test datacite sandbox &/or the FAKE DOI provider, i would entertain the possibility of trying to draft more documentation or perhaps even a blog post, should that appear needful. in the meantime, if someone else is inclined to elborate on my elaboration in PR #5433, that's cool by me.
20:40 pdurbin I'm fine with what you wrote and moved it to QA. I'm not sure if leonid or pameyer have looked at it yet. Here's a handy link: https://github.com/IQSS/dataverse/pull/5433
20:40 pameyer pdurbin: I just took a quick look
20:40 pdurbin drew-jhu: did you see the note about the config page in the issue?
20:40 pdurbin "FAKE" appears there too. In the Installation Guide, I mean.
20:42 drew-jhu i did & i think that's helpful. however, when i came across the curl line, i didn't immediately think to look elsewhere to see how it worked, so i do think that some elaboration at that point in the docs is useful
20:44 pdurbin Sure. One thing I'll say about the "dev env" page is that we try to keep it short.
20:44 pdurbin It used to be super long. I moved a lot of stuff into other pages recently.
20:44 pdurbin We don't want to overwhelm new contributors with a super long page.
20:47 pdurbin like we used to :)
20:48 pameyer for what it's worth, I may have never read the original "dev env" stuff
20:49 drew-jhu if you think terseness would be more valuable here, then that's cool too.
20:50 drew-jhu it wasn't entirely clear to me a first glance whether FAKE was a magic word that would cause the code to behave differently or if it was just a brute-force way to keep it from butt-dialing anyone
20:56 drew-jhu heh. it only now occurs to me that i did read that section & the part taken out is what alerted me to the possibility of getting test credentials for https://mds.test.datacite.org
20:58 pdurbin Heh. Well, you're welcome to put whatever in your pull request. Want me to pull it out of QA so you can work on it some more?
21:01 drew-jhu i'm disinclined to add more now unless someone else has a strong opinion about it. as i say, after i have more experience with the sandbox (and 4.10), i'd consider elaborating more *somewhere* (maybe not on that dev page, if you're trying to keep that short)
21:07 pdurbin Sure, sounds good. Again, no objections to what you added so far.
21:11 poikilotherm joined #dataverse
21:51 pdurbin folks, have a good weekend, I'm out
21:51 pdurbin left #dataverse

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.