IRC log for #dataverse, 2017-07-11

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

All times shown according to UTC.

Time S Nick Message
03:09 axfelix joined #dataverse
06:01 dataverse-user joined #dataverse
06:25 dataverse-user good morning
06:26 dataverse-user coming from https://github.com/nds-org/ndslabs-dataverse/issues/8 by pdurbin
06:27 dataverse-user regarding handle registration @ dataverse results in error after using the jvm options provided with database settings through curl
06:27 dataverse-user Error – This dataset may not be published because the Handle Service is currently inaccessible. Please try again. Does the issue continue to persist? If you believe this is an error, please contact Root Support for assistance.
07:02 dataverse-user i found in the logs :
07:02 dataverse-user [2017-07-11T07:02:23.723+0000] [glassfish 4.1] [SEVERE] [] [edu.harvard.iq.dataverse.HandlenetServiceBean] [tid: _ThreadID=28 _ThreadName=http-listener-1(2)] [timeMillis: 1499756543723] [levelValue: 1000] [[   Can't load private key in null: java.lang.NullPointerException]]
10:26 pdurbin dataverse-user: hi!
10:27 pdurbin That error seems to be coming from here: https://github.com/IQSS/dataverse/blob/v4.7/src/main/java/edu/harvard/iq/dataverse/HandlenetServiceBean.java#L303
11:21 donsizemore joined #dataverse
11:52 andrewSC joined #dataverse
12:13 dataverse-user oh
12:13 dataverse-user so i need to include the private phrase as well with the private key ? because i don't think we encrypted one
12:14 pdurbin well, I'm wondering if your JVM options are configured correctly
12:16 pdurbin See http://guides.dataverse.org/en/4.7/installation/config.html#jvm-options
12:18 pdurbin dataverse-user: can you please email the output of `asadmin list-jvm-options` to support@dataverse.org ?
12:52 dataverse-user joined #dataverse
13:12 pdurbin dataverse-user: the output should look something like this: https://paste.fedoraproject.org/paste/M80dUPUeVa0apzWScl4eaQ/raw
13:12 pdurbin Please notice how it has Dataverse-specific lines such as this one: -Ddataverse.fqdn=phoenix.dataverse.org
13:13 dataverse-user hmm
13:13 dataverse-user what about the handles config like this one : -Ddataverse.handlenet.admcredfile=/pathtokey.bin
13:14 dataverse-user or do i need another options for this?
13:15 pdurbin dataverse-user: I don't know very much about how to configure Handles (I use DOI instead) but if you email the output to support@dataverse.org someone will be able to help.
13:17 dataverse-user Thank you
13:17 dataverse-user done
13:18 pdurbin dataverse-user: ah, at https://help.hmdc.harvard.edu/Ticket/Display.html?id=251167 . Perfect. Thank you!
13:19 pdurbin dataverse-user: Is there a file at /siteinfo.bin ?
13:19 pdurbin Right in the root filesystem?
13:21 donsizemore joined #dataverse
13:22 dataverse-user pdurbin : yeah
13:25 pdurbin hmm, ok. well, someone will reply. sorry you're having trouble
13:26 dataverse-user no problem , thank you for your help !
13:27 pdurbin Oh sure. So you use Docker in production?
13:28 pdurbin That's what I was reading at https://github.com/IQSS/dataverse/issues/3938#issuecomment-313366127
13:28 dataverse-user Yes
13:28 pdurbin Do you like it? Any pain points? I'm pretty new to Docker.
13:29 dataverse-user No pains :D
13:29 pdurbin heh, good!
13:29 pdurbin bjonnh has been trying to help me learn Docker
13:29 dataverse-user we're not that good at java configs or java in general so we use the image to do the hardwork
13:30 pdurbin That makes sense.
13:30 dataverse-user and just use the REST api to generate data sets
13:31 dataverse-user but i think updating the docker image will be hard if there is no official support since we have to rely on someone to update it
13:31 dataverse-user anyway i have to go , thanks again for the help much appreciated! good bye
14:02 donsizemore joined #dataverse
14:03 pdurbin donsizemore: thanks for https://github.com/IQSS/dataverse/pull/3991 !! I moved it to Code Review for you.
14:04 donsizemore @pdurbin it may not be purty, but the installation works on Ubuntu and in Vagrant, and I've tested the init script on CentOS6 and the systemd script on RHEL7
14:07 pdurbin Nice. systemd is still new and fancy for me. :/
14:12 donsizemore @pdurbin it ain't no thang. i'm more concerned with whether my soy sauce is low sodium =)
14:12 pdurbin heh
14:13 pdurbin donsizemore: are you aware of how many Dataverse installations are on AWS? I'm not but I hear it's a thing.
14:15 donsizemore @pdurbin i'm not but we've priced out putting ours there. once our VMware cluster comes up for hardware refresh, AWS may look pretty attractive
14:15 donsizemore @pdurbin i can share my silly little map of how Odum might implement it, if you're interested?
14:16 pdurbin TDL is on AWS according to some notes I took at https://groups.google.com/d/msg/dataverse-community/6Twh8KLhmQg/rR9TP-cPAQAJ
14:16 pdurbin donsizemore: map? Like a network diagram? Sure, please share it up.
14:17 donsizemore @pdurbin oh, i meant pricing particular VMs by service level
14:20 pdurbin oh, sure, I'll take that too
14:22 donsizemore @pdurbin sorry, i thought we could share with external folks. i blame Bill Gates personally.
14:22 pdurbin heh, no worries
14:32 pameyer joined #dataverse
14:36 pameyer systemd still seems somewhat counter-intuitive to me
14:38 pdurbin pameyer: lemme know when you upgrade from RHEL/CentOS 6 to something newer :)
14:38 pameyer we've got cent7 boxes in production ...
14:39 bjonnh donsizemore: how many users?
14:40 donsizemore @bjonnh we have 1118 in our authenticateduser table... @pdurbin would the builtinusers be copied over as they sign in after our migration to 4?
14:41 bjonnh what is the size of the storage?
14:41 bjonnh and are all the users located in a single geographical region?
14:41 pdurbin donsizemore: the dashboard for users just got merged into develop yesterday. You should try it out on you 1118 users. :)
14:41 pdurbin your*
14:42 donsizemore @bjonnh right now our Dataverse VM has a 2TB thin-provisioned VMDK (the maximum supported by fault tolerance) and consuming <200GB
14:43 pdurbin donsizemore: I
14:43 pameyer huh - I'd assumed you'd had VMs talking to fileservers over NFS/IRODS
14:43 bjonnh oh wow so your storage cost will be like nothing on AWS
14:43 pdurbin donsizemore: I'm confused. You already migrated to Dataverse 4.
14:43 donsizemore @bjonnh we accept users from anywhere. a select distinct on affiliation from authenticateduser is all over the place - 491 results
14:43 bjonnh do you have an idea of how much upload you are doing?
14:44 bjonnh (how much your server outputs to the net)
14:44 donsizemore @pdurbin i wasn't sure of the relationship between builtinuser and authenticateduser pre- and post-migration
14:44 donsizemore @bjonnh downloads are bursty but our average over a month is i think... hold on, i measured that, gimme a sec
14:44 bjonnh well just a range, 1G, 1TB, 1PB…
14:45 pdurbin Every user has a row in the authenticateduser table. Only builtin users have a row in the builtinuser table.
14:46 * bjonnh noticed this is not really a range
14:46 donsizemore @pameyer our Dataverse VM irsyncs its files.dir and PG backups into iRODS. we didn't get the iRODS/WebDAV dealy to scale prior to our migration to 4
14:47 pdurbin The builtinuser table is where we store a salted and hashed password for builtin users. We do not store a password for non-builtin users because their password is stored by a Shibboleth server or ORCID's server or GitHub's server or Google's server.
14:47 donsizemore @pdurbin cool, then i was looking in the right place =)
14:48 donsizemore @bjonnh we're primarily social science with historical data, so we have a bunch of small files. i'm fishing a network summary out of VMware if i can find the right screen
14:50 pdurbin pameyer: but you're running Dataverse on el6 not el7, right?
14:50 donsizemore @bjonnh we averaged 11kb/s (incoming and outgoing) over the past month. each VMware host has dual 10GB NICs, tho =)
14:50 pameyer pdurbin: dv.sbgrid.org is c7; my dev VMs are currently c6
14:51 pdurbin hmm, interesting. thanks, pameyer
14:51 donsizemore @pameyer i'm waiting impatiently for rhel8 before it's time to rebuild our production machines
14:51 bjonnh donsizemore: how redundant are your machines/VMs?
14:51 pdurbin donsizemore: I got your PDF about AWS stuff. Is this issue on your radar? https://github.com/IQSS/dataverse/issues/3921
14:53 donsizemore @pdurbin we've got a grant proposal in to extend that: s3 and irods in addition to swift. that's just a proposal at this time.
14:54 donsizemore @bjonnh we've currently got 7 vmhosts of varying vintage over a NetApp. our Dataverse fit neatly in one VM within fault tolerance's maximum specs, so we're sitting there until we outgrow it.
14:55 bjonnh donsizemore: well 4 machines on AWS with 2 cores 8 gigs of ram are 2200/y (with upfront reservation)
14:56 bjonnh donsizemore: 2TB of S3 storage with 1GB in/out is arount $50/mo
14:56 bjonnh around
14:57 bjonnh anyway I'm not sure how dataverse is clusterisable, so you may end up with only one machine for the app itself and eventually the db on 3
14:57 bjonnh that would be the paranoid option ;)
14:57 donsizemore @bjonnh i chose different service levels depending on VM (GF, PG, TwoRavens) and came up with $5500/yr not including data
14:57 bjonnh wow
14:58 donsizemore @bjonnh but my boss told me to spec it the way *I* would want it done, without regard to price
14:58 bjonnh also
14:58 bjonnh remember that amazon is offering a grant system
14:58 bjonnh so you can get stuff for free
14:58 donsizemore @bjonnh tell me more
14:58 bjonnh hah
14:58 bjonnh you're going to cut me the grass under the feet ;)
14:59 bjonnh https://aws.amazon.com/grants/
14:59 bjonnh "Build cloud-hosted publicly available science-as-a-service applications, software, or tools to facilitate their future research and the research of their community."
14:59 bjonnh that's totally you ^
15:00 donsizemore @bjonnh i really appreciate your mentioning this =)
15:01 bjonnh well if you file one, I will be interested in knowing how it went
15:01 bjonnh because I will have to do it too
15:16 donsizemore @bjonnh jonathan_crabtree@unc.edu has gone through their grant process before and i bet he could answer some questions for you?
15:26 donsizemore @bjonnh sorry, i just re-read my cost estimate, and that $5500 would include a generous estimate of outgoing data (~$720/yr)
15:40 bjonnh great thx
15:43 pameyer joined #dataverse
16:10 pameyer joined #dataverse
16:12 jhand joined #dataverse
17:26 pameyer joined #dataverse
17:27 donsizemore joined #dataverse
17:30 axfelix joined #dataverse
18:24 pameyer donsizemore: I just learned your glassfish user has a home directory
19:05 axfelix joined #dataverse
19:08 pameyer joined #dataverse
19:31 donsizemore joined #dataverse
19:32 donsizemore @pameyer should it not(?)
19:33 donsizemore @pameyer or are you telling me i have some remote vulnerability?
19:34 pameyer @donsizemore: not a problem, just something that I hadn't been expecting
19:35 pameyer I'd overlooked creating $HOME for glassfish and just noticed while looking at 1934
19:35 donsizemore @pameyer on the production box, everything is done from my home directory on the assumption that i'll be around for a while ;) i thought a glassfish home might be more central
19:35 pameyer :)
19:36 donsizemore @pameyer #1934 was only me taking a stab at it. improvements welcome =)
19:37 pameyer it looks good to me so far
19:37 donsizemore @pameyer the vagrant installer still runs as root due to vagrant mount permissions, but then it launches as $glassfish_user
19:37 pameyer most of the glitches I've hit have been problems on my end
19:37 pameyer ah - I hadn't thought to try the vagrant installer
19:38 pameyer but it appears to play nicely with the non-interactive / changed defaults installer
19:38 donsizemore @pameyer the "bare metal" installation process may be run unprivileged; the vagrant machination was simpler as root (and then it's in a temporary vagrant box)
19:40 pameyer yup - non-vagrant unprivileged install looks good :)
19:41 pameyer and once I'd put my default.config someplace that wasn't /root ; wipe and re-provision installation could send mail (that's what I'd meant by non-interactive)
19:41 donsizemore @pameyer i was starting to look at your mvn clean issue out of curiosity
19:42 pameyer anything that reduces the number of steps I need to remember is something I try to do
19:42 pameyer does it make sense?
19:42 donsizemore absolutely
19:57 pameyer joined #dataverse
20:08 pameyer joined #dataverse
20:24 pameyer joined #dataverse
20:55 axfelix joined #dataverse
21:22 pameyer joined #dataverse
22:05 donsizemore joined #dataverse
