04:57
djbrooke joined #dataverse
06:41
Haridass joined #dataverse
06:41
Haridass
Hi Dataverse Team
06:42
Haridass
I am facing one issue unzipping glassfish in Ubuntu
06:42
Haridass
We are setting up Dataverse in our Ubuntu server as per the steps given in http://guides.dataverse.org/en/latest/developers/ubuntu.html
06:43
Haridass
I have successfully downloaded glassfish using the command
06:43
Haridass
wget http://download.java.net/glassfish/4.1/release/glassfish-4.1.zip
06:43
Haridass
But when I try to unzip using
06:43
Haridass
unzip glassfish-4.1*zip
06:43
Haridass
I am getting the following error
06:44
Haridass
End-of-central-directory signature not found. Either this file is not a zipfile, or it constitutes one disk of a multi-part archive. In the latter case the central directory and zipfile comment will be found on the last disk(s) of this archive.
07:05
jri joined #dataverse
10:38
pdurbin
I just downloaded that glassfish zip file (`md5sum glassfish-4.1.zip` shows "2fd41ad9af8d41d1c721c1b25191f674") and it unzipped fine.
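For anyone hitting this later: "End-of-central-directory signature not found" almost always means the download was truncated or corrupted. A quick way to check, sketched below (the stand-in file is created only so the example is self-contained; the expected checksum is the one quoted above):

```shell
# Stand-in for a truncated download, so this example is self-contained:
printf 'not a zip' > glassfish-4.1.zip

# Compare the checksum against the known-good value quoted above;
# a good copy prints 2fd41ad9af8d41d1c721c1b25191f674:
md5sum glassfish-4.1.zip

# Test archive integrity; a truncated file fails with the
# "End-of-central-directory" error:
unzip -t glassfish-4.1.zip || echo "corrupt download, re-fetch it"

# Re-download, resuming a partial file:
# wget -c http://download.java.net/glassfish/4.1/release/glassfish-4.1.zip
```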
13:15
bsilverstein joined #dataverse
13:23
djbrooke joined #dataverse
13:24
djbrooke joined #dataverse
13:27
donsizemore joined #dataverse
13:28
donsizemore
@pdurbin: is https://spaces.internet2.edu/display/InCFederation/Error+Handling+Service worth creating a github issue for (and do you want me to do so)?
13:30
pdurbin
donsizemore: huh. Maybe. First time I've seen that page though so I'm not sure what the GitHub issue would be about.
13:30
pdurbin
"The goal of Federated Error Handling is to provide a better user experience in those situations where an IdP provides an SP with insufficient information (attributes) to make an access control decision."
13:31
pdurbin
That sounds like me! The poor Service Provider (SP) who doesn't get enough information. :)
13:34
djbrooke joined #dataverse
13:35
donsizemore
@pdurbin: as is, Dataverse tells us exactly which attribute is missing, but a more user-friendly error URL could smooth things over down the road
14:05
pameyer joined #dataverse
14:10
djbrooke joined #dataverse
14:28
pdurbin
yeah, that's a good idea
14:34
Haridass joined #dataverse
14:35
Haridass
Hi dataverse team
14:35
pdurbin
Haridass: hi! Did you see my reply at http://irclog.iq.harvard.edu/dataverse/2016-06-14#i_36835 ?
14:37
Haridass
yes pdurbin
14:38
Haridass
glassfish got downloaded successfully
14:38
Haridass
that issue got resolved
14:39
pdurbin
oh good
14:39
Haridass
But I have another issue
14:39
Haridass
While installing Dataverse I am getting the following error
14:39
Haridass
nope I haven't been able to connect to the local instance of postgresql as admin user
14:52
djbrooke joined #dataverse
14:53
djbrooke joined #dataverse
15:04
djbrooke joined #dataverse
15:13
axfelix joined #dataverse
15:15
pdurbin
Haridass: following error? Did you paste an error?
15:18
Haridass
the error is
15:19
Haridass
Nope, I haven't been able to connect to the local instance of postgresql as admin user
15:19
Haridass
If you can provide me your email, I can send you the screenshot
15:19
pdurbin
You can email it to support@dataverse.org
15:19
Haridass
ok
15:20
Haridass
By the way, can I know your name
15:20
Haridass
or shall I address it as Support Team
15:20
pdurbin
Philip Durbin but most people call me Phil. http://www.iq.harvard.edu/people/philip-durbin
15:21
Haridass
ok Phil
15:21
pdurbin
Haridass: you're installing Dataverse on a server, right? You aren't setting up a dev environment, are you?
15:32
Haridass
Yes you are correct
15:32
Haridass
As a first step I want to install Dataverse and use it as an end user.
15:33
Haridass
We are planning to build a team to set up the dev environment for customization work in the future.
15:33
pdurbin
Makes sense. If you want, you could try `vagrant up` from your Ubuntu box.
15:35
pdurbin
Haridass: great that you're building a team of people to do development on Dataverse!
15:37
Haridass
is Vagrant up something like a VirtualBox VM manager
15:37
pdurbin
Haridass: I see your screenshots in https://help.hmdc.harvard.edu/Ticket/Display.html?id=237133 . For now, since you're just playing around, you might want to set Postgres to "trust", if that makes sense.
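For anyone reading later: "trust" here refers to the auth METHOD column in Postgres's pg_hba.conf. A sketch of what the relevant lines look like (the file path and version number vary by Ubuntu release):

```
# /etc/postgresql/<version>/main/pg_hba.conf
# TYPE  DATABASE  USER  ADDRESS       METHOD
local   all       all                 trust
host    all       all   127.0.0.1/32  trust

# then reload the config:  sudo service postgresql restart
```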
15:37
axfelix
@Haridass: yup, it's exactly that :)
15:37
Haridass
"trust" done already
15:38
Haridass
still same error
15:38
pdurbin
To use Vagrant, you install Vagrant and VirtualBox and then run `vagrant up` to spin up a CentOS VM in VirtualBox running Dataverse.
15:39
djbrooke joined #dataverse
15:39
pdurbin
Haridass: the error is telling you that postgres is not running.
15:39
Haridass
the link https://help.hmdc.harvard.edu/Ticket/Display.html?id=237133 is asking for login
15:39
Haridass
do I need to register for help.hmdc.harvard.edu
15:40
Haridass
"trust" done already, still same error
15:41
Haridass
Postgres is running and I am able to run SQL statements in psql
15:41
pdurbin
huh
15:41
pdurbin
I wonder why the installer thinks postgres isn't running.
15:41
Haridass
there are 2 screenshots in my email
15:42
Haridass
the first one is where I have to specify certain values for dataverse installation
15:42
Haridass
are those values correct
15:44
Haridass
please verify the values are correct in my first screenshot
15:44
Haridass
postgres has only one user 'postgres' with password 'excel'
15:45
Haridass
but the dataverse installer is asking for Postgres ADMIN Password as well as Postgres User Password
15:47
Haridass
Also
15:48
Haridass
my PC is a windows machine with Ubuntu in VM
15:48
Haridass
Ubuntu on virtual disk
15:48
Haridass
in this setup can I use vagrant up
15:49
djbrooke joined #dataverse
15:50
pdurbin
Oh, no. You would run `vagrant up` from Windows. You'd install Vagrant and VirtualBox on your Windows box.
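A sketch of the host-side workflow, assuming Vagrant and VirtualBox are already installed on the Windows machine (shown as comments since the commands require those tools plus a checkout containing the project's Vagrantfile):

```
# From the directory containing the Dataverse Vagrantfile:
#   vagrant up       # boots and provisions the CentOS VM in VirtualBox
#   vagrant ssh      # log into the VM
#   vagrant halt     # stop the VM when finished
#   vagrant destroy  # throw the VM away entirely
```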
15:56
Haridass
I will get Vagrant up set up tomorrow and will let you know the progress then.
15:56
Haridass
in the meantime will this chat history be maintained
15:57
pdurbin
yep
15:57
pdurbin
I back it up. MySQL dump
15:57
Haridass
do I need logins for https://help.hmdc.harvard.edu/Ticket/Display.html?id=237133
15:58
pameyer
pdurbin: mind if I move some of your stubs around in the dcm repo?
16:02
pdurbin
pameyer: please feel free!
16:02
pdurbin
BBQ time. Back in a while.
16:02
pameyer
enjoy :)
16:03
djbrooke joined #dataverse
16:26
majesticc joined #dataverse
17:01
djbrooke joined #dataverse
17:29
djbrooke joined #dataverse
17:40
djbrooke joined #dataverse
17:45
pdurbin
so much BBQ. and ice cream
17:48
pdurbin
Haridass: I just wrote back explaining how to log into the ticketing system.
17:58
djbrooke joined #dataverse
18:11
pdurbin
pameyer: I'm working on the actionlogrecord stuff. You're actively checking this table from time to time? There's no API or GUI to do so.
18:14
djbrooke joined #dataverse
18:18
djbrooke joined #dataverse
18:28
donsizemore joined #dataverse
18:29
pameyer
pdurbin: what's the context for the actionlogrecord question?
18:31
pdurbin
pameyer: on Thursday I added a todo to RequestRsyncScriptCommand saying "make sure the error is logged to the actionlogrecord". That's still a requirement, right?
18:32
pameyer
the error being logged somewhere is - but is actionlogrecord where operational errors for the admins would usually go?
18:34
pdurbin
Errors *can* be logged there. I'm not sure how often anyone looks at it, since (again) there's no GUI or API to look at the actionlogrecord table.
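In other words, the only way to inspect it is a direct database query. A sketch (shown as a comment because it needs a live Postgres with the Dataverse schema; the database name `dvndb` and ordering by `id` are assumptions):

```
# sudo -u postgres psql dvndb -c \
#   "SELECT * FROM actionlogrecord ORDER BY id DESC LIMIT 20;"
```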
18:34
pdurbin
pameyer: we log lots of errors to server.log. You're familiar with server.log, right?
18:35
pameyer
vaguely
18:35
pdurbin
It's the main Glassfish log.
18:36
pameyer
is there an expected place for admins to look for information about things going wrong?
18:36
pameyer
it's sounding like that might not be actionlogrecord
18:37
pdurbin
I'd say server.log is the main place to look for errors. But maybe that's just me.
18:39
pameyer
think it's worth asking the ops/curation people on your end where they usually look?
18:39
pdurbin
let's ask donsizemore since he runs a production Dataverse installation
18:40
pdurbin
pameyer: curation people here don't have access to the database (actionlogrecord, etc.) and don't use APIs.
18:43
pdurbin
bricas: do you have opinions on the "actionlogrecord" table?
18:50
pdurbin
pameyer: I also have a todo that says "notify user on any failure" and I might need a little more detail on that.
18:50
donsizemore
@pdurbin i generally look in server.log or syslog - aside from turning on debugging as needed
18:50
pameyer
donsizemore: thanks
18:51
pdurbin
donsizemore: ok, and debugging often involves cranking up the verbosity in server.log
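For reference, a sketch of checking server.log for recent problems. The path below is the common default Glassfish domain location, an assumption; adjust for your install:

```shell
# server.log is the main Glassfish log; this path is the usual default
# (an assumption -- your domain directory may differ):
LOG=/usr/local/glassfish4/glassfish/domains/domain1/logs/server.log

if [ -f "$LOG" ]; then
    # show the 20 most recent warnings/errors
    grep -E 'SEVERE|WARNING' "$LOG" | tail -n 20
else
    echo "no server.log at $LOG"
fi
```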
18:51
pameyer
pdurbin: that'll be failure for post-upload validation
18:51
donsizemore
@pdurbin in my one year of experience, Dataverse pretty much spits out the most helpful bits to the screen while Glassfish buries it in a straw pile of Java exceptions
18:51
pameyer
aka - dcm needs enough info to send a message to the user (and curators) that there was a problem
18:52
pdurbin
donsizemore: I guess that's good. We try to show useful errors in the UI. :)
18:54
pdurbin
pameyer: so the DCM will send a message to Dataverse about if the upload succeeded or failed. A JSON message, I assume. Have you defined what the JSON will look like yet?
18:54
pameyer
only to the level of "something that dataverse is happy getting messages in"
18:55
pdurbin
Dataverse is happy with JSON.
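Since the payload had not been defined at this point in the conversation, the following is purely an illustrative sketch of what a DCM upload-status message in JSON might look like; every field name here is hypothetical:

```json
{
  "datasetIdentifier": "doi:10.5072/FK2/EXAMPLE",
  "status": "validation failed",
  "message": "checksum mismatch on uploaded file"
}
```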
18:56
pameyer
is there an API for "tell user $x something"?
18:56
pameyer
I didn't find one in a quick look at the docs - but that problem could be on my end
18:57
djbrooke joined #dataverse
19:02
pdurbin
pameyer: it sounds like you want an API endpoint called "notifications" or something. There is no such thing.
19:03
pameyer
is that because there's another way of doing it?
19:03
pameyer
or is this something to put on the list of requirements?
19:05
pdurbin
pameyer: it looks like we have "listen for DCM messages" already at https://docs.google.com/spreadsheets/d/1gZy4w21xnJsKRvOtarl_GUb_aeS53Nk7kJ04NYfi10E/edit?usp=sharing
19:06
pameyer
yup
19:07
pdurbin
pameyer: that reminds me. I have a script for you to try to run. A project management script. :)
19:08
pameyer
I can probably make it generate *interesting* errors ;)
19:08
pameyer
where does it need to run?
19:08
pdurbin
pameyer: please try running this: https://gist.github.com/pdurbin/5a98932c94055674278ebf47d241782e
19:10
pameyer
doesn't *look* like a rootkit - hang on a few
19:10
pdurbin
heh
19:11
pameyer
looks to me like it worked - want me to send you the output?
19:12
pdurbin
that's ok
19:13
pdurbin
pameyer: but maybe some day we can use a similar script to export our task list into a document
19:14
majestic joined #dataverse
19:15
pameyer
cool
19:15
pdurbin
that one was getting complicated enough that a script became helpful
19:18
pameyer
might be helpful other places as well
19:18
pameyer
and I learned that google will give you csv
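For reference, Google Sheets exposes a CSV export endpoint; a sketch using the task-list spreadsheet linked above (shown as a comment because it needs network access and a readable sheet):

```
# curl -L "https://docs.google.com/spreadsheets/d/1gZy4w21xnJsKRvOtarl_GUb_aeS53Nk7kJ04NYfi10E/export?format=csv" \
#   -o tasks.csv
```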
19:19
pdurbin
:)
20:03
bjonnh
pdurbin: http://guides.dataverse.org/en/latest/installation/prep.html#hardware-requirements
20:03
bjonnh
I found that to give me an idea of what the harvard instance is
20:04
bjonnh
is there a way to see a stats page for the harvard instance, to know how much they store right now?
20:05
pdurbin
bjonnh: no, we don't have a stats page but if you email support@dataverse.org someone will probably tell you disk usage or whatever. I think it's 6 TB or so.
20:06
pdurbin
and I actually have to run, sorry
20:06
bjonnh
ohh that's all?
20:06
pdurbin left #dataverse
20:06
bjonnh
see you :)
20:07
pameyer
bjonnh - how big were you hoping for?
20:07
bjonnh
I have two projects in mind
20:08
bjonnh
one university wide, one NIH national-center wide
20:08
bjonnh
but I'm sure we're not going to reach 6TB for a couple of years
20:09
bjonnh
because our field is lagging way behind others in terms of data sharing
20:10
pameyer
mind if I ask which field?
20:34
bjonnh
pharmacognosy
20:34
bjonnh
pharmacy/natural products
20:47
metamattj joined #dataverse
20:53
djbrooke joined #dataverse
20:54
djbrooke_ joined #dataverse
20:57
djbrooke joined #dataverse
20:58
djbrooke_ joined #dataverse
21:30
pameyer
bjonnh: cool - not a field I've got enough overlap with to give you informed guesses as to data volume though
21:31
pameyer
but you're probably in a better spot to do that than I am anyhow
21:36
djbrooke joined #dataverse
21:38
bjonnh
well most publication/grant related datasets are not that huge
21:39
bjonnh
except that we have cell-imaging and huge metabolomic mass spectrometry data files that come in at tens to hundreds of GB
21:39
bjonnh
but these are marginal for now
21:41
pameyer
that gets into the range where http uploads might cause problems
21:42
pameyer
part of the reason we're working on non-http uploads