IRC log for #dataverse, 2018-03-05

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

All times shown according to UTC.

Time S Nick Message
00:17 danmcp joined #dataverse
01:23 danmcp joined #dataverse
03:23 tyrel joined #dataverse
03:36 tyrel joined #dataverse
04:22 tyrel joined #dataverse
04:51 tyrel joined #dataverse
05:53 tyrel joined #dataverse
08:06 jri joined #dataverse
12:06 pdurbin joined #dataverse
12:37 jri joined #dataverse
12:45 pdurbin tyrel: awesome that you got `vagrant up` working.
12:48 jri_ joined #dataverse
13:03 danmcp joined #dataverse
13:08 jri joined #dataverse
13:10 danmcp joined #dataverse
14:01 danmcp joined #dataverse
15:05 pameyer joined #dataverse
15:07 pameyer pdurbin: did you see http://irclog.iq.harvard.edu/dataverse/2018-03-02#i_63996 ?
15:13 pdurbin_m joined #dataverse
15:14 pdurbin_m pameyer: yes! I tried to give you a shout out at http://irclog.iq.harvard.edu/dataverse/2018-03-03#i_64098
15:15 pdurbin_m I need to get in touch with aculich, obviously.
15:18 pameyer it sounds like he'll be back online (probably with students) earlier Friday - but it might be good to connect before then (if possible) for some more precise scheduling
15:18 pameyer glad you saw it!
15:20 pdurbin_m Ah, "with the students there is a time next friday earlier in the day." I glossed over that. Thanks.
16:30 danmcp joined #dataverse
16:38 jri joined #dataverse
16:40 jri joined #dataverse
17:36 pdurbin pameyer: do you have any thoughts on https://github.com/IQSS/dataverse/issues/4481 ? "As a sysadmin, I'd like to know how much usage (hits, etc.) my Dataverse installation is getting". Maybe this is a question for andrewSC bricas and knikolla as well.
17:37 pameyer pdurbin: the awstats one?
17:38 pdurbin Yeah, as a starting point I documented `yum install awstats` etc at https://github.com/IQSS/dataverse/commit/a56a3ef and I stood up the awstats web interface at https://dev1.dataverse.org/awstats/awstats.pl?config=localhost.localdomain (blocked by IP but anyone here can PM me if they want to see it).
17:40 pameyer I've seen the awstats interface before - anything non-standard that would be worth unblocking an IP?
17:42 pdurbin Probably not. It's awstats 7.0 from EPEL. Very standard.
17:43 pameyer https://goaccess.io/ might be worth a look - but it seems essentially the same concept as awstats
17:44 pameyer I think awstats would be a useful tool, but probably not top of my list
17:48 pdurbin awstats is just something we happened to use two jobs ago so I'm vaguely familiar with it. GoAccess doesn't seem to support Glassfish out of the box according to https://goaccess.io/man#custom-log but that's not especially surprising.
17:48 pdurbin "Predefined options include, Common Log Format (CLF), Combined Log Format (XLF/ELF), including virtual host, W3C format (IIS) and Amazon CloudFront (Download Distribution)."
17:48 pameyer ah - I'd thought you were going with apache and X-Forwarded-For
17:48 pameyer missed the bit about glassfish logs
17:50 pdurbin Right now I'm using awstats to process Apache's /var/log/httpd/ssl_access_log (and rotated logs) as-is. I haven't re-configured Apache to show the X-Forwarded-For header in the logs.
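As a rough illustration of the server-side log analysis being discussed, here is a minimal Python sketch that counts hits per path and per status code from Apache-style access log lines. It assumes the log uses Common Log Format (optionally followed by the referer and user-agent fields of Combined Log Format); the actual format of `ssl_access_log` depends on the Apache configuration. The function name `count_hits` and the sample lines are invented for illustration and are not real traffic.

```python
import re
from collections import Counter

# Assumed layout: Common Log Format. Trailing Combined Log Format
# fields (referer, user agent) are simply ignored by this pattern.
LOG_LINE = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)


def count_hits(lines):
    """Return (hits per path, hits per status) for the lines that parse."""
    paths, statuses = Counter(), Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip anything that is not in the assumed format
        paths[match.group("path")] += 1
        statuses[match.group("status")] += 1
    return paths, statuses


# Made-up sample lines (documentation IP addresses, hypothetical sizes).
SAMPLE = [
    '192.0.2.1 - - [05/Mar/2018:17:50:00 +0000] "GET /dataverse/root HTTP/1.1" 200 5120',
    '192.0.2.2 - - [05/Mar/2018:17:50:05 +0000] "GET /api/info/version HTTP/1.1" 200 64',
    '192.0.2.1 - - [05/Mar/2018:17:50:09 +0000] "GET /dataverse/root HTTP/1.1" 200 5120',
    'not a log line',
]

paths, statuses = count_hits(SAMPLE)
for path, hits in paths.most_common():
    print(hits, path)
```

Tools such as awstats or GoAccess do far more than this (sessions, referrers, bot filtering); the sketch only shows the basic idea of counting requests without relying on cookies.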
17:50 pameyer google analytics / piwik might also be useful; but complementary to log analysis.  Can't get page load times from server side log analysis; but ga/piwik would probably be uninformative for scripts/apis/my normal browsing profile
17:52 pdurbin Right, Google Analytics and Piwik rely on cookies. I think I'll walk down the hall and see if I can get access to our GA data, but I'm thinking that a server-side log analytics solution would probably give a more comprehensive picture of what's going on because it wouldn't rely on cookies.
17:54 pameyer cookies / javascript / hidden "pixel" images / other annoyances - browsers with cookies disabled would still show up
17:54 pameyer but if I'm remembering correctly, dataverse with cookies disabled is already unhappy
17:55 pameyer …. that translates in to me agreeing with you about log analysis here
17:55 pdurbin Yeah, the jsessionid goes in a cookie. What about the ELK stack? (Elasticsearch, Logstash, and Kibana.) Is it still popular? Is it worth learning and using? I see bencomp wrote a "grok pattern" for Glassfish server.log and access.log at https://gist.github.com/bencomp/9b99cee2720d6233c8a1
17:57 pameyer haven't gotten into ELK, so I don't have a useful opinion there
18:00 pdurbin Ok, no worries. If someone else here has an opinion on ELK, please let me know.
18:05 pdurbin pameyer: have you used GoAccess?
18:29 dimitri joined #dataverse
18:38 andrewSC pdurbin: for something fairly comprehensive i'd recommend datadoghq.com (if you're able to get funding/pay for it)
18:38 andrewSC I've used it personally at Adobe for a handful of projects and it was pretty nice
18:39 andrewSC where i work now we don't have it and we honestly haven't had any discussions around analytics yet
18:39 andrewSC we probably should though....
18:39 andrewSC I know i'm using GA to track a couple projects i maintain here but that's only for my own personal info
18:40 andrewSC I'm not entirely sure what the open source variant of datadog would be?
18:42 pdurbin I think people in my building are starting to use Datadog for monitoring. Different group than the one that hosts our production installation though. Thanks, andrewSC.
18:43 andrewSC mmmmmm nice
18:43 andrewSC and for sure :)
18:44 pdurbin This post is from November so I have the impression that analyzing logs is a relatively new feature of Datadog: https://www.datadoghq.com/blog/announcing-logs/
18:44 pameyer pdurbin: did a little investigation into goaccess, but it hasn't been high priority enough to get it (or awstats) converged
18:45 andrewSC you know what, i think we used something else for parsing logs
18:45 pdurbin pameyer: ok, thanks
18:45 andrewSC i can't remember the name though
18:45 andrewSC lemme ask a buddy of mine
18:45 pdurbin andrewSC: ok but no worries
18:46 pdurbin Whatever we come up with for making sense of access logs, I think it would be nice to write up in the Dataverse Admin Guide as a suggested approach.
18:47 pdurbin There are 32 production installations of Dataverse around the world and I wonder if anyone else already has a server-side solution in place. Something other than Google Analytics or Piwik, I mean.
18:47 andrewSC mmmmm
18:47 andrewSC pretty sure it was this https://www.sumologic.com
18:48 pdurbin andrewSC: I haven't heard of it but thanks!
18:48 andrewSC npnp
18:50 pameyer andrewSC: for some reason, that made me think of P =?= NP
18:50 andrewSC hahah
19:57 andrewSC do you guys use another service or tool to handle other types of submissions?
19:57 andrewSC i know that sounds overly vague lol
19:57 andrewSC basically i'm trying to figure out if there's something out there that's open source that is a submission management system?
19:57 andrewSC or if i need to build something in house
19:59 pdurbin What do you mean by submission?
20:04 andrewSC so for example outside of publications we also have a couple larger planning/process? sprints throughout the year around gathering research proposals for the coming year, people submitting research posters for the symposium we host every year, project leads submitting the final output of their projects etc
20:05 andrewSC some of these things end up in dataverse while others are more ephemeral? in nature
20:05 pdurbin People submitting talk ideas for the next Dataverse Community Meeting?
20:05 andrewSC hmmmmm that could be something similar
20:05 andrewSC yeah
20:06 andrewSC how do you guys handle that now? just a google form or is there some tooling or is it via email or?
20:07 pdurbin I want to say we used a Google Form but I can't remember or tell by looking at https://groups.google.com/d/msg/dataverse-community/S7lV0-QmgLY/Y6q8YtwTAwAJ
20:08 pdurbin Probably either Google Forms or Qualtrics. It sounds like you were looking for an open source solution, though.
20:08 pameyer andrewSC: it might be worth looking at conference management software
20:09 pameyer I don't have any recommendations, but it sounds like that might be a name for useful searching
20:10 pdurbin andrewSC: "an easy submission process" at https://osf.io/meetings/ ... "A free poster and presentation sharing service for academic meetings and conferences"
20:11 andrewSC nicee
20:11 andrewSC thanks for the suggestions :)
20:13 pdurbin sure
22:28 pameyer pdurbin: do you have strong opinions about APIs "returning what you sent" and binary data?
22:54 pdurbin pameyer: are you testing a Dataverse API?
22:54 pdurbin This might be a good question for #rest here on freenode.
22:55 pameyer every time I run integration tests I get a bell in my terminal because the thumbnail APIs return the thumbnail
22:56 pdurbin Oh. I wrote that API endpoint. I wanted to be able to test that the thumbnail was correct. :/
22:58 pdurbin In this case, the binary data is probably a PNG, which doesn't strike me as weird to return.
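A likely explanation for the terminal bell (an assumption, not confirmed in the conversation) is that the raw thumbnail bytes printed to the terminal happen to contain the ASCII BEL byte (0x07). The Python sketch below checks whether a payload starts with the standard PNG signature and counts BEL bytes, which is one way a test could verify a thumbnail without echoing binary data. The helper names and the sample payload are hypothetical.

```python
# The fixed 8-byte signature that begins every PNG file.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
BEL = 0x07  # ASCII bell; terminals typically beep when this byte is printed


def looks_like_png(data: bytes) -> bool:
    """Return True if the payload starts with the PNG signature."""
    return data.startswith(PNG_SIGNATURE)


def count_bells(data: bytes) -> int:
    """Count BEL bytes that would ring the terminal if printed raw."""
    return data.count(bytes([BEL]))


# Hypothetical payload: a PNG signature followed by arbitrary bytes,
# standing in for a real thumbnail response body.
fake_thumbnail = PNG_SIGNATURE + b"\x00\x07\x00\x07"

print(looks_like_png(fake_thumbnail))
print(count_bells(fake_thumbnail))
```

Asserting on the signature (or on a checksum of the body) keeps the test meaningful while the binary payload itself stays out of the terminal output.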
22:58 pameyer np
22:58 pameyer thanks
23:08 pdurbin sorry about the bells
23:08 pdurbin :)