IQSS logo

IRC log for #dataverse, 2018-12-04

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
08:21 jri joined #dataverse
09:47 MrK joined #dataverse
10:34 poikilotherm joined #dataverse
11:49 pdurbin morning, all
11:49 poikilotherm Good morning :-)
11:50 pdurbin poikilotherm: I'm not using an IRC bouncer but I do run an IRC cliet (weechat) in a screen session in my home server.
11:50 poikilotherm :-)
11:51 poikilotherm I am heading for lunch...
11:51 poikilotherm Cu later
11:51 pdurbin bon appetit
11:53 pdurbin MrK: did you see the "This looks promising overall" comment at https://github.com/IQSS/dataverse/pull/5349#pullrequestreview-180848867 ?
11:57 MrK pdurbin: Ah yeah :P
11:59 pdurbin He's asking if we need to rename our old scripts, etc.
12:04 MrK Yeah I gotta answer, I'm just struggling with docker atm, but I think I'm gonna take a break.
12:04 pdurbin thanks!
12:04 pdurbin and sorry to hear about your docker struggles
12:06 MrK Yeah I hoped it would be easy but replacing just a war is not that simple, I was doing workarounds like they use solr schema/config which is in the folder instead of unpacking from war (??) and using language bundle the same way instead of unpacking from war (??). But even fixes and using my war for some reason I can't upload files there is just a single <div> missing.
12:31 poikilotherm Re :-)
12:33 poikilotherm MrK while reading your answer to landreev: wouldn't it be an option to create a custom MigrationResolver that adds the necessary bits based on the knowledge what the prior running Dataverse version has been?
12:33 poikilotherm That way you could reuse the SQLs made by landreev for updating
12:34 poikilotherm But let Flyway takeover instead of using a script to get to a certain point
12:35 MrK poikilotherm: Hmm you mean extracting the info about version and based on that execute the sql scripts? Yeah that's also an solution.
12:36 poikilotherm YEs
12:37 poikilotherm I dunno if this version is conserved somewhere except strings. In that case a upgrade script would be needed, but it is just about conserving the version number
12:37 poikilotherm Maybe pdurbin or pameyer know sth about that
12:45 MrK Ah right, recently the script was added so i would not need to execute sql scripts one by one.
12:52 pdurbin MrK: ok, so "baseline" is whenever you start with flyway. Thanks.
12:53 MrK pdurbin: Yeah if you dont have database schema(new project) you just don't use it.
12:53 MrK the baseline
12:53 pdurbin no baseline for a new project
12:53 pdurbin ok
12:54 pdurbin I don't know. It sounds like I should read some flyway docs. :)
12:55 pdurbin Huh. Flyway creates a table called "flyway_schema_history"? I'm looking at https://flywaydb.org/getstarted/how
12:55 MrK Yeah thats how it tracks which sql failed/ was executed before
12:55 pdurbin "Immediately afterwards Flyway will begin scanning the filesystem or the classpath of the application for migrations. They can be written in either Sql or Java."
12:56 pdurbin It sounds like we could make Flyway scan the file system for our old manual migration scripts.
12:57 MrK Yeah but it would not be compatible since if 2 people upgrade from different version the sql will fail since there will be duplicates.
12:58 MrK but poikilotherm suggested some good solution.
12:58 poikilotherm ;-)
12:59 poikilotherm Thx for the flowers
12:59 MrK we won't have to use flyway before all old sql scripts will be executed
12:59 MrK xD
13:00 MrK no problem
13:01 MrK ahh yeah we will have to add old sql scripts to war but that's not a problem.
13:01 poikilotherm Oh it would be really charming to use these SQL scripts from Flyway to support instant upgrading on deploying new version
13:01 pdurbin Let's see what Leonid and others say in the issue or pull request. We should keep the conversation there. I mean, we can discuss here too but please don't expect them to read the IRC log.
13:01 andrewSC joined #dataverse
13:02 poikilotherm MrK before I forget this: once we have Flyway in place, we should really try to get https://github.com/IQSS/dataverse/blob/develop/scripts/database/reference_data.sql replaced with a proper migration
13:02 pdurbin 4 hours until the community call: https://dataverse.org/community-calls
13:02 poikilotherm At least the indexing stuff
13:03 poikilotherm The inserts might be a good thing to do in bootstraping
13:03 poikilotherm (See my issue https://github.com/IQSS/dataverse/issues/5361)
13:03 MrK Yeah I see it that way, first check if the version is 4.9.5( with flyway support) if not use the script which will upgrade to 4.9.5 and then use flyway. That's during startup
13:04 poikilotherm Oh IIRC that script from landreev compiles a specific SQL script
13:05 poikilotherm Takes the changes between versions and mixes them together based on source version and target version
13:05 poikilotherm I dunno if we should execute this script from within the code...
13:06 poikilotherm IMHO it would be cleaner to use the SQL changes scripts and place those in migrations to run via custom resolver
13:19 pdurbin poikilotherm: oh, the answer is yes, we have a different call in method today: https://groups.google.com/d/msg/dataverse-community/BxKl7Qt914I/OneAOeYIBgAJ
13:20 poikilotherm I already saw that and tried it :-) Linux + Firefox = unsupported, also the docs state otherwise. Chrome + Linux seems to work ok
13:20 pdurbin phew
13:20 poikilotherm Thx for pointing out :-)
13:21 pdurbin poikilotherm: do you want part 2 of the feedback on your glassfish/paraya post?
13:21 poikilotherm Yeah sure :-) Go ahead
13:21 pdurbin Ok, for context, here was part 1: http://irclog.iq.harvard.edu/dataverse/2018-11-30#i_80663
13:22 pdurbin I guess we're on the "philosophy beyond container images" section.
13:23 pdurbin You talk about what the container images for Dataverse should be like.
13:23 pdurbin But we've never ever done this. Never made container images.
13:23 pdurbin I'm the only one on the team who has pushed images to Docker Hub.
13:23 pdurbin No one else even has access to push to "iqss" on Docker Hub.
13:24 pdurbin I gave access to some Red Hat intern and their mentor.
13:24 pdurbin They might still have access.
13:24 pdurbin All this is to say that anything under "iqss" on Docker Hub should be regarded as highly experimental.
13:24 pdurbin I don't know what I'm doing.
13:24 pdurbin And I'm the one who created the "iqss" org on Docker Hub.
13:25 pdurbin So I appreciate any tips you have. :)
13:26 pdurbin When you say "usable directly from upstream releases" I assume you mean that we should be pushing images to "latest" on Docker Hub. Dataverse releases track the "master" branch, so I guess at release time, when we merge "develop" into "master" and tag a release, we should be pushing an image to Docker Hub.
13:26 pdurbin I guess I want to back up.
13:26 pdurbin I don't think IQSS should necessarily have to do this.
13:27 pdurbin If poikilotherm or Slava or Craig or others want to do this under their own non-iqss namespace on Docker Hub, they should feel free.
13:27 pdurbin I say this because I can't imaging developers at IQSS doing this any time soon. Most of us don't even have Docker installed.
13:28 pdurbin But anyone, someone who wants all this Docker goodness will be pushing images to Docker Hub, I assume. They will be pushed at release time. They will be pushed to "latest". I assume.
13:29 pdurbin Next you say "stateless"... you've already discovered that our EJB timers are not stateless. The Red Hat interns worked on this a bit within OpenShift. Only having one master timer server when you start up multiple Glassfish.
13:31 pdurbin The other stateful thing is that when you upload a logo for a dataverse to a particular Glassfish server, the logo is saved only on the local file system of that Glassfish server. In production we point used to point to an NSF mount. Now on AWS, we have a script that rsyncs logos back and forth between the two Glassfish servers.
13:31 pdurbin Pretty ugly. I don't remember if there's an issue tracking this or not.
13:32 pdurbin With regard to secrets, the Red Hat interns implemented secrets in openshift.json in the code base. Done. There's an issue and pull request I'm sure I can dig up. Hopefully, we can use the same variables for "Docker Secrets", whatever that is.
13:32 pdurbin I don't know what "Kubernetes ConfigMaps and Secrets" is.
13:34 pdurbin I don't know how we'd achieve "rolling-update" ready in Dataverse. I would think we'd want this not just in Kubernetes but also in traditional installations. The Flyway stuff might help here. Updating the Solr schema is still a manual process. Solr does have an API for adding fields. I haven't used it.
13:35 pdurbin For "multinode cluster" please see comments above about the EJB timer and the dataverse logos being specific to a Glassfish server. There's still some work to do here. I updated http://guides.dataverse.org/en/4.9.4/installation/advanced.html#multiple-glassfish-servers but a release hasn't been cut yet.
13:36 pdurbin I am in support of configuration being easier.
13:36 pdurbin "The script is to a large degree a derivative of the old installer from DVN 3.x. It is written in Perl. If someone in the community is eager to rewrite it, perhaps in a different language, please get in touch. :)"
13:36 pdurbin http://guides.dataverse.org/en/4.9.4/installation/installation-main.html
13:37 pdurbin We know, we know that our Perl installer is not great. Please see the quote above. If someone can help with a better approach, better technology, whatever, we are open to it.
13:37 MrK joined #dataverse
13:38 pdurbin "container and classic support" is a must. We must continue to support classic installations. We don't want to scare people away with "kubernetes only" talk.
13:38 pdurbin I don't think any images I've pushed to Docker Hub have test data.
13:39 pdurbin I'm not sure where the "test data" comment comes from. docker-aio was explicitly developed for testing. It runs the API test suite with does leave cruft behind.
13:39 poikilotherm Actually I think now this was a mistake
13:39 pdurbin Ok, next up is the section on security. poikilotherm should I keep going or do you want to react to anything above?
13:39 poikilotherm Sure
13:40 poikilotherm err...
13:40 poikilotherm Sure = feedback from me
13:40 poikilotherm First, I don't think it might be necessary now that IQSS releases images
13:41 pdurbin Good, because IQSS won't any time soon. :)
13:41 poikilotherm There are options to use the simple maven call, once this is in place
13:42 poikilotherm And I think this _might_ be something that the Consortium can take care of
13:42 pdurbin yes, a great project for the consortium
13:43 poikilotherm I don't think it would be a good idea to point people to images from others like me, slava, ...
13:44 poikilotherm People around Docker often have little to no trust in user images
13:44 poikilotherm Because often it is unknown how well maintained those are
13:49 poikilotherm About being stateless
13:49 poikilotherm Good that you have this on your radar
13:50 poikilotherm The logo could be solved by storing it in S3...
13:51 pdurbin IQSS should help provide the raw materials necessary for people in the community and/or consortium to create, push, and maintain awesome Docker images. IQSS doesn't have the capacity to do it.
13:51 poikilotherm If someone uses a multinode installation, them needs a solution for this stuff anyway
13:51 poikilotherm Good to know about secrets support - will look into this later
13:52 pdurbin see "Separate secrets in openshift.json" at https://github.com/IQSS/dataverse/issues/4763
13:53 poikilotherm Solr is indeed a tricky one. But one thing after the other
13:54 poikilotherm About the installer: I think when some stuff is included in the codebase, this will result in a more lightweight version of it
13:55 poikilotherm To a large degree I see a benefit from those changes for both worlds - Docker and classic
13:55 poikilotherm And to repeat: I am absolutely with you on the "must" for Docker and "classic"
13:56 poikilotherm Altough it might sound sometimes like I have only Docker and Kubernetes in mind
13:57 poikilotherm Please rest assured that I am absolutely aware that my efforts will only have a chance to be merged, when this is respected
13:57 pdurbin When what is respected? Kubernetes?
13:58 poikilotherm No, the hybrid approach
13:58 poikilotherm Both needs to continue to work
13:59 pdurbin This is why I was suggesting that I could create a dataverse-kubernetes repo for you under IQSS on GitHub. You wouldn't be blocked. You could merge whatever you want. From my perspective, what you need are better raw materials.
13:59 poikilotherm But I need code changes.
14:00 poikilotherm And those have to be in small chunks
14:00 poikilotherm When I work on a fork, this cannot be fullfilled
14:00 pdurbin What we would merge on the main repo is a line or paragraph in the Installation Guide, saying "There's an experimental dataverse-kubernetes repo you can play with maintained by the community and/or consortium of here: <link>."
14:00 poikilotherm And I don't like fork
14:00 poikilotherm +s
14:01 pdurbin Sure, code changes are part of the raw materials you need. You need a better war file. You need a better or different installer.
14:01 poikilotherm Do you really think forking would be a good option? Any chances this might make its way back into mainline?
14:01 poikilotherm I could just live on my own fork in github and use my feature branch
14:02 poikilotherm Makes only little difference
14:02 pdurbin I'm not saying you should fork. I'm saying you should continue to make pull requests for code changes you need. For any raw materials you need.
14:02 poikilotherm I fear of the consequences
14:03 poikilotherm Sorry, I don't get it. Could you give an example how this repo should look like and what I am supposed to do in there and what not?
14:03 pdurbin I expect it to look a lot like https://github.com/IQSS/dataverse-docker
14:03 pdurbin I only recently created that one for Slava.
14:04 poikilotherm Right.
14:04 pdurbin Back in June, it looks like.
14:04 pdurbin I'd be happy to do the same for you.
14:04 pdurbin Especially since you have a different vision than Slava.
14:04 poikilotherm Sure. But what I need is going way deeper than what Slava did
14:05 pdurbin IQSS would be happy to bless both community-supported approaches. Pick your poison. :)
14:05 poikilotherm Slava wanted to get started quick and sticked to the surface. I am hacking inside, doing plumbing
14:05 pdurbin Yeah, that's my impression too.
14:06 poikilotherm And I don't see any option how to do that in a separated repo and not call it a fork?
14:06 pdurbin You seem more focused on actually running in production. I don't think anyone is in production with Slava's stuff yet. It's great stuff, but I think it's proof of concept, great for demos.
14:06 poikilotherm The layers on top are mainly additions to the pom.xml now.
14:06 poikilotherm Everything else happens _in_ the codebase
14:06 pdurbin You want the same thing as Red Hat, basically. But you want to run on Kubernetes instead of OpenShift.
14:07 poikilotherm Yeah, that's just a distro of Kubernetes
14:07 pdurbin yeah
14:07 pdurbin You *could* volunteer to help Red Hat get Dataverse running well on OpenShift.
14:08 pdurbin At one point they were even talking about having Dataverse in their catalog next to Jenkins when you click "Java". Just click a button and you have an installation.
14:09 poikilotherm Sorry, I still don't get it. What I need to realize my vision, and what might help people using OpenShift/Kubernetes, is mostly about hacking on the codebase. not on the tools to get it deployed.
14:09 poikilotherm I liked your analogy
14:10 poikilotherm Its "plumbing", not the loader that brings the plumbs to the construction site
14:10 poikilotherm err - language. plumbs = tubes
14:11 pdurbin Ok, and you should continue to make pull requests for what you need. Please.
14:11 poikilotherm Sure :-D
14:11 pdurbin We don't know what you need. The 35 installations in production are fine with what we are already providing.
14:11 pdurbin Except for the bugs, I guess. :)
14:12 pdurbin And they want features. They always want features. :)
14:12 poikilotherm Well, then just declare Docker / Kubernetes a feature ;-)
14:12 pdurbin If we created a dataverse-kubernetes mailing list, how many people would sign up?
14:13 poikilotherm 1
14:13 poikilotherm Me
14:13 pdurbin heh
14:13 pdurbin I'll sign up to and read whatever you write.
14:13 pdurbin too*
14:13 poikilotherm Looking at dataverse-i18n makes me fear that is no road leading somewhere...
14:14 pdurbin not sure what you mean
14:14 pdurbin we have a roadmap :)
14:14 poikilotherm https://groups.google.com/forum/#!forum/dataverse-internationalization-wg
14:15 poikilotherm Pretty dead
14:15 poikilotherm I don't think a dataverse-kubernetes list will have better chances
14:15 poikilotherm And actually this hole stuff is not only about Kubernetes
14:15 pdurbin Oh. True. Lots of pull requests though. Jayanthy is working incredibily hard on internationalization.
14:16 poikilotherm Kubernetes is just a tool
14:16 poikilotherm The truly good stuff is the underlying Docker
14:16 poikilotherm There is a good reason pameyer invented docker-aio
14:16 pdurbin That was Pete's reaction. "This post is about a lot more than Glassfish vs. Payara." I'm paraphrasing.
14:17 poikilotherm More or less, it is. This discussion is the base for a lot of stuff down the road
14:17 pdurbin yeah, I was stuck doing Vagrant only, he showed me the way with Docker
14:17 pdurbin Do you want part 3 feedback. "security needs" and the glassfish vs payara stuff?
14:18 poikilotherm "Give to me baby, aha aha...."
14:19 pdurbin Just because app servers haven't had a lot of CVEs lately doesn't mean much. We should always be vigilant. Who knows what will be discovered. This is true of any software, including Dataverse.
14:19 poikilotherm True
14:20 pdurbin I guess Java app servers don't fit into the normal "yum" or "apt" model that I'm so used to. I like it when my software is updateable via a package manager. I don't know if wildfly is in an rpm or not, coming from Red Hat. It would be nice.
14:21 pdurbin But anyway, yes, quarterly releases, I guess. Different than Solr, which releases as soon as they discover a security vulnerability. Solr is also written in Java, of course. I haven't heard of Solr RPMs either.
14:21 pdurbin I like the characterization as Eclipse Glassfish as the new kid on the block. I should have used that phrase during tech hours the other week.
14:22 poikilotherm Well... it is no "new kid on the block".
14:22 poikilotherm https://projects.eclipse.org/projects/ee4j.glassfish/reviews/5.1.0-release-review
14:22 pdurbin I didn't know Payara had a non profit org as well as a for profit org. Interesting.
14:22 poikilotherm This is first release from Eclipse. There are no API or implementation changes and has just new licensing and Maven coordinates.
14:22 poikilotherm This is more or less Glassfish 5.0 renamed.
14:22 poikilotherm Old wine in new pipes
14:23 pdurbin Yeah, I think it'll be mostly the same. What's new is the team behind it. Hopefully the code works the same.
14:23 pdurbin no regressions
14:23 pdurbin fingers crossed
14:23 pdurbin I hadn't heard of "variables or pwd aliases".
14:24 pdurbin But you say it's essential stuff so ok.
14:24 poikilotherm Do you want an showcase?
14:24 pdurbin I don't know the difference between "application scoped" and "domain scoped" resource configuration.
14:25 pdurbin I tried your small-container branch and Dataverse doesn't "just work" yet. I believe you'r still blocked in setup-all.sh territory. I'd be happy to try to help get you unblocked.
14:25 pdurbin I think that's it. I could go on about Glassfish but whatever.
14:26 poikilotherm I see. Maybe this is one of the reasons why nobody catched up...  ;-)
14:26 poikilotherm To much unknown territory
14:26 poikilotherm Sorry about that
14:26 poikilotherm +o
14:26 pdurbin nobody replied to you?
14:27 poikilotherm Only Gustavo in private email
14:28 poikilotherm Asked for a timeslot this week, no reaction on my offered slots
14:28 pdurbin Oh! Sounds like progress.
14:28 pdurbin And Pete gave you a little feedback here in IRC.
14:28 poikilotherm "Progress"
14:28 poikilotherm Yes
14:28 poikilotherm He seemed interested
14:28 poikilotherm IIRC "lots of good stuff in there"
14:28 pdurbin Well, Pete doesn't use Kubernetes and doesn't want to last I checked.
14:29 poikilotherm (paraphrazed)
14:29 poikilotherm But he wants Docker :-D
14:29 pdurbin Well, he wants his stuff not to break.
14:29 pdurbin His installation of Dataverse is different than the others. The only one with rsync.
14:29 poikilotherm I can't emphasize it enough: Kubernetes is just something on top of Docker. My work is all about Docker first, the Kubernetes part is easy later on
14:30 poikilotherm Obviously
14:30 pdurbin Dev for https://github.com/IQSS/dataverse/pull/5359 is happening within three Docker images. Dataverse, DCM, and a client.
14:30 poikilotherm IIRC he was with me on the integration testing part and docker-aio not so well suited for this, as it would make it even more bloated
14:31 pdurbin DCM is Data Capture Module.
14:31 poikilotherm :-)
14:31 pdurbin The three of us are highly interested in running integration tests *before* pull requests are merged. If you can help with this using Docker or anything, please let us know.
14:32 poikilotherm Yes I can
14:32 poikilotherm (no, that wasn't Obama speaking)
14:32 poikilotherm But it will need pretty Docker stuff
14:32 pdurbin We have an AWS account we can use to spin up EC2 instances if necessary. I can een give you access. I already gave Don an account. We're using his dataverse-ansible code.
14:32 poikilotherm And tooling like arquillian
14:33 pdurbin For this EC2 stuff I'm talking about http://guides.dataverse.org/en/4.9.4/developers/deployment.html
14:33 poikilotherm Actually I don't want to be on your bill.... :-D All that would be necessary is sth. like Travis CI which is for free
14:33 poikilotherm Or Jenkins
14:33 poikilotherm Or whatever :-D
14:34 pdurbin The doc was delivered as part of this "spin up arbitrary branches: phase 1" issue: https://github.com/IQSS/dataverse/issues/4990
14:34 pdurbin That's fine. I'm just saying that I don't mind if you're on our bill if you're helping us with this stuff.
14:34 poikilotherm Ok, good to know :-)
14:35 poikilotherm I hope you are with me on the track of "use existing commonly used tools" instead of "create your own toolset"?
14:35 pdurbin It's barely used. The new script we wrote to spin up EC2 instances. Kind of sad.
14:35 pdurbin Oh, sure. I'm the one who added Travis and Coveralls to the Dataverse mix.
14:35 poikilotherm It's cool for trying out stuff I think
14:36 poikilotherm We sth. in mind about using it or a similar script for deployment of testing branches to Kubernetes for RAD approaches to UI/UX dev
14:36 pdurbin I'm the one who moved us from Sourceforge SVN to GitHub too. :)
14:37 poikilotherm (I may not use AWS here, conflicts with company policy)
14:37 poikilotherm :-)
14:37 poikilotherm Cool!
14:37 pdurbin A guy Raman moved us from Redmine to Github issues.
14:38 pdurbin But who would host Kubernetes for us... oh, you're saying the script could be adjusted to used Kubernetes on AWS instead of raw EC2 instances?
14:38 poikilotherm At least one could try that
14:38 pdurbin to use*
14:38 poikilotherm Haven't looked into it yet
14:38 poikilotherm But why not
14:38 pdurbin But you aren't allowed to try it on AWS? Company policy?
14:39 poikilotherm I cannot make FZJ pay the bill for an account of our own
14:39 pdurbin But I could add you to our account like how I added Don from UNC.
14:39 poikilotherm Maybe...
14:40 poikilotherm But let me do plumbing first :-D
14:40 MrK I just read the first lines but how poikilotherm would you vision the docker containers would be, since right now replacing war is a lot of a trouble for me there :D
14:40 pdurbin Then you could play Kubernetes in there.
14:40 poikilotherm MrK I am using the fabric8io docker maven plugin
14:40 pdurbin maybe MrK would be the second subscriber to the dataverse-kubernetes mailing list, if we make one :)
14:40 poikilotherm It supports a watch target for auto rebuilding
14:41 poikilotherm So this would be sufficient for dev purposes I think
14:41 poikilotherm For production new images would be needed to get pushed to a registry
14:41 poikilotherm Either into your custom registry, user account on Docker hub or somewhere else (currently using quay.io)
14:42 MrK right now we had a pipline on jenkins where we just cloned a branch created war and replaced it but even dockerEU is not that easy.
14:42 poikilotherm The production containers need to be replaced in a rolling-update fashin
14:42 poikilotherm Yeah...
14:42 poikilotherm I know...
14:42 MrK yeah thats not a problem you would have a pipeline clone-maven-push image- and docker up
14:43 poikilotherm for local dev that would shorten to clone+mvn
14:43 poikilotherm As it is right now :-D
14:43 MrK yep
14:43 poikilotherm That doesn't need to change alot anymore ;-)
14:43 poikilotherm The changes to the pom.xml are rather trivial
14:44 MrK poikilotherm: poke me if you would have a working version where you can just do it so simply, my co-programmer will also be happy :D
14:44 poikilotherm The pipeworks and plumbing inside the code base is the hard nut to crack
14:44 poikilotherm Sure :-D
14:44 poikilotherm You could watch issue #5292 ;-
14:44 poikilotherm ;-)
14:45 poikilotherm (and related)
14:45 MrK I also have a voice call with Slava on friday since I'm wondering how they do it
14:45 poikilotherm Most certainly they will do a rebuild, too
14:46 MrK the problem is that when i replace stuff some frontend stuff are missing like an icon on navbar or the upload function is completely missing and I don't even know why since the same war locally is working fine. but in docker, nope
14:48 poikilotherm IMHO dataverse-docker is pretty bloated and tries to depict the normal setup into containers. IMHO this makes things worse than better.
14:48 poikilotherm But this is just my opinion and if Slava is happy with their solution that is perfectly fine to me
14:48 poikilotherm -to +for
14:49 MrK Yeah I agree the project which in mind should've been easier and help is as much complicated as normal version
14:49 poikilotherm I am an optimist and perfectionista
14:49 poikilotherm Just like pdurbin...
14:50 poikilotherm So I try to improve and contribute to upstream.
14:51 poikilotherm Hey pdurbin: do you think it might be a good idea to add some labels to my issues?
14:51 poikilotherm Like "Community dev blocked" and maybe "Docker"?
14:52 poikilotherm Maybe I didn't a good job talking to the devs about what this is and how important it is
14:53 poikilotherm I really hate forking, but I would really appreciate some feedback from the core devs
15:08 pdurbin_m joined #dataverse
15:09 pdurbin_m poikilotherm: the easiest way to get attention is to make a pull request. We discuss issues in the code review column every morning at standup, in about an hour.
15:10 pdurbin_m I don't think more labels will help.
15:10 poikilotherm Ok
15:10 poikilotherm THe problem with that approach: implementing a change might be a reasonable amount of work
15:11 poikilotherm And not knowing if it is worth it or gets declined is... "suboptimal"
15:11 poikilotherm I would REALLY appreciate discussing things first before starting implementation wokr
15:13 pdurbin_m maybe just make a change to the dev guide then
15:14 pdurbin_m documentation driven development
15:14 pdurbin_m Does that make sense?
15:14 poikilotherm Well... I can try.
15:14 poikilotherm Altough I prefere TDD
15:14 poikilotherm But ok
15:14 pdurbin_m thanks
15:16 poikilotherm I will create a PR for the necessary changes on the EJB timers
15:16 poikilotherm This is rather triviail
15:16 poikilotherm And if that starts moving things...
15:17 MrK poikilotherm: I saw there is also a problem with some sequence sometimes it wont allow app to run.
15:34 pameyer joined #dataverse
15:47 poikilotherm pdurbin most certainly I will be late for the call
15:48 poikilotherm Arrival of government is delayed by traffic jams
15:49 poikilotherm Hear you guys later :-)
16:11 Paul66 joined #dataverse
16:12 Paul66 Hi, what is the largest file size allowed and is there a limit on total file space?
16:15 pameyer Paul66: for normal uploads, I think file size is configurable.  not sure about total space
16:15 Paul66 ok, thanks!
16:15 Paul66 left #dataverse
17:04 pdurbin There is no limit on total file space. Please see https://github.com/IQSS/dataverse/issues/4339
17:07 pdurbin The community call is starting. Trying gotomeeting this time: https://dataverse.org/community-calls
17:16 poiki-at-home joined #dataverse
17:17 poiki-at-home Hey guys, is the meeting still going?
17:19 poiki-at-home Just joined, listening. Need to check micro later when appropriate
18:06 craig-willis joined #dataverse
18:31 pdurbin craig-willis: thanks for calling in and participating!
18:39 craig-willis pdurbin: you've suggested for years (?) that I join and now I have multiple good reasons.  Very helpful to hear where y'all are headed.
18:40 pdurbin It all seems like madness when we start talking about file hierarchy. Everyone is surprised when they discover how Dataverse stores files.
18:40 jri joined #dataverse
18:40 pdurbin pameyer had to build a parallel way to store files where the filenames are not scrambled on disk.
18:41 pdurbin I was surprised as anyone when I first looked at my file system with Dataverse running.
18:41 pdurbin So it's a challenging situation.
18:42 pdurbin As I was saying we can probably at least start by bringing back the file hiearchy feature we had in DVN 3: Zip file in, expand and put the path for each file in the database, zip file out. But less buggy. It would be a start.
18:43 craig-willis Several systems I work with are similarr, I expect.  I don't think DataONE supports hierarchy, so we've been publishing our own map.
18:43 pdurbin interesting
18:43 pdurbin What was the original thinking? You'll never need to upload more than a handful of files?
18:44 pdurbin You'll only interact with the files via the GUI?
18:44 pdurbin Don't you want to save storage space when the same file is in multiple versions?
18:44 pdurbin I don't know.
18:44 pdurbin It's complicated. :)
18:46 craig-willis I don't know the original thinking either.
18:49 pdurbin The TRSA stuff might help. Jon said files in there will be in a hierarchy.
18:50 pdurbin pameyer's rsync stuff helps. It'll be interesting to see if other Dataverse installations adopt it.
18:57 bricas it's been a while since i've been here. oops.
18:58 bricas pdurbin: you asked me a question, oh, a month ago .... :)
18:58 pdurbin Did I? Don't worry about it. :)
18:58 bricas "how about you? Glassfish or Payara?"
18:59 bricas we use glassfish
18:59 pdurbin Yeah, I think everyone does. Glassfish is what's in our installation guides.
19:00 pdurbin bricas: but I'd be interested to hear what you think about https://groups.google.com/d/msg/dataverse-dev/L6rNXbbZG0o/UFVfI0_gAQAJ
19:05 bricas we also have the goal of moving our dataverse install to our k8s infrastructure
19:05 pdurbin interesting
19:06 pdurbin I keep thinking that very few people want this.
19:06 bricas making this as easy as possible would be great. we don't need anything super "high availability" since our demand is pretty low (at least right now)
19:07 pdurbin bricas: sure. Most installations run Dataverse all on one box, I believe. Can you please take a look at https://github.com/IQSS/dataverse/issues/5292 ? That's the "epic" Oliver opened and just talked about on the community call. He'd love feedback.
19:07 bricas using glassfish/payara has never really mattered to us since we're not java folks and just need it to work :)
19:08 pdurbin heh, exactly
19:08 bricas we're containerizing like crazy 'round here.
19:08 pdurbin interesting, I haven't seen it at Harvard yet
19:08 pdurbin but I'm in a different role now
19:08 pdurbin I'm one of those java folks now, I guess. No longer a cluster person.
19:09 pdurbin mostly the push seems to be to move to AWS
19:09 pdurbin which we did for Harvard Dataverse
19:10 pdurbin but we also have a nice new datacenter in western mass. Cheaper to run compute out there that AWS. I don't know. Again, I'm less involved in this stuff now.
19:13 poiki-mobile joined #dataverse
19:13 pameyer it's interesting that the various containerization bits expose different areas of tech debt than the recommended installation path
19:13 bricas we use a number of amazon services but haven't moved to their infrastrcuture
19:14 pdurbin plenty of tech debt
19:14 pdurbin bricas: you're running k8s on your own hardware?
19:15 bricas we have a decent setup here and so far we're happy to stick with it but there may come a day when that cost is too much and it's easier just to throw money at aws
19:15 bricas yep!
19:15 bricas containerzing definitely shapes your mindset for deployment
19:17 poiki-mobile Good evening boys n girls ;-)
19:17 poiki-mobile thx for a very pleasing community call. although it was a bit confusing hearing the bus stations... :-D
19:18 pameyer especially the bus station in a different city, with different announcements ;)
19:18 poiki-mobile Kiddo needs his fairy tale now, shortly AFK
19:19 pdurbin poiki-mobile: thanks for calling in. You have another customer: bricas :)
19:19 pdurbin maybe :)
19:21 pdurbin poiki-mobile: and I'm talking about your epic at https://javabot.evanchooly.com/logs/%23glassfish/2018-12-04 if you want to join in when you return. #glassfish on freenode
19:36 poiki-mobile2 joined #dataverse
19:38 poiki-mobile2 just joined #glassfish
19:39 craig-willis joined #dataverse
19:48 pdurbin heh, nice
19:51 craig-willis joined #dataverse
19:56 poikilotherm joined #dataverse
20:01 jri joined #dataverse
20:22 jri joined #dataverse
20:42 jri joined #dataverse
21:03 jri joined #dataverse
21:21 pdurbin poikilotherm: lots of chatter over there!
21:23 jri joined #dataverse
21:39 poikilotherm pdurbin: yeah. Bad?
21:44 jri joined #dataverse
22:03 poikilotherm pdurbin: any news from tech hours?
22:03 poikilotherm Did anyone bring a pile of stones to throw in my direction?
22:07 pdurbin_m joined #dataverse
22:07 pdurbin_m poikilotherm: good chatter
22:08 pdurbin_m We didn't talk about your stuff at tech hours. Mostly it was Make Data Count and security.
22:20 poikilotherm :-)
22:20 poikilotherm Sounds good
22:20 poikilotherm I really enjoyed the feedback from the community about Docker/k8s
22:20 pdurbin_m me too
22:21 poikilotherm Seems like this might strike a chord
22:25 jri joined #dataverse
22:26 poikilotherm Alright, I am off for the day... 23:26 over here. Bedding time
22:26 poikilotherm Read you tomorrow. And please pdurbin: - don't let me get between you and your wife at 6:30 am in the morning ;-) It's "just" Dataverse

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.