IQSS logo

IRC log for #dataverse, 2020-11-12

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
10:50 juancorr joined #dataverse
11:32 Ricarda joined #dataverse
11:33 Ricarda I have a non-technical question. I am describing Dataverse in a paper and would like to include a citation about the project itself and about the community. Are there recent publications you can recommend? And is there information on the number of members in the DataCite community?  Any help is appreciated :)
12:03 Virgile joined #dataverse
12:09 Virgile joined #dataverse
12:33 Ricarda joined #dataverse
12:43 Virgile joined #dataverse
13:03 Virgile joined #dataverse
13:13 Virgile joined #dataverse
14:31 donsizemore joined #dataverse
15:07 pameyer joined #dataverse
15:08 pameyer looks like I missed Ricarda , but there is a recommended publication for the Dataverse project.  don't recall offhand, but pretty sure it's listed at dataverse.org
15:15 pdurbin joined #dataverse
15:17 pdurbin poikilotherm: so what did you think of that meeting?
15:30 poikilotherm pameyer pdurbin maybe in the not so future Dataverse would be able to eat its own dogfood by depositing it to Harvards Dataverse as a scholarly output of research software engineering.
15:31 poikilotherm pdurbin: seems like a good kickoff to me :-)
15:32 pdurbin yeah, me too
15:32 poikilotherm I actually liked Slavas idea of creating a Github issue tracker for topics and meetings
15:33 pdurbin Oh, for each of the 3 subtopics?
15:33 poikilotherm Maybe a Github-linkable issue tracker would be easier to follow for results and output compared to a closed access google drive.
15:33 poikilotherm Input for Dataverse TV :-D
15:34 pdurbin Ah, good point.
15:34 pdurbin I was glad to see people pushing for recordings of the calls.
15:34 poikilotherm Will be on a workshop from Code Ocean in ~25 minutes
15:35 pameyer poikilotherm - you're thinking Dataverse as a software dataset in dataverse?
15:38 poikilotherm pameyer: aye
15:38 pdurbin poikilotherm: do you have time to talk more about the postgres driver pull request?
15:42 poikilotherm Depends. Workshop in 20 minutes. IIRC for 1.5h
15:42 poikilotherm So quickly now or after that
15:43 pdurbin Can I call you? Maybe on GDCC Slack?
15:43 poikilotherm Oh. Workshop is only 1h
15:44 poikilotherm Go go go
15:52 pameyer poikilotherm - that got me wondering about which dataset in dataverse has the most authors/contributors.  because dataverse itself seems to have quite a few
15:59 pdurbin probably something from physics
16:02 pameyer probably
17:08 poikilotherm CodeOcean is a nice product, as is Whole Tale, etc.
17:08 poikilotherm But still I couldn't find someone answering the hard question...
17:08 poikilotherm How do you actually provide a guarantee for reproducibility for 10 years +
17:08 poikilotherm When using a container, you might suffer from hardware arch changes, kernel ABI changes etc etc
17:09 poikilotherm The best answer I got to that so far was during the DCM 2020
17:09 pdurbin 10+ years is a long time. Does it help that some of these tools are open source so you can run them yourself?
17:10 poikilotherm That those tools are open source definitely helps.
17:10 poikilotherm Even more important is that the research software itself uses as much open source as possible
17:11 poikilotherm It really is a much harder task to let a propiertary from 10 years ago execute your code today than running it with open source software
17:12 poikilotherm I remember with fear all the VMs and old hardware machines we tried to keep up and running so we could use tools that old to operate a large scientific experiment
17:13 poikilotherm Just like with Dataverse: there is no glory or funding for maintenance in science
17:21 pdurbin Not even in other countries? :)
17:38 pameyer to guarantee something for 10 years, you'd really need funding for 10 years
17:39 pameyer open source tools help to a degree, but only a bit.  there's an increasing amount of open-source cuda code for example
17:40 pameyer part of why treating research reproducibility as "can you re-execute the code" seems to be to not be aiming at the right problem
17:41 donsizemore just use perl =)
17:43 pameyer oddly enough, I had an issue with perl code breakling lately.  same system, same codebase, all of a sudden wouldn't work.  but it needed to talk to a server, and that server was using https, and decided to require use of newer ciphers than the old code could handle...
17:43 donsizemore !!!
17:43 pameyer ... but it's easier to break stuff than to keep it working ;)
17:51 pameyer I happy that linux is so far avoiding the code-signing/notarization/etc train.  wouldn't be too shocked if expiring certs caused issues for software in the future
17:59 donsizemore you missed the expired cert chain funsies in may?
18:00 pdurbin is written in Perl
18:02 pameyer I absolutely did not miss the expired cert chain "fun" in may ;) .  got to figure out the build process for a tool that I hadn't known embedded the expired cert on Sun when a user reported "hey, this is broken"
18:02 pameyer has been pretty solid
18:16 poikilotherm pdurbin did you see https://github.com/ossf/scorecard ?
18:18 pdurbin no but I'm nervous already
18:48 pameyer I'm not seeing anything about "this is why criteria $x is on the list"
20:10 FernandoSantana joined #dataverse
20:10 FernandoSantana I have a problem using the API
20:10 FernandoSantana { status:"error", message:"Endpoint available using API key only. Please contact the dataverse administrator"}
20:12 pdurbin FernandoSantana: hi, let me look for that error quick.
20:12 pameyer FernandoSantana - are you setting an API key in your request headers?
20:13 FernandoSantana moment please
20:14 FernandoSantana yes.
20:14 pdurbin FernandoSantana: the error is leading me to this... you probably have :BlockedApiKey set to something. Please see https://guides.dataverse.org/en/5.1.1/installation/config.html#blockedapikey
20:14 FernandoSantana I want to remove
20:14 FernandoSantana version 4.11 on Dataverse.
20:15 pdurbin Can you see the key if you do curl http://localhost:8080/api/admin/settings/:BlockedApiKey ?
20:15 FernandoSantana { status:"error", message:"Endpoint available using API key only. Please contact the dataverse administrator"}
20:16 FernandoSantana i have the key
20:17 FernandoSantana Use key with the command CURL, message below
20:17 FernandoSantana { status:"error", message:"Endpoint available using API key only. ..
20:19 pdurbin If you want to remove the key you should probably change :BlockedApiPolicy from unblock-key to localhost-only. See https://guides.dataverse.org/en/5.1.1/installation/config.html#blockedapipolicy
20:20 pdurbin Most installations run with "localhost-only" I believe. It's the default.
20:20 FernandoSantana Ok.
20:20 FernandoSantana moment please.
20:23 FernandoSantana error :/
20:23 FernandoSantana { status:"error", message:"Endpoint available using API key only. Please contact the dataverse administrator"}
20:24 FernandoSantana use localhost only, but the message error above
20:25 FernandoSantana [root@vps11730 ~]# curl http://localhost:8080/api/admin/settings | jq .   % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current                                  Dload  Upload   Total   Spent    Left  Speed 100   111    0   111    0     0   4545      0 --:--:-- --:--:-- --:--:--  5045 parse error: Invalid numeric literal at line 1, column 9 [root@vps11730 ~]#
20:26 pameyer this seems like it might call for psql
20:26 pameyer but I defer to pdurbin's API knowledge
20:27 pdurbin What if you just do `curl http://localhost:8080/api/admin/settings` and remove the `| jq`?
20:28 FernandoSantana { status:"error", message:"Endpoint available using API key only. Please contact the dataverse administrator"}
20:28 pdurbin Oh, I know what that is... the error isn't valid JSON. "status" and "message" should be in quotes.
20:30 pdurbin What if you do this (using your key): curl http://localhost:8080/api/admin/settings/:BlockedApiKey?unblock-key=YOUR_KEY_HERE
20:30 FernandoSantana even using the quotation marks there is an error.
20:30 FernandoSantana I will do what you mentioned above
20:31 FernandoSantana {"status":"OK","data":{"message":"******"}}[root@vps11730 ~]#
20:31 FernandoSantana used curl http://localhost:8080/api/admin/settings/:BlockedApiKey?unblock-key=*****
20:32 dataverse-user joined #dataverse
20:32 pdurbin Ok, so, to use the API right, you have to add that unblock-key=***. So we should be able to add that to change the policy.
20:33 pdurbin Like this: curl -X PUT -d localhost-only http://localhost:8080/api/admin/settings/:BlockedApiPolicy?unblock-key=YOUR_KEY_HERE
20:33 FernandoSantana how to remove now not to use?
20:37 pdurbin Did that work?
20:38 FernandoSantana yes
20:39 FernandoSantana however any command with curl it presents the error message of the api
20:42 pdurbin Hmm. Maybe pameyer is right. Maybe it's time to switch to psql.
20:44 pdurbin Once you're connected to your database, you could try this: SELECT * FROM setting;
20:45 FernandoSantana ok
20:46 FernandoSantana I will do this
20:49 FernandoSantana dvndb=# SELECT * FROM setting;                  name                  |                                  content ---------------------------------------​+--------------------------------------​-------------------------------------  :AllowSignUp                          | yes  :SignUpUrl                            | /dataverseuser.xhtml?editMode=CREATE  :BlockedApiEndpoints                  | admin,test  :SystemEmail
20:49 FernandoSantana ..
20:49 FernandoSantana :AllowSignUp                          | yes
20:49 FernandoSantana :SignUpUrl                            | /dataverseuser.xhtml?editMode=CREATE
20:49 FernandoSantana :BlockedApiEndpoints                  | admin,test
20:50 FernandoSantana :SystemEmail                          | curadoriadata@gmail.com
20:50 FernandoSantana :Authority                            | 20.500.12196
20:50 FernandoSantana :IdentifierGenerationStyle            | sequentialNumber
20:50 FernandoSantana :DoiProvider                          | FAKE
20:50 FernandoSantana :ApiTermsOfUse                        | http://best-practices.dataverse.org/harvard-policies/harvard-api-tou.html
20:50 FernandoSantana :ExcludeEmailFromExport               | true
20:50 FernandoSantana :ApplicationTermsOfUse                |
20:50 FernandoSantana :TabularIngestSizeLimit               | 2000000000
20:50 FernandoSantana :TwoRavensTabularView                 | true
20:50 FernandoSantana :DatasetPublishPopupCustomText        | Arquivo criado com sucesso no Repositório CIDACS
20:50 FernandoSantana :PVMinLength                          | 6
20:51 FernandoSantana :PVMaxLength                          | 0
20:51 FernandoSantana :PVNumberOfConsecutiveDigitsAllowed   | 4
20:51 FernandoSantana :Protocol                             | hdl
20:51 FernandoSantana :FilePIDsEnabled                      | true
20:51 FernandoSantana :NavbarAboutUrl                       | Dataverse
20:51 FernandoSantana :Shoulder                             | CIDACS/
20:51 FernandoSantana :IndependentHandleService             | true
20:51 FernandoSantana :LoginSessionTimeout                  | 240  :ApplicationPrivacyPolicyUrl          | htt
20:51 FernandoSantana :ApplicationPrivacyPolicyUrl          | https://dataverse.org/best-practices/harvard-dataverse-privacy-policy
20:51 FernandoSantana :MaxFileUploadSizeInBytes             | 2147483648
20:51 FernandoSantana :ZipDownloadLimit                     | 20000000000
20:51 FernandoSantana :TabularIngestSizeLimit:Rdata         | 20000000000
20:52 FernandoSantana :SolrMaxFileSizeForFullTextIndexing   | 31457280000
20:52 FernandoSantana :SolrHostColonPort                    | dataverse.intracidacs.org:8983
20:52 FernandoSantana :RepositoryStorageAbstractionLayerUrl | https://dataverse.intracidacs.org
20:52 FernandoSantana :DownloadMethods                      | rsal/rsync
20:52 FernandoSantana :AllowApiTokenLookupViaApi            | true
20:52 FernandoSantana :ProvCollectionEnabled                | true
20:52 FernandoSantana :DataCaptureModuleUrl                 | http://localhost:5000
20:52 FernandoSantana :UploadMethods                        | dcm/rsync+ssh
20:52 FernandoSantana :BlockedApiPolicy                     | unblock-key
20:53 FernandoSantana ...
20:53 FernandoSantana ...
20:53 FernandoSantana these are the select select parameters
20:53 pdurbin That's the one. The last one. You should change :BlockedApiPolicy from unblock-key to localhost-only.
20:55 pameyer I tend to do db changes in transactions, because I know I make typo's
20:56 FernandoSantana how do i do it right in the bank?
20:56 FernandoSantana I don't know from the dataverse bank
20:56 pameyer so rather than "UPDATE setting SET content='localhost-only'" , I'd do "BEGIN;" before it
20:57 pameyer not sure what you mean by "bank" though
20:57 FernandoSantana database*
20:57 pameyer ah - got it
20:57 FernandoSantana sorry
20:58 pameyer using "BEGIN;" starts a transaction - it doesn't change the database until a following "COMMIT;" statement.  so if I make a mistake, I can abort it before breaking things
20:59 pameyer for connecting, I'd usually use something like "psql -h localhost -U postgres dvndb" - assuming that you're on the system where the database lives, and that the user and database names match
20:59 FernandoSantana moment please
21:00 pameyer also, if you're going to be doing stuff to the database, it's always a good idea to have backups
21:00 pdurbin But how do you know if you've made a mistake? You do SELECT after your UPDATE? I've never played with BEGIN and COMMIT.
21:00 FernandoSantana i have backup
21:01 pameyer sometimes by checking with SELECT after updates
21:02 pameyer sometimes just by not having errors in the statements I'm trying to execute
21:03 pdurbin gotcha
21:03 FernandoSantana ok.
21:34 pdurbin pameyer: looks like he's all set: https://github.com/IQSS/dataverse/issues/7404 ... the curl command seems to have worked.
21:35 dataverse-user joined #dataverse
21:50 pdurbin left #dataverse
22:20 pameyer pdurbin: great
22:50 nightowl313 joined #dataverse

| Channels | #dataverse index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

Connect via chat.dataverse.org to discuss Dataverse (dataverse.org, an open source web application for sharing, citing, analyzing, and preserving research data) with users and developers.