IQSS logo

IRC log for #dvn, 2013-08-27

We've moved! Please join #dataverse instead. The new logs are at http://irclog.iq.harvard.edu/dataverse/today

| Channels | #dvn index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

All times shown according to UTC.

Time S Nick Message
12:09 jwhitney joined #dvn
12:11 pdurbin jwhitney: good morning!
12:11 jwhitney pdurbin: hello!
12:11 pdurbin lots of stuff happening on this end! :)
12:12 pdurbin hopefully stuff you like... new features mostly... but a few changes as well
12:17 pdurbin jwhitney: when you get a chance, can you please scroll to the bottom of this and check out my comment about "lookup a single dataverse"? DVN SWORDv2 implementation design document: https://docs.google.com/document/d/1Sw8ZTjelFtWIi1etWqgQ0gPh6ltJuh8voX00WUi7Sng/edit?usp=sharing
12:22 jwhitney pdurbin: that's fine -- the plugin accommodates this quite happily
12:22 jwhitney pdurbin: a lookup has to occur after username / password entered, so just as easy to select from multiple possible dataverses
12:24 pdurbin jwhitney: great. I figured you weren't relying on a single SWORD collection being returned in the Service Document but I wanted to make sure
12:25 pdurbin jwhitney: I think the other changes I want to communicate are captured here: https://github.com/dvn/dvn-devguide-src/commits/master
12:26 pdurbin i.e. the commits behind the curl examples at http://devguide.thedata.org/features/api/data-deposit/v1
12:26 pdurbin jwhitney: so, a big one is this: "replace all files" no longer supported
12:27 pdurbin "replace all files" no longer supported · 1d5c36b · dvn/dvn-devguide-src - https://github.com/dvn/dvn-devguide-src/commit/1d5c36b
12:28 pdurbin we're concerned about inefficiency... we want people to delete individual files instead and add new files if they need to
12:30 jwhitney pdurbin: yes, ok, that works here.
12:33 pdurbin jwhitney: great. the "delete file" and "add files" curl commands are on that page
12:34 jwhitney pdurbin: I see, yes. So it's little different to post new a zip of new files & delete individual files
12:34 jwhitney pdurbin: rather than a full replace
12:34 pdurbin right, exactly
12:35 pdurbin jwhitney: if you read my note at https://github.com/IQSS/dvn/blob/aac0e952c47c6f1bd1379bdfc6b70d9e068f7796/src/DVN-web/src/edu/harvard/iq/dvn/api/datadeposit/MediaResourceManagerImpl.java#L119 you'll see my thoughts on this method
12:36 pdurbin in short, the SWORD spec says that perhaps some kind of versioning could be done in the case of a replace. but for now I just throw an error. maybe in the future we'll support this method with versioning but not in v1
12:39 pdurbin jwhitney: I just wanted to make sure you're aware of this change because "replace all files" used to be the main way to add files to a study
12:46 jwhitney pdurbin: that's great -- thanks for filling me in.  Only minor changes here --  different swordappv2 library call to post rather than put
12:47 pdurbin right, makes sense
12:49 pdurbin jwhitney: in the past we've talked about our files on the DVN side have metadata such as filename, category, and description. I don't have a way to set descriptions for files. Is this a problem for the OJS use case? See also this ticket about this: https://redmine.hmdc.harvard.edu/issues/3232
13:15 pdurbin jwhitney: does that makes sense? I'm trying to ask if it's ok if we don't populate "description" for each file on the DVN side
13:17 jwhitney pdurbin: for now, yes, I think so. OJS gives authors the option to add metadata to supplementary files (title, creators, keywords, etc.) that may differ from article-level metadata for these fields.
13:19 jwhitney pdurbin: so potentially, ojs is collecting more metadata than can currently be sent over to dataverse, unless some of the file-level metadata is propagated upward to fill in absent study-level fields
13:19 jwhitney pdurbin: although that's probably not a great idea
13:20 pdurbin jwhitney: right, OJS is collecting more metadata about files and it wouldn't all appear on the DVN side
13:21 jwhitney pdurbin: yep
13:21 pdurbin jwhitney: I'm looking at your screenshot at author_describe_datafile.png - Google Drive - https://docs.google.com/file/d/0B8Zfl4GMgyejMVV2VUV6QkptN3M/edit
13:22 pdurbin looks like for a file in OJS, you can have Title, Author(s), Keywords, Brief description, Category, and Date
13:22 jwhitney pdurbin: yes, this is based on the supplementary file upload
13:23 pdurbin jwhitney: to support all this, you and I would need to agree on some sort of manifest file, I guess... or some other way to store all this information within the zip file that is sent across via SWORD
13:24 jwhitney pdurbin: I'm wondering if it's better to capture file-level metadata that's only available OJS side, or use a simpler interface to only capture what Dataverse will currently store
13:25 pdurbin jwhitney: oh, are you saying you could expose only a few fields on the OJS side? Only the fields we can receive on the DVN side? (filename and category)
13:27 jwhitney pdurbin: that's what I'm wondering: if that approach would be too limiting for submitters
13:27 pdurbin it might feel limiting, yes
13:27 posixeleni joined #dvn
13:27 posixeleni hi pdurbin and jwhitney!
13:28 pdurbin but it would probably be frustrating for submitters if they filled in a bunch of fields that don't get propogated to the DVN side
13:28 jwhitney hello!
13:28 jwhitney pdurbin: agreed
13:28 pdurbin posixeleni: are you following this?
13:28 pdurbin posixeleni: and hello! :)
13:29 posixeleni i saw you chatting about the invidiviual file level metadata and just wanted to ask a quick question about how OJS would capture send over to us about the overall metadata for the Dataverse study
13:29 pdurbin posixeleni: oh, well, that's different... study-level metatadat
13:29 posixeleni we got that covered right?
13:30 pdurbin well, let's finish the file-level metadata (i.e. description) discussion
13:30 pdurbin for now anyway :)
13:30 posixeleni cool sorry to interrupt!
13:30 pdurbin I'm in favor of limiting the visible fields on the OJS side to what we can receive on the DVN side (filename and category)
13:31 pdurbin I realize this is limiting, but I'm more worried about the frustration submitters would feel when they realize the description, etc. doesn't get propogated to the DVN side
13:31 jwhitney I agree -- otherwise, I think it's misleading to collect metadata that doesn't get deposited
13:32 pdurbin yeah
13:32 jwhitney also, I type slowly.
13:32 jwhitney :)
13:32 pdurbin and on the DVN side, we don't even have some of those fields on the file level, such as Author
13:32 pdurbin :)
13:33 jwhitney yes, and I think it makes more sense to extend the plugin if / as DVN changes
13:33 pdurbin again, for description (which we do have at the file level), I opened a ticket to maybe support populating description for each file inside a zip file in the future (not v1): https://redmine.hmdc.harvard.edu/issues/3232
13:34 pdurbin jwhitney: right. this is v1. we want feedback from actual users of the plugin
13:35 jwhitney pdurbin: ok, that's great
13:35 pdurbin posixeleni: I think you can go ahead now :)
13:35 pdurbin jwhitney: thanks
13:35 pdurbin jwhitney: it's no fair, two against one ;)
13:36 pdurbin ruebot: you can be on jwhitney's side ;)
13:37 pdurbin jwhitney: the last thing I want to talk about is the FIXMEs at https://github.com/IQSS/dvn/blob/cdb2ceb61ec661929eabe84adadbda04ae047d14/tools/scripts/data-deposit-api/atom-entry-study.xml
13:38 posixeleni jwhitney: good morning!! just wanted to get an idea as to how OJS would capture/send to us the overall metadata for a Dataverse study which would normally include "Author", "Title" (of the article), Date, etc
13:38 pdurbin studyRelPublication.getIdType() vs. getIdNumber() vs. getUrl()
13:39 jwhitney posixeleni: hi there! going through your message from yesterday
13:41 pdurbin posixeleni: are you talking about the FIXMEs too? or something else?
13:46 posixeleni jwhitney: i think i needed to get my coffee BEFORE I typed out that message... :/
13:46 jwhitney posixeleni: lol.
13:48 posixeleni Phil and I are working on perfecting the atom entry file here that includes all the overall metadata about the data which gets deposited into a Dataverse Study https://github.com/IQSS/dvn/blob/develop/tools/scripts/data-deposit-api/atom-entry-study.xml
13:49 jwhitney posixeleni: ok, let's start w/ the article citation  -- I have a slight preference for using dcterms:isReferencedBy
13:49 posixeleni just wanted to make sure I understood where all this info would be captured from your side (which definitely refers to the email I had written to everyone yesterday)
13:50 posixeleni jwhitney: i can definitely agree with that... i'm guessing Alex will be cool with this as well?
13:51 jwhitney posixeleni: well, let's find out. Will ask him to catch up here.
13:55 posixeleni jwhitney: thanks!
13:58 pdurbin jwhitney: I'm so glad you and posixeleni have opinions about which dcterms fields to use... Dublin Core is very new to me :)
13:59 jwhitney pdurbin: well, that was a fast call, but mainly based on the thought that a data file may be referenced by multiple studies / articles.
14:00 jwhitney pdurbin: to  me, isPartOf suggests that it belongs solely to one study / article.
14:03 jwhitney but no strong feelings either way.
14:06 pdurbin jwhitney: ok. do you agree with the sentiment that we need 4 dcterms fields? 1. The related publication citation 2. ID type (i.e. DOI, arXiv) 3. ID number (i.e. DOI number) 4. ID URL (i.e. http://dx.doi.org/10.1093/rfs/hhm055 )
14:07 jwhitney pdurbin: yes, unless you want to generate #4 from 2 & 3
14:09 jwhitney pdurbin: although that could become unwieldy / limiting
14:10 pdurbin jwhitney: well, in practice I'm seeing many URLs in our database that are not on http://dx.doi.org ... stuff like http://www.mitpressjournals.org/doi/abs/10.1162/REST_a_00194 and http://onlinelibrary.wiley.com/doi/10.1111/j.1528-3585.2011.00436.x/abstract ... I don't know how I'd construct those URLs...
14:10 pdurbin the non-doi.org URLs I mean
14:10 jwhitney pdurbin: yes, I was thinking too narrowly
14:12 pdurbin jwhitney: related to this I had written this in an email: "I'm going to see if I can use a single dcterms field for DOI + number (separated by a colon) but separate fields was easier for me to get working quickly."
14:12 jwhitney pdurbin: so ok, 4 fields to get at the citation & permalink
14:12 pdurbin but now I'm not so sure about splitting on a colon, after talking to posixeleni this morning
14:13 pdurbin that is to say, I'm not sure I should hard code the split on a colon. i.e. DOI:0.1234/ABCD
14:14 pdurbin to separate the type (DOI vs. arXiv vs. ???) and the identifier (10.1234/ABCD)
14:14 pdurbin (sorry, I meant to put 10. in the i.e.)
14:16 pdurbin jwhitney: what do you think? I hope this makes sense. The thing I'm wondering is if we should try to use a single dcterms field to represent the type (agency) and the id... and assume they are always separated by a colon
14:18 jwhitney my preference is for separate fields, but not sure we can squeeze them out of dc
14:18 posixeleni pdurbin: my concern is that Dublin Core even with the extended DCMI terms is still too flat so it would be difficult to break the citation down into mutliple DC terms
14:18 posixeleni that would relate to each other
14:18 pdurbin yeah, I hear you. both of you
14:19 pdurbin well, maybe I'll work a bit on splitting on colon in the xslt stylesheet in case we need it
14:21 pdurbin speaking of flat... technically in DVN and DDI, you can have multiple of these related publications (relPubl). But I don't think we can support this in the atom-entry-study.xml file since dcterms is so flat
14:22 posixeleni we could have more than one citation like we have more than one author?
14:24 pdurbin posixeleni: you mean isPartOf? (which may become isReferencedBy if jwhitney gets her way.) The text of the OJS article citation? well, how do you later associate the type, id, and url if there is more than one citation?
14:25 pdurbin right now I have it coded such that the 4 fields all go into the same "related publication"
14:25 pdurbin jwhitney: have you seen how this looks on the DVN side?
14:26 jwhitney pdurbin: not since last week
14:26 posixeleni pdurbin: yeah i see what you mean... all this info would have to be smooshed into a isPartOf or isReferencedBy field
14:29 pdurbin jwhitney: here's how it looks. See "Publications" which has a Link? http://dvn-build.hmdc.harvard.edu/dvn/dv/pdurbin1/faces/study/StudyPage.xhtml?globalId=hdl:TEST/10013
14:30 pdurbin that's the end goal, basically
14:30 pdurbin populate those 4 fields
14:30 pdurbin especially the link, I'd say, since it's pretty fundamental to how we're trying to link data and articles: http://projects.iq.harvard.edu/files/styles/os_files_xlarge/public/ojs-dvn/files/ojs-dvn-integration.png
14:32 jwhitney pdurbin: yes, sorry -- familiar w/ that, thought you meant changes to the atom entry \
14:34 pdurbin jwhitney: well, the atom entry is what we'll use to populate those fields
14:34 pdurbin the 4 fields under "Publications" on the DVN side
14:37 pdurbin the 4 fields in the atom entry get added to those fields under "Publications" (relPubl) on the DVN side with this XML stylesheet transformation: https://github.com/IQSS/dvn/blob/87210844e1292b51e31ee4a66b563fd23a294964/working_directory/dcmi_terms2ddi.xsl#L128
14:38 pdurbin anyway, implemention details. but that's how we're using the atom entry
14:38 jwhitney pdurbin: right, ok
14:40 pdurbin obviously, we want to stop using dcterms:FIXME1 and the like :)
14:41 jwhitney posixeleni: my dc knowledge fairly basic -- can we use qualifiers to distinguish repeated elements?
14:45 posixeleni jwhitney: given the fields we want to qualify within isReferencedBy or isPartOf I dont see what we could use as qualifiers: http://dublincore.org/documents/usageguide/qualifiers.shtml
14:48 * posixeleni scratches head in metadata bewilderment
14:50 posixeleni jwhitney: let me know if you see anything!
14:51 jwhitney posixeleni: no, I'm out of my depth, here.
14:54 posixeleni we may very well have hit the granularity ceiling as far as DC is concerned
15:35 posixeleni jwhitney: OK so what if we put the whole journal article publication citation flattened into a single isReferencedBy field? Would the structure be consistent enough coming from OJS that Phil could parse it on our end into the separate fields for Dataverse?
15:35 posixeleni hope that made sense!
15:37 pdurbin bleh. would be no fun to split a single field into 4 in the xml stylesheet but I guess I can try
15:39 posixeleni For example I'm looking at a citation (APA) from a Co-Action journal in OJS: Kostoglou, V., & Siakas, E. (2012). Investigating higher education graduates’ entrepreneurship in Greece. Annals Of Innovation & Entrepreneurship, 3. doi:10.3402/aie.v3i0.17291
15:40 posixeleni Would be nice to have a hyperlink included for people to go from Dataverse back to the OJS article
15:42 posixeleni pdurbin: but perhaps we can compromise and have both custom-XML to allow for more granularity with the journal article publication citation, and a less granular DC option available
15:42 pdurbin posixeleni: I think a hyperlink is a must... as discussed earlier it would be hard (impossible?) to construct non-doi.org URLs
15:43 pdurbin posixeleni: by custom-XML you mean this, bascially (I think): "The client MAY add any other metadata formats or foreign markup to the atom:entry element" -- http://swordapp.github.io/SWORDv2-Profile/SWORDProfile.html
15:43 pdurbin but that's kinda a whole can of worms
15:44 pdurbin jwhitney: is it even easy for you to include foreign markup when using the PHP SWORD library?
15:45 pdurbin and what would the namespace be for the foreign markup? http://purl.org/net/dvn ? http://purl.org/net/ojs ? I don't think these exist...
15:45 pdurbin xmlns I mean
15:55 jwhitney pdurbin: I could include custom XML although I'd have to write a new packager to allow it
15:55 pdurbin jwhitney: hmm. ok. and Alex just replied via email
15:55 * pdurbin reads
15:55 jwhitney pdurbin: I'm sure there's a schema that exists already that would allow the citation to be included in parsable fields, without creating custom XML
15:56 pdurbin jwhitney: oh, cool!
15:57 posixeleni FYI Alex also just responded to my email but I'm gonna grab lunch
15:58 * pdurbin needs lunch
15:59 jwhitney * jwhitney has more packing / shipping to do but plans to be around for a couple more hours this afternoon
16:29 pdurbin so... I like Alex's idea a lot. gonna show posixeleni when she gets back
17:19 axfelix joined #dvn
17:27 pdurbin axfelix: welcome!
17:27 pdurbin posixeleni and I are trying out your idea (in code)
17:29 pdurbin axfelix: I like your focus on the URL... the link back to the OJS study... it's really the most important thing
17:30 pdurbin but if that's the only dcterms field available to me, I can only populate the "Link" field under "Publications" on the DVN side
17:31 pdurbin unless... I also store the link in the field for the citation for the Publication, like this: http://dvn-build.hmdc.harvard.edu/dvn/dv/pdurbin1/faces/study/StudyPage.xhtml?globalId=hdl:TEST/10013
17:32 pdurbin because if I don't populate that citation field, all you see is "Link" ... i.e. you would have to hover your mouse over "Link" to see the doi
17:57 axfelix so the issue is basically whether we expect users to need to see the DOI in non-URL form
18:00 pdurbin well...
18:01 pdurbin axfelix: what you should see is "Link" and above it a non-clickable URL
18:01 axfelix right
18:02 pdurbin that non-clickable URL is *actually* the citation field. I'm abusing it by shoving a URL in there... because I have to put *something* in there
18:02 axfelix ahhhh, okay, I follow now
18:02 pdurbin if you don't (in the DVN GUI), you get "Publication citation is required if other publication data is entered"
18:02 pdurbin ideally, we have a second dcterms field with the actual citation
18:03 axfelix but this is separate from the existing FIXMEs, which only handled URIs
18:05 pdurbin well, the FIXMEs were written with the idea of having 4 dcterms fields to populate the 4 fields on the DVN side
18:07 pdurbin axfelix: instead of a non-clickable URL I could put something like "[Imported via Data Deposit API]"
18:07 pdurbin but that's pretty ugly
18:08 pdurbin I'd rather have the actual citation of the journal article
18:09 axfelix are we actually out of dcterms fields on the DVN side at this point, such that we'd have to try to bring in another XSD from somewhere for the sake of the web form?
18:09 axfelix because if so we could /try/ resolving the citation string from the DOI the way I indicated in my email, but the crossref guys claim that isn't widely deployed enough
18:16 pdurbin axfelix: well, originally I was hoping we could have 4 dcterms fields. so maybe we are out? I'm not sure
18:19 pdurbin axfelix: I think it's a clever hack to be able to resolve a doi on the fly via a third party website, but I really like having all the info I need in the XML file. Then I run `xsltproc dcmi_terms2ddi.xsl atom-entry-study.xml` to transform all the Dublin Core to DDI. I don't have to reach out to the internet
18:20 axfelix yeah, you're probably right
18:20 axfelix I need to review which fields we have left and could reasonably use here
18:20 pdurbin awsome, that would be very helpful
18:20 axfelix should have something for you by EOD
18:21 pdurbin great
18:21 axfelix need to write a ddi-dc transform for an unrelated project in the meantime...
18:21 pdurbin :)
18:21 pdurbin so many transforms
18:21 axfelix it's a bit silly, but if nothing else it's a nice low-to-medium-complexity task to keep your brain humming on
18:22 pdurbin axfelix: so on a related note...
18:23 pdurbin the nice thing about using a single dcterms field is that I can (properly) allow multiple relPubl "related publications" in DDI and in DVN
18:25 pdurbin I'm in favor of using multiple dcterms fields (at minimum, one for the journal article citation and one for the URL) but it means that I can only create one related publication on the DDI/DVN side
18:25 pdurbin because dcterms is so flat
18:26 pdurbin it's like how I can specify multiple creators but I can't specify a URL for each creator. or a Twitter handle for each creator
18:30 axfelix so that would be a point in favour of us basically just expanding a given dcterms field however we want
18:30 axfelix which is certainly not uncommon, even if it's not technically great practice
18:31 pdurbin axfelix: neither you nor I are fans of spliting apart strings ;)
18:31 axfelix yeah, at that point, I think it's more realistic to chuck some MODS in there
18:32 pdurbin dunno what MODS is
18:32 axfelix which complicates slightly but we are probably the hundred-thousandth people to have had this conversation :)
18:32 axfelix it's effectively a "successor" to DC, LOC-backed, expressed in XML, superset of DC terms
18:32 pdurbin ah
18:32 axfelix DC is still a good lowest common denominator
18:33 pdurbin right. hence why it's in SWORD
18:34 pdurbin axfelix: if we use a single dcterms field, could it always have a URL in it? I definitely agree with you that the URL is the most important part
18:35 posixeleni hey folks in a meeting but just thought of this: http://wiki.dublincore.org/index.php/User_Guide#Dublin_Core_and_Linked_Data
18:35 pdurbin (not that I'm excited about trying to figure out how to extract a URL with an XML stylesheet)
18:37 pdurbin posixeleni: sorry, I'm not sure what you mean
18:39 posixeleni pdurbin: sorry just thought this part of the DC user guides related to our dilemma with linking to resources (data or journal article)
18:39 pdurbin oooooo
18:40 pdurbin posixeleni: you have given me an idea though!
18:40 pdurbin are we allowed to do something like this? <dcterms:isReferencedBy href="http://dx.doi.org/10.1038/dvn333">Peets, J., &amp; Stumptown, J. (2013). Roasting at Home. New England Journal of Coffee, 3(1), 22-34.</dcterms:isReferencedBy>
18:40 pdurbin i.e. stick an href in there?
18:41 posixeleni ooooh !
18:41 pdurbin because it totally works with xsltproc
18:41 posixeleni axfelix: how bout them apples?
18:41 posixeleni dont see why not?
18:41 axfelix huh, I've never actually seen that before, but I like it
18:41 pdurbin posixeleni: forget your meeting. come check this out :)
18:41 axfelix and it is fairly xsl-friendly
18:42 pdurbin axfelix: maybe we could stick the other fields in there too? type and number
18:42 posixeleni pdurbin: almost done w/ the meeting
18:42 axfelix yeah, go for it
18:43 axfelix I don't have a better idea and this is fairly easy to work with
18:43 posixeleni dc likes things flat
18:43 axfelix and I've seen worse violations of dc :)
18:43 pdurbin it's just crazy enough to work
18:47 posixeleni i like having the actual citation in there to provide context into what the URL/DOI is pointing to
18:47 pdurbin I changed href to holdingsURI and added the other two: <dcterms:isReferencedBy holdingsURI="http://dx.doi.org/10.1038/dvn333" agency="DOI" IDNo="10.1038/dvn333">Peets, J., &amp; Stumptown, J. (2013). Roasting at Home. New England Journal of Coffee, 3(1), 22-34.</dcterms:isReferencedBy>
18:47 pdurbin I guess it can be whatever we want
18:53 pdurbin axfelix: but will it be easy for Jen to add those attributes?
18:53 pdurbin man I hope so
18:53 axfelix that's why I wanted to see which fields we still had "remaining" to play with since I know OJS doesn't fully use DC as is
18:54 axfelix but regardless I think the href solution will be usable
18:54 pdurbin yeah. the attributes solution
18:56 pdurbin axfelix: so this is how it looks with all 4 fields populated on the DVN side: http://dvn-build.hmdc.harvard.edu/dvn/dv/pdurbin1/faces/study/StudyPage.xhtml?globalId=hdl:TEST/10014
18:57 pdurbin axfelix: as you can see, agency and IDNo are displayed together as DOI:10.1038/dvn333 (with "ID:" in front of it)
18:57 pdurbin but what looks nice is the full citation to the journal article
18:57 pdurbin this: http://projects.iq.harvard.edu/files/styles/os_files_xlarge/public/ojs-dvn/files/ojs-dvn-integration.png :)
18:58 axfelix A+++ great shipping
18:58 axfelix just need to make sure we can accommodate it on the OJS side
18:58 pdurbin hope so
19:09 jwhitney joined #dvn
19:28 jwhitney hi all, catching up w/ your earlier discussion.
19:29 pdurbin jwhitney: it was getting scary for a while... ;)
19:29 jwhitney very cool & easy to manage, I think, as long as everyone's cool with a bit of customization of the swordappv2 lib
19:29 pdurbin ah, I was afraid you might have to modify the php sword library
19:30 jwhitney Subclass existing or create new packager, so minor changes, but does up the code mgmt a bit
19:30 pdurbin jwhitney: if it helps, I've had to modify the java SWORD library: generalise getName/getFilename into getContentDispositionValue by pdurbin · Pull Request #3 · swordapp/JavaServer2.0 - https://github.com/swordapp/JavaServer2.0/pull/3 :)
19:32 pdurbin jwhitney: we should attempt to modify our python client ( https://github.com/dvn/swordpoc/tree/master/dvn_client ) and see if we have to make modifications to the sword python library
19:33 pdurbin we (royal we) have a pull request in for the python sword client library already as well: Bug Fixes and Updates for DVN Consumption by pjbull · Pull Request #7 · swordapp/python-client-sword2 - https://github.com/swordapp/python-client-sword2/pull/7
19:33 pdurbin (our python client doesn't run without those changes)
19:34 jwhitney pdurbin: reading through...
19:34 pdurbin jwhitney: so are you ok with having to modify the php library? sounds like kind of a pain
19:34 pdurbin I'm trying to decide what the "right" way to do this is...
19:35 pdurbin adding attributes to dcterms:isReferencedBy is working great on my side. and with my testing via curl
19:35 pdurbin but I'm open to other ideas
19:35 pdurbin I *did* go ahead and commit the change on the DVN side a few minutes ago: dcterms:isReferencedBy (and attributes) for relPubl · fe00364 · IQSS/dvn - https://github.com/IQSS/dvn/commit/fe00364
19:36 pdurbin so here's the latest and greatest atom entry: https://github.com/IQSS/dvn/blob/fe00364d7971763556615363576bdd2becd31cfd/tools/scripts/data-deposit-api/atom-entry-study.xml
19:36 jwhitney pdurbin: I think it's a great solution. An (admittedly speedy) review of the two-step packager leads me to think the changes would be limited to passing in an optional array of attributes to the two-step packager's addMetadata() method
19:36 pdurbin jwhitney: ok. well, please keep us posted. I hope it isn't too bad
19:37 pdurbin posixeleni and I have been talking about this all day. we're pretty excited about the attributes solution
19:37 jwhitney pdurbin: you should be!
19:37 pdurbin heh
19:37 pdurbin :)
19:38 pdurbin but it's not too late to do this a more "right" way if there is one
19:38 pdurbin though we do want to release soon... basically I want to be done done on Thursday
19:39 pdurbin jwhitney: so I should ask if there's anything else you need!
19:40 pdurbin axfelix: you too. anything you need?
19:40 jwhitney pdurbin: no, I'm pretty clear that I have what I need,
19:40 pdurbin great. but...
19:41 pdurbin what about the bibliographicCitation as a single string? do you want separate fields for some of that?
19:42 pdurbin since we've been talking all day about dublin core anyway ;)
19:50 jwhitney pdurbin: as long as the study global ID comes back in the atom entry, the plaintext citation is fine
19:53 pdurbin jwhitney: well, it doesn't appear alone, if that's what you mean. the global ID appears many times but always in a URL (handle URL, edit-media URL, statement URL)
19:53 jwhitney pdurbin: yes, of course. Sorry, long day.
19:54 jwhitney pdurbin: last conversation re: data citation in OJS was the plain text string was fine, and I can create / link a handle if wanted.
19:57 pdurbin jwhitney: well, posixeleni and I were talking about exposing the persistent URL for a study anyway
19:58 pdurbin posixeleni: i think we talked about maybe using dcterms:identifier for this? on the import side we use just a global id there, not the persistent URL...
20:01 pdurbin well, anyway, easy to add either a global id or a persisent url if someone tells me which dcterms field to use
20:01 jwhitney pdurbin: if a persistent URL is available, I'll definitely use it
20:03 pdurbin jwhitney: ok, I'm just not sure if I should use dcterms:identifier to expose it since we typically use that field to mean global id, not persistent URL
20:04 jwhitney pdurbin: ok.
20:05 pdurbin jwhitney: but I don't want to make you have to grovel around in the bibliographicCitation for the persistent URL if there's a good dcterms field I can use
20:05 * pdurbin looks at: SWORDv2 GET examples from dash.harvard.edu (DSpace) - Google Drive - https://docs.google.com/document/d/1vgLGhsuF-5W_KQ_h0tCEvIBP2ah0L6NRckk698ZeGCY/edit?pli=1
20:09 pdurbin hmm, depositReceipt.setSplashUri​(study.getPersistentURL()) might work... puts this in the entry: <link href='http://hdl.handle.net/hdl:TEST/12345' rel='alternate'/>
20:11 pdurbin deposit receipt... MAY contain an atom:link@rel="alternate" element with an href attribute which points to the splash page of the item on the server
20:12 pdurbin I don't know what a splash page is in this context, but I think this could work
20:13 pdurbin especially with a word like "alternate" ... the non-SWORD URL
20:17 pdurbin jwhitney: ok, added. the Xpath query you'd use is here: display persistent URL via SWORD as "alternate" · 9035b61 · IQSS/dvn - https://github.com/IQSS/dvn/commit/9035b61
20:18 pdurbin persistent_uri = XPath.first(entry,"//link[@rel='​alternate']").attributes["href"]
20:18 jwhitney pdurbin: excellent.
20:18 pdurbin jwhitney: should work for your needs. if not, please let me know!
20:20 jwhitney pdurbin: thank you. much more direct than grabbing / formatting handle refs
20:22 pdurbin jwhitney: no problem. just added it to http://devguide.thedata.org/features/api/data-deposit/v1 too
20:23 pdurbin oh, that reminds me, the last updated links at the bottom of the devguide work now
20:23 pdurbin anyhoo. my turn to pick up kids today. bye all!
20:24 jwhitney pdurbin: bye!

| Channels | #dvn index | Today | | Search | Google Search | Plain-Text | plain, newest first | summary

We've moved! Please join #dataverse instead. The new logs are at http://irclog.iq.harvard.edu/dataverse/today