Skip to content

Lesson: Add pages to a bibliographic work

E. Lynette Rayle edited this page Sep 23, 2015 · 20 revisions

Goals

  • Set up Models to represent relationships between different types of objects
  • Create and modify relationships between objects

Explanation

Now that we have created a model for BibliographicWorks, we will create a model for pages in the bibliographic resource.

Steps

Step 1: Define a Page object model

Next we're going to add a model for our pages. Open app/models/page_file.rb and add this content:

class PageFile < ActiveFedora::Base
  include Hydra::Works::GenericFileBehavior    
  property :page_number, predicate: ::RDF::URI.new('http://opaquenamespace.org/hydra/pageNumber'), multiple: false do |index|
    index.as :stored_searchable
    index.type :integer
  end  
  property :text, predicate: ::RDF::URI.new('http://opaquenamespace.org/hydra/pageText'), multiple: false do |index|
    index.as :stored_searchable
  end
end

This is very similar to how our RDF-based BibliographicFile class looks, we're just adding different attributes defined by different predicates. I.E. a bibliographic resource has an author and a title, while a Page has a page_number and text.

ASIDE #1: It's generally best practice to use well-known RDF predicates when defining metadata terms, e.g. using DC.title for your title term. In some cases, however, RDF ontologies don't exist to express the concepts we want to track. Surprisingly, page numbers are in this group; therefore, we have used a predicate registry created by the community to define our page number predicate: http://opaquenamespace.org/hydra/pageNumber.

ASIDE #2: We are explicitly assigning a type for the page number index. This lets us control our indexing and searching to use integer sorting (e.g. 1,2,3,4,...9,10,11) instead of the default lexical sorting use for strings (e.g. '1','10','11','2','21','27','5','6','60'). By default, Hydra and Active Fedora assume all metadata are text values. You can find additional information on the available indexing options in the Solrizer ReadME and in the solrizer code that defines the default indexing strategies

Step 2: In the Console, add a page to a bibliographic resource

Pre-PCDM and Hydra-Works, you had to explicitly define a relationship between the bibliographic resource and it's page. In Hydra-Works, any generic file can be a member of any generic work. To add a page to a work, you simply create an instance of the PageFile generic file and add it as a member of the BibliographicWork instance.

Start the rails console and let's create a page.

pf1 = PageFile.new("page-1")
=> #<PageFile id: "page-1", page_number: nil, text: nil, head_id: nil, tail_id: nil>

pf1.page_number = 1
=> 1

pf1.text = "Once upon a midnight dreary..."
=> "Once upon a midnight dreary..."

pf1.save
=> true

And add it to the bibliographic resource work we created previously.

bw = BibliographicWork.find("work-1")
=> #<BibliographicWork id: "work-1", title: "The Raven", author: "Poe, Edgar Allan", abstract: "A lonely man tries to ease his 'sorrow for the lost Lenore', by distracting his mind with old books of 'forgotten lore'.">

bw.generic_files << pf1
=> [#<BibliographicFile id: "gfile-1", title: "The Raven pdf", head_id: nil, tail_id: nil>, #<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>]

For fun, let's create another page so we can have more than one page to play around with.

pf2 = PageFile.new("page-2")
=> #<PageFile id: "page-2", page_number: nil, text: nil, head_id: nil, tail_id: nil>

pf2.page_number = 2
=> 2

pf2.text = "The End"
=> "The End"

pf2.save
=> true

bw.generic_files << pf2
=> [#<BibliographicFile id: "gfile-1", title: "The Raven pdf", head_id: nil, tail_id: nil>, #<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>, #<PageFile id: "page-2", page_number: 2, text: "The End", head_id: nil, tail_id: nil>]

Step 3: In the Console, find all the pages

Now that we have pages, how can we get a list of all pages?

bw.generic_files
=> [#<BibliographicFile id: "gfile-1", title: "The Raven pdf", head_id: nil, tail_id: nil>, #<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>, #<PageFile id: "page-2", page_number: 2, text: "The End", head_id: nil, tail_id: nil>]

Hmmm, that returns the bibliographic resource file and all the page files. Can I make it just return page files?

Let's add a filter to the BibliographicWork that understands pages. Add the filters_association line between the include of GenericWorkBehavior and the property statements such that BibliographicWork model at app/models/bibliographic_work.rb now looks like...

class BibliographicWork < ActiveFedora::Base
  include Hydra::Works::GenericWorkBehavior
  filters_association :members, as: :pages, condition: :page?
  property :title, predicate: ::RDF::DC.title, multiple: false
  property :author, predicate: ::RDF::DC.creator, multiple: false
  property :abstract, predicate: ::RDF::DC.abstract, multiple: false
end

For the filter to find pages, we need to define the condition method page? in each of the generic file classes.

Add the following method to the end of app/models/page_file.rb to identify this model as a page.

def page?
  true
end

At this time, it is also necessary to add the same method to all generic files that can be members of BibliographicWork. So also add this method to app/models/bibliographic_file.rb. NOTE: In bibliographic_file.rb, the method returns false to identify that this class is NOT a page.

def page?
  false
end

Now, we can request just pages to be returned.

bw = BibliographicWork.find("work-1")
=> #<BibliographicWork id: "work-1", title: "The Raven", author: "Poe, Edgar Allan", abstract: "A lonely man tries to ease his 'sorrow for the lost Lenore', by distracting his mind with old books of 'forgotten lore'.">
 

bw.pages
=> [#<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>, #<PageFile id: "page-2", page_number: 2, text: "The End", head_id: nil, tail_id: nil>]

Step 4: How does this look in Fedora?

Let's look at the RDF that active-fedora uses to represent these relationships. To see that content, either output it on the command line like this:

The page file has the following properties in Fedora...

screen shot 2015-09-12

The updated bibliographic resource work with two pages has the following properties in Fedora...

screen shot 2015-09-12

Step 5: Commit your changes

Now that we've added page relationships, it's a great time to commit to git:

git add .
git commit -m "Created a page file model with relationship to the bibliographic resource work model"

Next Step

Go on to BONUS Lesson: Add attached files or explore other [Dive into Hydra-Works](Dive into Hydra-Works#Bonus) tutorial bonus lessons.