-
Notifications
You must be signed in to change notification settings - Fork 14
Lesson: Add pages to a bibliographic work
- Set up Models to represent relationships between different types of objects
- Create and modify relationships between objects
Now that we have created a model for BibliographicWorks, we will create a model for pages in the bibliographic resource.
Next we're going to add a model for our pages. Open app/models/page_file.rb
and add this content:
class PageFile < ActiveFedora::Base
include Hydra::Works::GenericFileBehavior
property :page_number, predicate: ::RDF::URI.new('http://opaquenamespace.org/hydra/pageNumber'), multiple: false do |index|
index.as :stored_searchable
index.type :integer
end
property :text, predicate: ::RDF::URI.new('http://opaquenamespace.org/hydra/pageText'), multiple: false do |index|
index.as :stored_searchable
end
end
This is very similar to how our RDF-based BibliographicFile class looks, we're just adding different attributes defined by different predicates. I.E. a bibliographic resource has an author and a title, while a Page has a page_number and text.
NOTE: We are adding the text property here to hold the text of the page. In the next lesson, we will see how to attach a file that could be a pdf, jpg, or some other format to serve as the text for the page.
ASIDE #1: It's generally best practice to use well-known RDF predicates when defining metadata terms, e.g. using DC.title for your title term. In some cases, however, RDF ontologies don't exist to express the concepts we want to track. Surprisingly, page numbers are in this group; therefore, we have used a predicate registry created by the community to define our page number predicate: http://opaquenamespace.org/hydra/pageNumber.
ASIDE #2: We are explicitly assigning a type for the page number index. This lets us control our indexing and searching to use integer sorting (e.g. 1,2,3,4,...9,10,11) instead of the default lexical sorting use for strings (e.g. '1','10','11','2','21','27','5','6','60'). By default, Hydra and Active Fedora assume all metadata are text values. You can find additional information on the available indexing options in the Solrizer ReadME and in the solrizer code that defines the default indexing strategies
Pre-PCDM and Hydra-Works, you had to explicitly define a relationship between the bibliographic resource and it's page. In Hydra-Works, any generic file can be a member of any generic work. To add a page to a work, you simply create an instance of the PageFile generic file and add it as a member of the BibliographicWork instance.
Start the rails console
and let's create a page.
pf1 = PageFile.new("page-1")
=> #<PageFile id: "page-1", page_number: nil, text: nil, head_id: nil, tail_id: nil>
pf1.page_number = 1
=> 1
pf1.text = "Once upon a midnight dreary..."
=> "Once upon a midnight dreary..."
pf1.save
=> true
And add it to the bibliographic resource work we created previously.
bw = BibliographicWork.find("work-1")
=> #<BibliographicWork id: "work-1", title: "The Raven", author: "Poe, Edgar Allan", abstract: "A lonely man tries to ease his 'sorrow for the lost Lenore', by distracting his mind with old books of 'forgotten lore'.">
bw.generic_files << pf1
=> [#<BibliographicFile id: "gfile-1", title: "The Raven pdf", head_id: nil, tail_id: nil>, #<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>]
For fun, let's create another page so we can have more than one page to play around with.
pf2 = PageFile.new("page-2")
=> #<PageFile id: "page-2", page_number: nil, text: nil, head_id: nil, tail_id: nil>
pf2.page_number = 2
=> 2
pf2.text = "The End"
=> "The End"
pf2.save
=> true
bw.generic_files << pf2
=> [#<BibliographicFile id: "gfile-1", title: "The Raven pdf", head_id: nil, tail_id: nil>, #<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>, #<PageFile id: "page-2", page_number: 2, text: "The End", head_id: nil, tail_id: nil>]
There are two more pages for The Raven. You can add those now if you want.
Now that we have pages, how can we get a list of all pages?
bw.generic_files
=> [#<BibliographicFile id: "gfile-1", title: "The Raven pdf", head_id: nil, tail_id: nil>, #<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>, #<PageFile id: "page-2", page_number: 2, text: "The End", head_id: nil, tail_id: nil>]
Hmmm, that returns the bibliographic resource file and all the page files. Can I make it just return page files?
Let's add a filter to the BibliographicWork that understands pages. Add the filters_association line between the include of GenericWorkBehavior and the property statements.
filters_association :members, as: :pages, condition: :page?
such that BibliographicWork model at app/models/bibliographic_work.rb
now looks like...
class BibliographicWork < ActiveFedora::Base
include Hydra::Works::GenericWorkBehavior
filters_association :members, as: :pages, condition: :page?
property :title, predicate: ::RDF::DC.title, multiple: false do |index|
index.as :stored_searchable
end
property :author, predicate: ::RDF::DC.creator, multiple: false do |index|
index.as :stored_searchable
end
property :abstract, predicate: ::RDF::DC.abstract, multiple: false do |index|
index.as :stored_searchable
end
end
For the filter to find pages, we need to define the condition method page?
in each of the generic file classes.
Add the following method to the end of app/models/page_file.rb
to identify this model as a page.
def page?
true
end
At this time, it is also necessary to add the same method to all generic files that can be members of BibliographicWork. So also add this method to app/models/bibliographic_file.rb
. NOTE: In bibliographic_file.rb
, the method returns false to identify that this class is NOT a page.
def page?
false
end
Restart rails console. Now, we can request just pages to be returned.
bw = BibliographicWork.find("work-1")
=> #<BibliographicWork id: "work-1", title: "The Raven", author: "Poe, Edgar Allan", abstract: "A lonely man tries to ease his 'sorrow for the lost Lenore', by distracting his mind with old books of 'forgotten lore'.">
bw.pages
=> [#<PageFile id: "page-1", page_number: 1, text: "Once upon a midnight dreary...", head_id: nil, tail_id: nil>, #<PageFile id: "page-2", page_number: 2, text: "The End", head_id: nil, tail_id: nil>]
Let's look at the RDF that active-fedora uses to represent these relationships. View the page file in Fedora at: http://127.0.0.1:8983/fedora/rest/dev/page-1
The page file has the following properties in Fedora...
View the update bibliographic work in Fedora at: http://127.0.0.1:8983/fedora/rest/dev/work-1 The updated bibliographic resource work with two pages has the following properties in Fedora...
Now that we've added page relationships, it's a great time to commit to git:
git add .
git commit -m "Created a page file model with relationship to the bibliographic resource work model"
Go on to BONUS Lesson: Add attached files or explore other [Dive into Hydra-Works](Dive into Hydra-Works#Bonus) tutorial bonus lessons.
- Dive into Hydra-Works
- Lesson: Generate a Rails Application
- Lesson: Add the Hydra Dependencies
- Lesson: Start FCRepo and Solr
- Lesson: Start the Application & Search for Results
- Lesson: Define models with Hydra-Works
- Lesson: Create instances of Hydra-Works models
- Lesson: Explore Objects in Fedora and Solr
- Lesson: Make Blacklight Return Search Results
- BONUS Lessons
- Aggregations API Documentation
- Application Profile