Most useful applications interact with data in some form. There are multiple solutions for storing data and for each of those solutions, sometimes multiple formats in which the data can be stored. When using object-oriented PHP, that same data is stored, modified and accessed in a class.
Let's assume that you were creating your own blogging system. We'll assume initially that you have need want to create posts and you want to allow multiple users to author articles.
Storing the data in a hierarchical format with XML is fairly straightforward. Each 'user' is represented by a node named 'user' with a child 'contents' node to contain the user's blog posts.
<xml version="1.0>
<user id="1" first="Bobby" last="Mcintire" email="bobby@gacela.com" />
<user id="2" first="Frankfurt" last="McGee" email="sweetcheeks@gacela.com">
<contents>
<content id="id" title="Beginners Guide to ORMs" published="2013-05-22 15:31:00">
In order to start, you need to read the rest of this user's guide.
</content>
</contents>
</user>
</xml>
With a relational database, we would create two tables, one to hold the basic information about each user, and a table to hold their posts.
'users' table
id | name | |
---|---|---|
1 | Bobby Mcintire | bobby@gacela.com |
2 | Frankfurt McGee | sweetcheeks@gacela.com |
'contents' table
id | userId | title | content | published |
---|---|---|---|---|
1 | 2 | Beginners Guide to ORMs | Read the rest of the guide | 2013-05-22 15:31:00 |
The same data in PHP would be stored in classes like so:
class User {
protected $data = [
'id' => 1,
'name' => 'Bobby Mcintire',
'email' => 'bobby@kacela.com'
];
protected $contents = [];
}
class User {
protected $data = [
'id' => 2,
'name' => 'Frankfurt McGee',
'email' => 'sweetcheeks@kacela.com',
'phone' => '9876543214'
];
protected $contents = [
[
'id' => 1,
'userId' => 2,
'title' => 'Beginners Guide to ORMs',
'content' => 'Read this guide all the way to the end'
]
];
}
As you can see the way that data is stored can be vastly different from the way that we interact with data in our application code.
This is called the object-impedance mismatch. A common design pattern has arisen to hide the complexities of the differences between data in application code and data stores called Object-Relational Mapping.
This design pattern was developed specifically to deal with the complexities of mapping relational database records to objects in code, but many of the same principles apply when dealing with any form of raw data because there is almost always some mismatch.
The most common approach to Object-Relational Mapping, or ORM for short, is the Active Record pattern.
With Active Record, one class instance represents one Record from the Data Source. With an Active Record instance, business logic and data access logic are contained in a single object. A basic Active Record class would look like so:
class Model_User extends ORM
{
}
And would be accessed like so:
$user = ORM::find('User', 1);
// echo's Bobby Mcintire to the screen
echo $user->name;
$user->email = 'new.user@gacela.com'
$user->save();
Working with a Data Mapper for the first time can be quite a bit more difficult than working with a more basic approach like Active Record, but Gacela offers large dividends if you tackle the complexity upfront. When developing Gacela, the following were just the top features we thought every ORM should have:
- Automatically discover relationships between classes and rules about the data contained within classes.
- Separate data store activities from business logic activities so that our classes have a single responsibility.
- Defaults that are easy to set up and use, but can handle complex mapping between business objects and the underlying data store.
Gacela can be installed with Composer.
Define the following requirement in your composer.json file:
{
"require": {
"energylab/gacela": "dev-develop"
}
}
Gacela assumes that in any given application there will be multiple sources of data, even if there just multiple databases that need to be used.
Currently there are two supported types of Data Sources for Gacela: Database & Salesforce. We plan to add support for Xml, RESTful Web Services, and SOAP Web Services as well as to fully support the differences between MySQL, MSSQL, Postgres, and SQLlite.
Gacela provides a convenience method to create a DataSource object from configuration parameters. Once a DataSource object is created, it is easily registered with the Gacela instance so that it is available from anywhere.
$source = \Gacela\Gacela::createDataSource(
[
'name' => 'db',
'type' => 'mysql' // Other types would be mssql, postgres, sqllite
'host' => 'localhost',
'user' => 'gacela',
'password' => '',
'schema' => 'gacela'
]
);
\Gacela\Gacela::instance()->registerDataSource($source);
The default for Gacela is to name the database tables in the plural form (users, contents). Though this can be easily overridden. We'll look at an example of how to override table name later.
For example:
CREATE TABLE users (
id INT UNSIGNED NOT NULL PRIMARY KEY AUTO_INCREMENT,
`name` VARCHAR(150) NOT NULL,
`email` VARCHAR(200) NOT NULL
) ENGINE=Innodb;
CREATE TABLE contents (
id INT UNSIGNED NOT NULL PRIMARY KEY AUTO_INCREMENT,
userId INT UNSIGNED NOT NULL,
title VARCHAR(255) NOT NULL,
content LONGTEXT NOT NULL,
published TIMESTAMP NULL,
CONSTRAINT `fk_user_contents` FOREIGN KEY (userId) REFERENCES users(id) ON DELETE RESTRICT
) ENGINE=Innodb;
$source = \Gacela\Gacela::createDataSource(
[
'name' => 'salesforce',
'type' => 'salesforce',
'soapclient_path' => MODPATH.'sf/vendor/soapclient/',
/**
* Specify the full path to your wsdl file for Salesforce
*/
'wsdl_path' => APPPATH.'config/sf.wsdl',
'username' => 'salesforceuser@domain.com.sandbox',
'password' => 'SecretPasswordWithSalesforceHash',
/**
* Specifies which Salesforce objects are available for use in Gacela
*/
'objects' => []
]
);
Gacela contains its own autoloader and registers it when the Gacela instance is constructed. Gacela also registers its own namespace for its use. You will want to register a custom namespace for your application even if you only plan on creating Mappers and Models for your project.
/*
* Assuming that you are bootstrapping from the root of your project and that you want to put your
* custom application code in an app directory
*/
\Gacela\Gacela::instance()->registerNamespace('App', __DIR__.'/app/');
/*
* A handy trick if you want to put your Mappers/Models or other custom extensions for Gacela in the global
* namespace
*/
\Gacela\Gacela::instance()->registerNamespace('', __DIR__.'/app/');
With those two namespaces registered to the same directory, I could declare a new Mapper (User) like so:
/*
* __DIR__.'/app/Mapper/User.php'
*/
<?php
namespace Mapper;
class User extends \Gacela\Mapper\Mapper {}
Or alternatively like this:
/*
* __DIR__.'/app/Mapper/User.php'
*/
<?php
namespace App\Mapper;
use \Gacela\Mapper\Mapper as M;
class User extends M {}
Even more exciting is that Gacela allows for cascading namespaces so you can easily override default Gacela classes without having to modify the methods and classes that depend on the modified class. So lets say that you wanted to create a custom Model class where you could some default functionality for all of your models to use.
/*
* __DIR__.'/app/Model/Model.php'
*/
<?php
namespace Model;
class Model extends \Gacela\Model\Model {}
?> // This breaks a PSR standard but is shown here to clarify the end of the file
/*
* __DIR__'/app/Model/User.php'
*/
<?php
namespace Model;
class User extends Model {}
?>
Personally, I always extend the base Model and Mapper classes in projects if for no other reason than it simplifies my class declarations.
Gacela supports caching on two levels, the first is to cache the metadata that it uses to determine relationships, column data types, and such. The second is to cache requested data. In order to use either, caching must be enabled in the Gacela instance.
Gacela will use any caching library that supports get(), set(), and increment()
$cache = new \Cache;
\Gacela\Gacela::instance()->enableCache($cache);
As we noted previously, there are two separate functions provided by any given ORM;
- Data Access
- Business Logic
Most ORM's mash these two responsibilities into a single class that will contain custom methods for dealing with business or application logic problems as well as custom methods for finding or saving data back to the database. Gacela takes a different approach in that it separates these two functions into two separate, distinct classes:
- Mappers (Data Access)
- Models (Business or Application Logic)
To get our basic application up and running, I will need the following files and class definitions:
/*
* Again assume that we have created an 'app' directory and registered it into the global namespace with Gacela.
*
* As I mentioned before, I prefer to always override the default Model and Mapper classes in my application so
* I will that first.
* app/Mapper/Mapper
*/
<?php
namespace Mapper;
class Mapper extends \Gacela\Mapper\Mapper {}
?>
/*
* app/Model/Model
*/
<?php
namespace Model;
class Model extends \Gacela\Model\Model {}
?>
/*
* app/Mapper/User
*/
<?php
namespace Mapper;
class User extends Mapper {}
?>
/*
* The underlying database table is named contents, but perhaps we decided after the fact that we'd rather
* use Post as the name by which we reference records in this table.
*
* app/Mapper/Post
*/
<?php
namespace Mapper;
class Post extends Mapper {
/*
* Easy peasy. To manually specify the underlying table or as we call it in Gacela, resource, name just set
* the $_resourceName property in the mapper. This also works great if your table names are singular rather than
* the default plural.
*/
protected $_resourceName = 'contents';
}
?>
/*
* app/Model/User
*/
<?php
namespace Model;
class User extends Model {}
?>
/*
* app/Model/Post
*/
<?php
namespace Model;
class Post extends Model {}
?>
Now we can load existing Users, create a new Post, or delete a User or Post.
/*
* You can also easily override the \Gacela\Gacela class by creating a shorthand class in the app/ directory
* that extends \Gacela\Gacela. To simplify calls, I like to create a extended class 'G'. Future examples all
* assume that this extended class exists.
*/
$user = \G::instance()->find('User', 1);
// echos Bobby Mcintire to the screen
echo $user->name;
$user->email = 'different@gacela.com'
// Saves the updated record to the database
$user->save();
/*
* The required argument when using new Model\Model() specifies the name of the Mapper to use from the Model.
*/
$post = new \Model\Post('Post');
$post->setData(
[
'userId' => 1,
'title' => 'A new blog post',
'content' => 'Lorem ipsum dolor fakey fake content'
]
);
/*
* Will output TRUE because the id column is assigned by the database engine at insert in our case.
*/
echo $post->validate();
$post->save(['title' => 'A better title']);
Right now you're probably thinking, "Wait! This looks almost EXACTLY like every other ORM I've ever used, where's the benefit in creating two files where I only created one before?"
So far all we've looked at is the most basic scenario - one database table with a mapper that presents simple, default find() and findAll() methods with a Model that doesn't have any custom business logic.
We'll explore custom Mapper functions first.
The name of the Model class associated to a Mapper class defaults to the same name as the Mapper class. This can be overriden:
<?php
namespace Mapper;
class Note extends Mapper {
protected $_modelName = 'Comment'
}
$bobby = \Gacela\Gacela::instance()->find('User', 1);
/*
* Outputs Bobby Mcintire
*/
echo $bobby->email;
/*
* The \Gacela\Criteria object allows users to specify simple rules for filtering, sorting and limiting data without all of the complexity of
* a full built-in query builder.
* \Gacela\Gacela::instance()->findAll() returns an instance of \Gacela\Collection\Arr
*/
$criteria = \Gacela\Gacela::criteria()
->equals('userId', 1);
\Gacela\Gacela::instance()->findAll('Post', $criteria);
<?php
namespace Mapper;
class User extends Mapper
{
/**
* Fetches a Collection of all users with no posts
*/
public function findUserWithNoPosts()
{
/**
* Mapper::_getQuery() returns a Query instance specific to the Mapper's data source.
* As such, the methods available for each Query instance will vary.
*/
$query = $this->_getQuery()
->from('users')
->join(array('p' => 'contents'), "users.id = p.userId, array(), 'left')
->where('p.published IS NULL');
/**
* For the Database DataSource, returns an instance of PDOStatement.
* For all others, returns an array.
*/
$data = $this->_runQuery($query)->fetchAll();
/**
* Creates a Gacela\Collection\Collection instance based on the internal type of the data passed to Mapper::_collection()
* Currently, two types of Collections are supported, Arr (for arrays) and PDOStatement
*/
return $this->_collection($data);
}
}
Sometimes, it is desirable to pass data to the model that is not strictly represented in the underlying table for a specific model.
Lets assume that we want to track all login attempts for each user. We could add the following database table:
CREATE TABLE logins (
`userId` INT UNSIGNED NOT NULL,
`timestamp` TIMESTAMP NOT NULL,
`succeeded` BOOL NOT NULL DEFAULT 0,
PRIMARY KEY(`userId`, `timestamp`),
CONSTRAINT `fk-user-logins` FOREIGN KEY (userId) REFERENCES users(id) ON DELETE CASCADE
) ENGINE=Innodb;
However, we'd like to be able to know the number of times each user has attempted to login along with the number of times they have successfully logged in. So we're going to modify the find() and findAll() queries for the User mapper to always include the number of login attempts and successful logins.
<?php
namespace Mapper;
use Gacela as G;
class User extends Mapper
{
public function find($id)
{
$criteria = \G::instance()criteria()->equals('id', $id);
$rs = $this->_runQuery($this->_base($criteria))->fetch();
if(!$rs)
{
$rs = new \stdClass;
}
return $this->_load($rs);
}
public function findAll(\G\Criteria $criteria = null)
{
/**
* Returns an instance Gacela\Collection\Statement
**/
return $this->_runQuery($this->_base($criteria));
}
/**
* Allows for a unifying method of fetching the custom data set for find() and find_all()
**/
protected function _base(\G\Criteria $criteria = null)
{
$attempts = $this->_getQuery()
->from('logins', 'attempts' => 'COUNT(*)')
->where('logins.userId = users.id');
$logins = $this->_getQuery()
->from('logins', 'logins' => 'COUNT(*)')
->where('logins.userId = users.id')
->where('succeeded = 1');
return $this->_getQuery($criteria)
->from('users', [
'users.*',
'attempts' => $attempts,
'logins' => $logins
]);
}
}
So far everything we've done with the Models is pretty standard:
$user = new \Model\User('\Mapper\User');
$user->name = 'Noah Dingermeister';
$user->email = 'noahmeister@gacela.com';
if($user->validate()) {
$user->save();
} else {
print_r($user->errors);
}
But what about all of the fancy schmanzy business logic that you need to manage?
In the 'users' table we store the full name in a single field, but what if we wanted to be able to use the first name and last name separately? For example, so that when a user logs in, you could output 'Hello Bobby' to the user.
In the User model we could do the following:
<?php
namespace Model;
class User extends Model {
protected function _getFirstName()
{
$first = explode(' ', $this->name);
return current($first);
}
protected function _getLastName()
{
$last = explode(' ', $this->name);
return end($last);
}
}
?>
$user = \G::instance()->find('User', 1);
echo 'Hello '.$user->firstName;
Let's also assume that whenever the content for a Post model is set, that we want to translate certain words (lame, boring, stupid) to a different word (AMAZING):
<?php
namespace Model;
class Post extends Model {
protected function _setContent($val)
{
$val = str_replace(['lame', 'boring', 'stupid'], 'AMAZING', $val);
/**
* The _set() method verifies that the new value is distinct from the current value
* and then moves the old value into the _originalData array, sets the new value into _data
* and adds the specified key to the _changed array.
*/
$this->_set('content', $val);
}
}
?>
<?php
$post = new \Model\Post('\Mapper\Post', [
'userId' => 2,
'title' => 'Another Great Post',
'content' => 'This post is all about how lame and stupid I think these docs are. In fact this is so boring I
am falling asleep.'
]);
/*
* Outputs: This post is all about how AMAZING and AMAZING I think these docs are. In fact this is so AMAZING
* I am falling asleep.
*/
echo $post->content;
Another common case with Models is verifying whether a value is empty or not. As such, just like with __get() and __set() and _get and _set, the methods isset() and empty() will call the magic __isset() method on objects. Likewise _isset allows you to define custom behavior for a virtual property.
Right here seems like a good place to take a second and explain how validation works in Gacela.
By default Gacela uses the field meta data from the Mapper to perform validation. For example, if you have a field declared as an integer in the underlying resource, then only integer values would validate as true for the field in the Model.
Gacela currently supports the following data types natively:
- Binary
- Bool
- Date
- Decimal
- Enum
- Float
- Guid
- Int
- Set
- String
- Time
Validating to make sure that the input data matches the underlying data type is great and all, but what about validating data for complex business rules? For example, what if we only want to allow users with email addresses from the gacela.com domain?
The simplest solution in this case is to simply extend the validate() method in the model.
<?php
namespace Model;
class User extends Model {
public function validate(array $data = null)
{
if($data) {
$this->setData($data);
}
if(strpos($this->email, 'gacela.com') === false) {
$this->_errors['email'] = 'not_gacela_domain';
}
return parent::validate();
}
}