Stop all migrations and roll back everything on failure #22616

roji · 2020-09-19T07:48:54Z

When looking at #22613, I realized something... In general, if an earlier migration fails, later migrations will still be attempted and may succeed, leaving the database in a potentially inconsistent and confusing state. I understand that the idea is to allow the idempotent migration to be re-executed later and "fill in" the missing migrations, but if there's any dependency between one migration and another, this doesn't work (think about data migrations where a later migration implicitly depends on an earlier one).

Our new 5.0 behavior isn't a regression (or even worse) compared to the previous behavior. However, if we wrapped all migrations in a single transaction (and set XACT_ABORT to ON in SQL Server), we should get the safer behavior above, while at the same time addressing the original ask in #7681. The problem here may be with transaction suppression - but isn't that always problematic (and also SQL Server-specific).

/cc @bricelam

ajcvickers · 2020-09-21T19:42:08Z

Consider as part of #19587

bricelam · 2020-09-21T21:45:52Z

Triage: Wrap all the migrations inside a single transaction (instead of individual transactions per migration)

ChristopherHaws · 2020-09-23T01:29:08Z

@bricelam From what I can tell, there are only two types of migrations operations that are always TransactionSuppressed:

SqlServerCreateDatabaseOperation
SqlServerDropDatabaseOperation

The rest of the operations that are sometimes TransactionSuppressed are when the resource is memory optimized:

AddColumnOperation
AddForeignKeyOperation
AddPrimaryKeyOperation
AlterColumnOperation
RenameIndexOperation
CreateTableOperation
DropTableOperation
CreateIndexOperation
DropPrimaryKeyOperation
AlterDatabaseOperation
AlterTableOperation
DropForeignKeyOperation
DropIndexOperation
DropColumnOperation

Could this be a way of achieving this?

The "up" migration specifically places the SqlServerCreateDatabaseOperation before the migration transaction
The "down" migration specifically places the SqlServerDropDatabaseOperation after the migration transaction
If --no-transaction is not specified and any of the migrations have TransactionSuppressed = true then return an error stating that "Migrations in a transaction are only supported when not using memory optimized tables"

bricelam · 2020-09-23T16:46:04Z

Don't worry about SqlServerCreateDatabaseOperation and DropDatabase. The fact that they're migration operations is merely an implementation detail. No migration will contain them.

TransactionSuppressed is orthogonal to --no-transaction. TransactionSuppressed means it will commit the current transaction, execute the operation, and begin a new one. --no-transactions means the script will not include those COMMIT and BEGIN TRANSACTION statements.

For this issue, we just need to lift the transaction management logic inside MigrationCommandExecutor into Migrator--similar to how GenerateScript works. Then update both to stop committing and beginning new transactions between migrations (but continue doing it for TransactionSuppressed).

ChristopherHaws · 2020-09-23T16:56:53Z

@bricelam Ok, that mostly makes sense to me. I wasn't aware of the create and drop db commands being outside the migrations, so that's good to know.

If we continue generating the begin tran and commit's for the TransactionSuppressed commands, won't it sort of defeat the purpose of the whole migration executing inside of a single transaction since now there would be committed schema/data? I'm not sure of any other way around this, but maybe a warning would be useful.

Also, should the creation of the migration history table be inside or outside of the transaction? I don't see any reason it can't be inside.

Fixes dotnet#22616

bricelam · 2020-09-23T21:04:52Z

I like the idea of warning about TransactionSuppressed. I'm just not sure where we can do it that isn't already too late (i.e. during Update-Database). And the script plainly shows where the transaction will be committed, obviating the need for a warning. But maybe extra warnings about data loss and suppressed transactions would actually help; not sure.

Should the creation of the migration history table be inside or outside of the transaction?

I'd keep it outside. It's more of a prerequisite than part of the migration.

Zero3 · 2020-09-27T13:04:03Z

Triage: Wrap all the migrations inside a single transaction (instead of individual transactions per migration)

Just a thought: When a migration fails to apply, rolling back the already successfully applied ones is not necessarily desirable.

It is not unusual for a migration to take quite a while to execute on a large database (think 10+ GB), and it is not unusual to apply a batch of migrations all at once, for example during a scheduled release to production. If a trivial mistake in the last migration makes it fail, it would be quite annoying to lose the many minutes/hours of time spent applying the migrations that succeeded. In such cases, it would be much nicer to be able to simply fix the last migration and re-apply that one only, just as you could in EF6.

roji · 2020-09-27T13:28:43Z

@Zero3 that's definitely a valid argument... The problem is that if you leave the database with that last migration un-applied, that could mean that your program can't start at all, because the schema is in some intermediate unsupported state. At that point users have to figure out what happened, what state they're in, and how to fix it - all under the pressures of downtime; leaving a production database in such a state seems very problematic, even more so than the (possible) loss of time.

Note that it's always possible to edit a migration script and manage transaction manually - which one can definitely do around a migration that's expected to run for very long.

Zero3 · 2020-09-27T15:00:52Z

@roji

The problem is that if you leave the database with that last migration un-applied, that could mean that your program can't start at all, because the schema is in some intermediate unsupported state.

Indeed!

At that point users have to figure out what happened, what state they're in, and how to fix it - all under the pressures of downtime

Given the use case I presented, this can be preferred over losing hours of migration time. And given the long migration time, the downtime aspect should already be handled already anyway.

leaving a production database in such a state seems very problematic, even more so than the (possible) loss of time.

I think that depends on the use case. In the one I presented, I would definitely prefer EF not rolling back the successfully applied migrations.

Note that it's always possible to edit a migration script and manage transaction manually - which one can definitely do around a migration that's expected to run for very long.

I was actually thinking of how it will be done when using context.Database.Migrate();.

Either way, I don't mind whatever the default strategy is, as long as it is possible to make EF not roll back the successfully applied ones :).

ChristopherHaws · 2020-09-28T00:20:10Z

Ultimately it would probably be nice to have hooks before and after the entire migration process and before and after each individual migration. This would allow people to customize the the generation of migrations to fit their needs. This is how I handled adding the pre and post migration scripts in my open PR: https://github.com/dotnet/efcore/pull/22654/files#diff-a98268716bc62993b273028b6b462e39

Perhaps I could modify my PR to have the following (currently the Pre and Post commands in my PR run before and after the entire migration, not each):

GenerateInitializeMigrationCommands: Run once before all the migrations
GeneratePreMigrationCommands: Runs before each migration
GeneratePostMigrationCommands: Runs after each migration
GenerateFinalizeMigrationCommands: Runs once after all the migrations

I do believe that the default behavior should be wrapping the entire script in a transaction and anyone who wants to change that behavior could do so via the IMigrationsSqlGenerator interface. This seems like it would be an advanced use case that the majority of users would never encounter.

roji · 2020-09-28T11:14:01Z

@Zero3 and others, for transaction-per-migration there's also another problem: the problematic migration may fail and leave all previous migrations intact, but subsequent migrations will still get executed as well. In effect, what you get is a database with all migrations applied except one in the middle. There may ways to avoid this later migration execution effect in some database, but it's definitely going to be tricky cross-database.

If the database is left in this state and migrations aren't idempotent, running them again will immediately fail. Even for idempotent migrations, there are still possible dependencies between migrations that can wreak havoc - idempotency means you can safely execute the migration script twice, but not necessarily if the first run had a "hole" in the middle.

So while I do see the case for per-migration transactions, the complications seem very... complicated. I tend to agree with @ChristopherHaws that we should expose hooks to allow users to customize their migration process, and allow them to do transaction-per-migration that way instead of attempting to provide it as a built-in option.

But I think all this needs more thought and de# 6.0.

ChristopherHaws · 2020-09-28T17:09:57Z

@roji

for transaction-per-migration there's also another problem: the problematic migration may fail and leave all previous migrations intact, but subsequent migrations will still get executed as well

This could be avoided with the same measure we are taking here: #22613 (comment)
All it would require is removing this if statement:

efcore/src/EFCore.SqlServer/Migrations/SqlServerMigrationsSqlGenerator.cs

Line 144 in 02c4700

if (!noTransactions)

This would allow the script to fail and not move on if there is an error in any batch regardless of if transactions will be used or not.

roji · 2020-09-28T17:19:26Z

@ChristopherHaws that's purely SQL Server-specific, isn't it? We have to think about other databases as well...

roji · 2020-09-28T17:21:00Z

Unless I'm mistaken, even within SQL Server this also depends on sqlcmd mode. That may be fine - I don't know, I'm trying to still think of migration scripts as something that doesn't depend on the specific tooling which executes them. But that may not be viable.

ChristopherHaws · 2020-09-28T17:21:44Z

@roji Agreed. My understanding was that the other DB's that are supported (PostgreSQL, SQLite, etc) already have the behavior of "stop executing on first failure" and that it was only SQL server that behaves in this way.

Zero3 · 2020-10-04T14:53:40Z

@roji

for transaction-per-migration there's also another problem: the problematic migration may fail and leave all previous migrations intact, but subsequent migrations will still get executed as well.

I don't understand why EF would want to do this, instead of stopping after the first failed migration (like EF6 did). I don't think EF should consider skipping a failed migration a valid thing to do.

Maybe some of these issues arose from the introduction of migration scripts? I'm only thinking in the context of context.Database.Migrate(), of which I am a happy user and appreciated the way EF6 worked (one transaction per migration, and error instead of skipping failed migrations).

roji · 2020-10-05T14:45:27Z

I don't understand why EF would want to do this, instead of stopping after the first failed migration (like EF6 did). I don't think EF should consider skipping a failed migration a valid thing to do.

I don't think anyone wants this to be the behavior - but it's a possible consequence of doing transaction-per-migration (as opposed to transaction-for-all-migrations).

ajcvickers · 2020-10-05T15:09:22Z

@roji But only when scripting and only when the script keeps running inappropriately. We don't have this behavior for Update-Database, for example. It will stop[ after the first failed migration.

roji · 2020-10-05T15:13:20Z

Yeah, this is indeed an SQL script problem (as opposed to when applying migrations programmatic or via CLI). That could warrant different transactional strategies based on the mechanism we use.

Use a single transaction for all migrations in the script Fixes #17578 Fixes #22616

AlaRaies · 2024-09-02T13:29:30Z

in the official documentation we have the following :

...The transaction handling and continue-on-error behavior of these tools are inconsistent and sometimes unexpected. This can leave your database in an undefined state if a failure occurs when applying migrations...

I'am confused about strategy used by the bundle while applying migrations. from what i understand the expected behaviour : the bundle will attemp to apply migrations ( a set of them), if it fails it wont continue and revert what is applied. Is this correct ?

roji · 2024-09-02T14:56:55Z

@AlaRaies the sentence you quote above are about applying migrations via SQL scripts, e.g. using sqlcmd for SQL Server. Migration bundles are meant precisely to solve these shortcomings.

Starting with 9.0, migration bundles will execute all migrations inside a single transaction, so yes, if any error occurs at some point everything will be rolled back. The only exception is a very limited set of migration operations which cannot be executed inside a transaction (e.g. anything related to a SQL Server in-memory table).

bachratyg · 2024-09-02T15:14:02Z

Starting with 9.0, migration bundles will execute all migrations inside a single transaction

A welcome change. But still won't work for Oracle. Sigh.

roji · 2024-09-02T17:55:23Z

@bachratyg can you provide a bit more info? Does Oracle not support running DDL inside transactions?

bachratyg · 2024-09-02T18:52:56Z

It does not fail per se when DDL is inside tranasctions, it just won't be rolled back. See https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/COMMIT.html

Oracle Database issues an implicit COMMIT under the following circumstances:

Before any syntactically valid data definition language (DDL) statement, even if the statement results in an error

After any data definition language (DDL) statement that completes without an error

No reasonable way to roll back DDL apart from maybe some juggling with pdb snapshots (somewhat akin to mssql's database snapshots) or doing a full database backup/restore. AFAIK MySQL has the same limitation.

roji · 2024-09-03T10:54:29Z

@bachratyg thanks, I didn't know that... I do know that rolling back DDL works in PostgreSQL and SQL Server.

roji added type-enhancement area-migrations labels Sep 19, 2020

ajcvickers mentioned this issue Sep 21, 2020

SqlServer Migrations: Scripts do not fail on failure #22613

Open

ajcvickers added this to the Backlog milestone Sep 21, 2020

ajcvickers mentioned this issue Sep 21, 2020

Improve experience deploying databases created by Migrations #19587

Open

25 tasks

bricelam modified the milestones: Backlog, 6.0.0 Sep 21, 2020

bricelam self-assigned this Sep 21, 2020

ChristopherHaws pushed a commit to ChristopherHaws/efcore that referenced this issue Sep 23, 2020

Stop all migrations and roll back everything on failure

a8035f2

Fixes dotnet#22616

ChristopherHaws mentioned this issue Sep 23, 2020

Fix sql server migration transactions not rolling back on failure #22654

Closed

ajcvickers added the needs-design label Oct 28, 2020

AndriySvyryd added the consider-for-current-release label Feb 22, 2024

AndriySvyryd mentioned this issue Feb 22, 2024

Fix migrations service in playground dotnet/aspire#2351

Merged

AndriySvyryd self-assigned this Jun 20, 2024

AndriySvyryd modified the milestones: Backlog, 9.0.0 Jun 20, 2024

AndriySvyryd removed the consider-for-current-release label Jun 24, 2024

AndriySvyryd added closed-fixed and removed needs-design area-aspire labels Jul 10, 2024

AndriySvyryd removed their assignment Jul 10, 2024

AndriySvyryd added a commit that referenced this issue Jul 11, 2024

Execute migrations using the ExecutionStrategy

498c9d1

Use a single transaction for all migrations in the script Fixes #17578 Fixes #22616

AndriySvyryd mentioned this issue Jul 11, 2024

Execute migrations using the ExecutionStrategy #34206

Merged

AndriySvyryd added a commit that referenced this issue Jul 12, 2024

Execute migrations using the ExecutionStrategy

d9667a3

Use a single transaction for all migrations in the script Fixes #17578 Fixes #22616

AndriySvyryd added a commit that referenced this issue Jul 16, 2024

Execute migrations using the ExecutionStrategy

4697720

Use a single transaction for all migrations in the script Fixes #17578 Fixes #22616

AndriySvyryd closed this as completed in #34206 Jul 26, 2024

AndriySvyryd added a commit that referenced this issue Jul 26, 2024

Execute migrations using the ExecutionStrategy

3d6868e

Use a single transaction for all migrations in the script Fixes #17578 Fixes #22616

AndriySvyryd added the area-aspire label Aug 7, 2024

ajcvickers modified the milestones: 9.0.0, 9.0.0-rc1 Aug 21, 2024

AndriySvyryd mentioned this issue Sep 4, 2024

Executing Migrator by having an ambient transaction #31507

Closed

caleblloyd mentioned this issue Sep 11, 2024

Try EF Core 9 now! #33030

Closed

roji modified the milestones: 9.0.0-rc1, 9.0.0 Oct 12, 2024

NateMerritt mentioned this issue Dec 10, 2024

Run DB migrations when deploying Jobs. BiblioNexusStudio/aquifer-server#599

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stop all migrations and roll back everything on failure #22616

Stop all migrations and roll back everything on failure #22616

roji commented Sep 19, 2020

ajcvickers commented Sep 21, 2020 •

edited by AndriySvyryd

Loading

bricelam commented Sep 21, 2020

ChristopherHaws commented Sep 23, 2020

bricelam commented Sep 23, 2020 •

edited

Loading

ChristopherHaws commented Sep 23, 2020 •

edited

Loading

bricelam commented Sep 23, 2020 •

edited

Loading

Zero3 commented Sep 27, 2020

roji commented Sep 27, 2020

Zero3 commented Sep 27, 2020

ChristopherHaws commented Sep 28, 2020 •

edited

Loading

roji commented Sep 28, 2020

ChristopherHaws commented Sep 28, 2020 •

edited

Loading

roji commented Sep 28, 2020

roji commented Sep 28, 2020

ChristopherHaws commented Sep 28, 2020

Zero3 commented Oct 4, 2020

roji commented Oct 5, 2020

ajcvickers commented Oct 5, 2020

roji commented Oct 5, 2020

AlaRaies commented Sep 2, 2024

roji commented Sep 2, 2024

bachratyg commented Sep 2, 2024 •

edited

Loading

roji commented Sep 2, 2024

bachratyg commented Sep 2, 2024

roji commented Sep 3, 2024

Stop all migrations and roll back everything on failure #22616

Stop all migrations and roll back everything on failure #22616

Comments

roji commented Sep 19, 2020

ajcvickers commented Sep 21, 2020 • edited by AndriySvyryd Loading

bricelam commented Sep 21, 2020

ChristopherHaws commented Sep 23, 2020

bricelam commented Sep 23, 2020 • edited Loading

ChristopherHaws commented Sep 23, 2020 • edited Loading

bricelam commented Sep 23, 2020 • edited Loading

Zero3 commented Sep 27, 2020

roji commented Sep 27, 2020

Zero3 commented Sep 27, 2020

ChristopherHaws commented Sep 28, 2020 • edited Loading

roji commented Sep 28, 2020

ChristopherHaws commented Sep 28, 2020 • edited Loading

roji commented Sep 28, 2020

roji commented Sep 28, 2020

ChristopherHaws commented Sep 28, 2020

Zero3 commented Oct 4, 2020

roji commented Oct 5, 2020

ajcvickers commented Oct 5, 2020

roji commented Oct 5, 2020

AlaRaies commented Sep 2, 2024

roji commented Sep 2, 2024

bachratyg commented Sep 2, 2024 • edited Loading

roji commented Sep 2, 2024

bachratyg commented Sep 2, 2024

roji commented Sep 3, 2024

ajcvickers commented Sep 21, 2020 •

edited by AndriySvyryd

Loading

bricelam commented Sep 23, 2020 •

edited

Loading

ChristopherHaws commented Sep 23, 2020 •

edited

Loading

bricelam commented Sep 23, 2020 •

edited

Loading

ChristopherHaws commented Sep 28, 2020 •

edited

Loading

ChristopherHaws commented Sep 28, 2020 •

edited

Loading

bachratyg commented Sep 2, 2024 •

edited

Loading