diff --git a/docs/advanced/sql-databases-peewee.md b/docs/advanced/sql-databases-peewee.md index ce5c62c9a..b6650bd82 100644 --- a/docs/advanced/sql-databases-peewee.md +++ b/docs/advanced/sql-databases-peewee.md @@ -257,7 +257,7 @@ And now in the file `sql_app/main.py` let's integrate and use all the other part In a very simplistic way create the database tables: -```Python hl_lines="10 11 12" +```Python hl_lines="9 10 11" {!./src/sql_databases_peewee/sql_app/main.py!} ``` @@ -265,7 +265,7 @@ In a very simplistic way create the database tables: Create a dependency that will connect the database right at the beginning of a request and disconnect it at the end: -```Python hl_lines="19 20 21 22 23 24 25" +```Python hl_lines="23 24 25 26 27 28 29" {!./src/sql_databases_peewee/sql_app/main.py!} ``` @@ -273,58 +273,52 @@ Here we have an empty `yield` because we are actually not using the database obj It is connecting to the database and storing the connection data in an internal variable that is independent for each request (using the `contextvars` tricks from above). +Because the database connection is potentially I/O blocking, this dependency is created with a normal `def` function. + And then, in each *path operation function* that needs to access the database we add it as a dependency. But we are not using the value given by this dependency (it actually doesn't give any value, as it has an empty `yield`). So, we don't add it to the *path operation function* but to the *path operation decorator* in the `dependencies` parameter: -```Python hl_lines="36 44 51 63 69 76" +```Python hl_lines="32 40 47 59 65 72" {!./src/sql_databases_peewee/sql_app/main.py!} ``` -### Context Variable Middleware - -For all the `contextvars` parts to work, we need to make sure there's a new "context" each time there's a new request, so that we have a specific context variable Peewee can use to save its state (database connection, transactions, etc). - -For that, we need to create a middleware. - -Right before the request, we are going to reset the database state. We will "set" a value to the context variable and then we will ask the Peewee database state to "reset" (this will create the default values it uses). +### Context variable sub-dependency -And then the rest of the request is processed with that new context variable we just set, all automatically and more or less "magically". +For all the `contextvars` parts to work, we need to make sure we have an independent value in the `ContextVar` for each request that uses the database, and that value will be used as the database state (connection, transactions, etc) for the whole request. -For the **next request**, as we will reset that context variable again in the middleware, that new request will have its own database state (connection, transactions, etc). +For that, we need to create another `async` dependency `reset_db_state()` that is used as a sub-dependency in `get_db()`. It will set the value for the context variable (with just a default `dict`) that will be used as the database state for the whole request. And then the dependency `get_db()` will store in it the database state (connection, transactions, etc). -```Python hl_lines="28 29 30 31 32 33" +```Python hl_lines="18 19 20" {!./src/sql_databases_peewee/sql_app/main.py!} ``` +For the **next request**, as we will reset that context variable again in the `async` dependency `reset_db_state()` and then create a new connection in the `get_db()` dependency, that new request will have its own database state (connection, transactions, etc). + !!! tip As FastAPI is an async framework, one request could start being processed, and before finishing, another request could be received and start processing as well, and it all could be processed in the same thread. - But context variables are aware of these async features, so, a Peewee database state set in the middleware will keep its own data throughout the entire request. + But context variables are aware of these async features, so, a Peewee database state set in the `async` dependency `reset_db_state()` will keep its own data throughout the entire request. And at the same time, the other concurrent request will have its own database state that will be independent for the whole request. #### Peewee Proxy - If you are using a Peewee Proxy, the actual database is at `db.obj`. So, you would reset it with: ```Python hl_lines="3 4" -@app.middleware("http") -async def reset_db_middleware(request: Request, call_next): +async def reset_db_state(): database.db.obj._state._state.set(db_state_default.copy()) database.db.obj._state.reset() - response = await call_next(request) - return response ``` ### Create your **FastAPI** *path operations* Now, finally, here's the standard **FastAPI** *path operations* code. -```Python hl_lines="36 37 38 39 40 41 44 45 46 47 50 51 52 53 54 55 56 57 60 61 62 63 64 65 66 69 70 71 72 75 76 77 78 79 80 81 82 83" +```Python hl_lines="32 33 34 35 36 37 40 41 42 43 46 47 48 49 50 51 52 53 56 57 58 59 60 61 62 65 66 67 68 71 72 73 74 75 76 77 78 79" {!./src/sql_databases_peewee/sql_app/main.py!} ``` @@ -364,15 +358,13 @@ If you want to check how Peewee would break your app if used without modificatio # db._state = PeeweeConnectionState() ``` -And in the file `sql_app/main.py` file, comment the middleware: +And in the file `sql_app/main.py` file, comment the body of the `async` dependency `reset_db_state()` and replace it with a `pass`: ```Python -# @app.middleware("http") -# async def reset_db_middleware(request: Request, call_next): +async def reset_db_state(): # database.db._state._state.set(db_state_default.copy()) # database.db._state.reset() -# response = await call_next(request) -# return response + pass ``` Then run your app with Uvicorn: @@ -391,11 +383,11 @@ The tabs will wait for a bit and then some of them will show `Internal Server Er ### What happens -The first tab will make your app create a connection to the database and wait for some seconds before replying back and closing the connection. +The first tab will make your app create a connection to the database and wait for some seconds before replying back and closing the database connection. Then, for the request in the next tab, your app will wait for one second less, and so on. -This means that it will end up finishing some of the last tabs' requests than some of the previous ones. +This means that it will end up finishing some of the last tabs' requests earlier than some of the previous ones. Then one the last requests that wait less seconds will try to open a database connection, but as one of those previous requests for the other tabs will probably be handled in the same thread as the first one, it will have the same database connection that is already open, and Peewee will throw an error and you will see it in the terminal, and the response will have an `Internal Server Error`. @@ -413,15 +405,12 @@ Now go back to the file `sql_app/database.py`, and uncomment the line: db._state = PeeweeConnectionState() ``` -And in the file `sql_app/main.py` file, uncomment the middleware: +And in the file `sql_app/main.py` file, uncomment the body of the `async` dependency `reset_db_state()`: ```Python -@app.middleware("http") -async def reset_db_middleware(request: Request, call_next): +async def reset_db_state(): database.db._state._state.set(db_state_default.copy()) database.db._state.reset() - response = await call_next(request) - return response ``` Terminate your running app and start it again. @@ -477,11 +466,11 @@ Repeat the same process with the 10 tabs. This time all of them will wait and yo Peewee uses `threading.local` by default to store it's database "state" data (connection, transactions, etc). -`threading.local` creates a value exclusive to the current thread, but an async framework would run all the "tasks" (e.g. requests) in the same thread, and possibly not in order. +`threading.local` creates a value exclusive to the current thread, but an async framework would run all the code (e.g. for each request) in the same thread, and possibly not in order. -On top of that, an async framework could run some sync code in a threadpool (using `asyncio.run_in_executor`), but belonging to the same "task" (e.g. to the same request). +On top of that, an async framework could run some sync code in a threadpool (using `asyncio.run_in_executor`), but belonging to the same request. -This means that, with Peewee's current implementation, multiple tasks could be using the same `threading.local` variable and end up sharing the same connection and data, and at the same time, if they execute sync IO-blocking code in a threadpool (as with normal `def` functions in FastAPI, in *path operations* and dependencies), that code won't have access to the database state variables, even while it's part of the same "task" (request) and it should be able to get access to that. +This means that, with Peewee's current implementation, multiple tasks could be using the same `threading.local` variable and end up sharing the same connection and data (that they shouldn't), and at the same time, if they execute sync I/O-blocking code in a threadpool (as with normal `def` functions in FastAPI, in *path operations* and dependencies), that code won't have access to the database state variables, even while it's part of the same request and it should be able to get access to the same database state. ### Context variables @@ -489,46 +478,44 @@ Python 3.7 has