Prisma #orm-help

terion

06/13/2018, 1:36 PM

Can one of the devs also take a look at this? https://github.com/prismagraphql/prisma/issues/2574 It's a very strange problem that totally breaks continuous delivery 😞

dpetrick

06/13/2018, 1:55 PM

Looking at it.

👍 1

terion

06/13/2018, 1:56 PM

I think issue 2623 is about the same problem

dpetrick

06/13/2018, 1:59 PM

I assume that this issue is talking about the passive connector introspection. Do you use an existing database with data?

terion

06/13/2018, 2:00 PM

I use a multi-tenancy DB

terion

06/13/2018, 2:01 PM

And deploying manually using the cluster API works OK: the project is created and the migrations run. But the project's API doesn't work until I reload the server (recreate the container in Kubernetes)

dpetrick

06/13/2018, 2:03 PM

What does your ingress look like on the Kubernetes cluster (if you use ingress)?

terion

06/13/2018, 2:07 PM

{
  "kind": "Ingress",
  "apiVersion": "extensions/v1beta1",
  "metadata": {
    "name": "ingress-prisma",
    "namespace": "default",
    "selfLink": "/apis/extensions/v1beta1/namespaces/default/ingresses/ingress-prisma",
    "uid": "e5705f08-6811-11e8-b424-4eaef66759da",
    "resourceVersion": "4573807",
    "generation": 3,
    "creationTimestamp": "2018-06-04T16:11:10Z",
    "annotations": {
      "certmanager.k8s.io/cluster-issuer": "letsencrypt-prod",
      "kubectl.kubernetes.io/last-applied-configuration": "{\"apiVersion\":\"extensions/v1beta1\",\"kind\":\"Ingress\",\"metadata\":{\"annotations\":{\"certmanager.k8s.io/cluster-issuer\":\"letsencrypt-prod\",\"kubernetes.io/ingress.class\":\"nginx\",\"kubernetes.io/tls-acme\":\"true\"},\"name\":\"ingress-prisma\",\"namespace\":\"default\"},\"spec\":{\"rules\":[{\"host\":\"prisma.dev.dosvit.org.ua\",\"http\":{\"paths\":[{\"backend\":{\"serviceName\":\"prisma\",\"servicePort\":4466},\"path\":\"/\"}]}}],\"tls\":[{\"hosts\":[\"prisma.dev.dosvit.org.ua\"],\"secretName\":\"prisma-tls\"}]}}\n",
      "kubernetes.io/ingress.class": "nginx",
      "kubernetes.io/tls-acme": "true"
    }
  },
  "spec": {
    "tls": [
      {
        "hosts": [
          "prisma.dev.dosvit.org.ua"
        ],
        "secretName": "prisma-tls"
      }
    ],
    "rules": [
      {
        "host": "prisma.dev.dosvit.org.ua",
        "http": {
          "paths": [
            {
              "path": "/",
              "backend": {
                "serviceName": "prisma",
                "servicePort": 4466
              }
            }
          ]
        }
      }
    ]
  },
  "status": {
    "loadBalancer": {
      "ingress": [
        {}
      ]
    }
  }
}

dpetrick

06/13/2018, 2:14 PM

That looks standard to me.

dpetrick

06/13/2018, 2:14 PM

If you dump the Kubernetes pod logs, does anything problematic show up? Also, what do you mean by multi-tenant DB?

terion

06/13/2018, 2:16 PM

what do you mean by multi-tenant DB

https://www.prisma.io/docs/reference/prisma-servers-and-dbs/database-connectors/overview-eiw6ahgiet#multitenancy

terion

06/13/2018, 2:18 PM

Logs... I recreated the pod recently, so there are no logs, but as far as I remember nothing showed up. I'll deploy a new app now and take a look

dpetrick

06/13/2018, 2:20 PM

You can get logs from the previous container instance of a pod with the -p flag of kubectl logs
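
For reference, -p is the short form of --previous on kubectl logs. A minimal sketch, assuming a hypothetical pod name (prisma-0) and the default namespace, neither of which is confirmed in this thread. Note that the flag covers an earlier instance of a container inside a pod that still exists, so it may not help once the pod itself has been deleted and recreated:

```shell
#!/usr/bin/env bash
# Print logs from the previous instance of a pod's container.
# $1: pod name, $2: namespace (optional, defaults to "default").
previous_logs() {
  local pod=$1 namespace=${2:-default}
  kubectl logs --previous "$pod" --namespace "$namespace"
}

# Usage (hypothetical pod name; list real ones with `kubectl get pods`):
#   previous_logs prisma-0 default
```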

dpetrick

06/13/2018, 2:22 PM

Just to understand your setup: Do you use MySQL or PostgreSQL?

terion

06/13/2018, 2:23 PM

MySQL

terion

06/13/2018, 2:24 PM

and the current Prisma version is 1.9 (stable)

dpetrick

06/13/2018, 2:29 PM

I’m fairly sure multi-tenancy is not working the way the docs imply and I’m not sure why this is documented the way it is. Not 100% sure if this is related to the issue at hand, but I will look at it a bit more.

terion

06/13/2018, 2:31 PM

Docs in general are a big problem for Prisma 🙂

dpetrick

06/13/2018, 2:36 PM

Agreed. Your problem is indeed a strange one, I can’t directly think of why it should behave the way it does. However, if you can capture some logs from a pod that behaves this way and shoot them my way it could help. Additionally, I will deploy a Prisma pod on a k8s cluster and see what happens later.

terion

06/13/2018, 2:37 PM

I'm currently deploying a fresh app to get fresh logs

terion

06/13/2018, 2:44 PM

While it is building, I'll mention that IMO the design strategy (since Graphcool) of targeting "a single user who deploys projects via the CLI to AWS" was kinda... strange)). As far as I can see, you are now making Prisma Server more generic, but the old design still gets in the way) I'm trying to use it as part of a huge and complex system and I need fully custom control via CI/CD etc., and it sometimes just kicks me in the head))

dpetrick

06/13/2018, 2:48 PM

Interesting, it would greatly help us if you just write up your thoughts somewhere (doesn’t have to be super coherent, random thoughts are good as well) so we can improve Prisma for your use cases.

terion

06/13/2018, 2:54 PM

Well, I suppose most of it comes down to the lack of docs for these cases (e.g. for the cluster API and other low-level things). To get things up and running I was reverse-engineering all your stuff and continuously asking everyone in chats, forums and GitHub for several weeks 🙂 Another critical thing is the migration strategy. I'm currently thinking through some ideas about this and will write them up on GitHub. Now, with Prisma 1.9, it looks like other things are way better

terion

06/13/2018, 3:22 PM

Ok, here we go

terion

06/13/2018, 3:22 PM

App deployed and migrated

terion

06/13/2018, 3:22 PM

That's all there is in the query type: https://www.dropbox.com/s/x8f3dogqaauz8qj/%D0%A1%D0%BA%D1%80%D0%B8%D0%BD%D1%88%D0%BE%D1%82%202018-06-13%2018.22.39.png?dl=0

terion

06/13/2018, 3:23 PM

Mutation is simply empty: https://www.dropbox.com/s/cgf6wqvdffztoew/%D0%A1%D0%BA%D1%80%D0%B8%D0%BD%D1%88%D0%BE%D1%82%202018-06-13%2018.22.56.png?dl=0

terion

06/13/2018, 3:24 PM

Introspection: https://gist.github.com/terion-name/43f6f97bf0932a5d4e8adb636ba55576

terion

06/13/2018, 3:25 PM

Oh. logs are full, one moment

terion

06/13/2018, 3:27 PM

added log to gist

terion

06/13/2018, 3:31 PM

While "project already exists" is OK (when the container restarts it tries to run the add and migrate commands again), the second error (DeploymentInProgress) is strange... I run these tasks via curl in the container (application) startup script:

#!/usr/bin/env bash
# Only the variables listed here are substituted in the request templates.
REPLACE='$MUNICIPALITY_NAME:$APPLICATION_NAME:$APPLICATION_VERSION_SAFE:$PRISMA_STAGE'
# Create the project through the cluster API.
envsubst "$REPLACE" < .dosvit/prisma/mysql.project.request.data | curl \
          --request POST --url "$PRISMA_HOST:$PRISMA_PORT/cluster" \
          --header 'accept: application/json' --header 'content-type: application/json' \
          --data @-
# Run the migration (deploy) for that project.
envsubst "$REPLACE" < .dosvit/prisma/mysql.migrate.request.data | curl \
          --request POST --url "$PRISMA_HOST:$PRISMA_PORT/cluster" \
          --header 'accept: application/json' --header 'content-type: application/json' \
          --data @-

I don't get how an overlap can occur

terion

06/13/2018, 3:35 PM

And despite these exceptions, the database is in the correct state. But the app does not work until the server is restarted

dpetrick

06/13/2018, 9:53 PM

Thanks, I will look at it tomorrow.

terion

06/14/2018, 7:43 PM

@dpetrick hi, do you have any ideas about this?)

dpetrick

06/15/2018, 9:04 AM

I have nothing conclusive at the moment unfortunately, I have not been able to reproduce the issue.

terion

06/21/2018, 1:08 PM

This is very strange. I've already added sleeps to the script, but it still errors:

% Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
{
  "data" : {
    "addProject" : null
  },
  "errors" : [ {
    "locations" : [ {
      "line" : 2,
      "column" : 5
    } ],
    "path" : [ "addProject" ],
    "code" : 4005,
    "message" : "Service with name 'stub-documents-0-0-2-beta-3' and stage 'staging' already exists",
    "requestId" : "local:management:cjiok967m001x0986hebna25p"
  } ]

  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
100   680  100   351  100   329  10609   9944 --:--:-- --:--:-- --:--:-- 10968
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed

  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
100  3124  100   429  100  2695   3379  21233 --:--:-- --:--:-- --:--:-- 21388
}{
  "data" : {
    "deploy" : null
  },
  "errors" : [ {
    "locations" : [ {
      "line" : 2,
      "column" : 5
    } ],
    "path" : [ "deploy" ],
    "code" : 4008,
    "message" : "You can not deploy to a service stage while there is a deployment in progress or a pending deployment scheduled already. Please try again after the deployment finished.",
    "requestId" : "local:management:cjiok9a42001y0986ke0bkooc"
  } ]
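
The two responses carry different error codes, so a startup script can tell them apart instead of ignoring the output. A minimal sketch, assuming only the two codes seen above (4005 for a service that already exists, 4008 for a deployment in progress or pending); the function names are hypothetical and the parsing relies on plain sed rather than a JSON parser:

```shell
#!/usr/bin/env bash
# Pull the first numeric "code" value out of a JSON response read from stdin.
error_code() {
  sed -n 's/.*"code"[[:space:]]*:[[:space:]]*\([0-9][0-9]*\).*/\1/p' | head -n 1
}

# Map a response (stdin) to a short verdict on stdout.
classify_response() {
  local code
  code=$(error_code)
  case $code in
    '')   echo ok ;;             # no error code in the response
    4005) echo exists ;;         # service already exists: harmless on restart
    4008) echo busy ;;           # a deployment is in progress or pending
    *)    echo "error:$code" ;;  # anything else is passed through
  esac
}

# Usage: save a curl response to a file, then decide what to do next, e.g.
#   verdict=$(classify_response < response.json)
#   [ "$verdict" = busy ] && echo "deployment still running" >&2
```

Treating 4005 as harmless matches the observation in this thread that "project already exists" is expected when a container restarts.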

terion

06/21/2018, 1:08 PM

5 second delay between ops

terion

06/21/2018, 1:09 PM

Looks like it does not clear the deployment queue

dpetrick

06/21/2018, 1:13 PM

You can take a look at the migration table in Prisma's internal database. It lists queued changes to the schema and the migration status.

dpetrick

06/21/2018, 1:19 PM

Is the prisma container you’re using the only prisma container on the underlying database?

terion

06/21/2018, 11:13 PM

Prisma databases are touched only by prisma. There are more dbs on this db-server for other services, but they don't intersect

dpetrick

06/22/2018, 9:05 AM

Can you please run a migrationStatus query against the /management GraphQL endpoint to see what is going on with the stuck deployment?
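
A minimal sketch of such a request, built the same way as the curl calls in the startup script. The name and stage argument names are an assumption, as is the field selection (projectId, status and applied are the fields that show up in the deploy response later in this thread). A server with a management secret configured would also need an Authorization header, which is omitted here:

```shell
#!/usr/bin/env bash
# Build the JSON body of a migrationStatus query for one service and stage.
# Assumption: the query takes `name` and `stage` arguments.
migration_status_payload() {
  local name=$1 stage=$2
  printf '{"query":"{ migrationStatus(name: \\"%s\\", stage: \\"%s\\") { projectId status applied } }"}\n' \
    "$name" "$stage"
}

# Usage (needs a reachable Prisma server, so it is left commented out):
#   migration_status_payload "$PRISMA_PROJECT" "$PRISMA_STAGE" | curl \
#     --silent --request POST --url "$PRISMA_HOST:$PRISMA_PORT/management" \
#     --header 'content-type: application/json' --data @-
```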

terion

06/23/2018, 10:26 AM

I looked at the database while this error was occurring: all migration statuses are SUCCESS

dpetrick

06/25/2018, 8:04 AM

Then it definitely sounds like a bug. Do you think it’s possible to create a reproduction for us? We do have a kubernetes cluster where we can test it out, we just need the resource definitions around it and maybe the context how you do/trigger everything.

terion

06/25/2018, 3:28 PM

As an option, you can pull the image from here: registry-local.dev.dosvit.org.ua/news:1.0.3-beta.5 and run it with this env:

PRISMA_HOST: http://prisma
PRISMA_PORT: 4466
PRISMA_STAGE: staging
PRISMA_SECRET: 
PRISMA_PROJECT: stub-news-1-0-3-beta-5
PORT: 3000
APP_KEY: bf22b7ce-01ae-41f9-9cae-c5aa85f7d52b
S3_KEY: 
S3_SECRET: 
S3_HOST: 
S3_BUCKET: 
MUNICIPALITY_NAME: stub
APPLICATION_NAME: news
APPLICATION_VERSION: 1.0.3-beta.5
APPLICATION_VERSION_SAFE: 1-0-3-beta-5

The container start script contains the deploy and migration commands:

#!/usr/bin/env bash
# Only the variables listed here are substituted in the request templates.
REPLACE='$MUNICIPALITY_NAME:$APPLICATION_NAME:$APPLICATION_VERSION_SAFE:$PRISMA_STAGE'
# Create the project through the cluster API.
envsubst "$REPLACE" < .dosvit/prisma/mysql.project.request.data | curl \
          --request POST --url "$PRISMA_HOST:$PRISMA_PORT/cluster" \
          --header 'accept: application/json' --header 'content-type: application/json' \
          --data @-
# Run the migration (deploy) for that project.
envsubst "$REPLACE" < .dosvit/prisma/mysql.migrate.request.data | curl \
          --request POST --url "$PRISMA_HOST:$PRISMA_PORT/cluster" \
          --header 'accept: application/json' --header 'content-type: application/json' \
          --data @-

I also tried adding a sleep between them, but this did not help. Also worth mentioning: this bug does not fire every time. Sometimes the container starts OK

dpetrick

06/25/2018, 3:49 PM

Thank you, as soon as I have some time I will try to reproduce it.

👍 1

terion

06/26/2018, 7:31 PM

I'm monitoring these problems and saw that deployments are indeed getting stuck with the status "in progress"

dpetrick

06/26/2018, 9:46 PM

Interesting, then it’s likely that your schema is actually blowing up internally, leaving everything in an inconsistent state. Can you (or did you already?) share the schema via PM?

terion

06/27/2018, 12:10 PM

FYI, it seems to be fixed by separating deploy, migrate and start into separate containers. Previously they all ran in one script; now I've separated these tasks and run them as separate processes using Kubernetes init containers, which run in sequence before the main container starts. Several deploys like this were successful

terion

07/05/2018, 2:17 PM

Hello. Init containers made this error rarer, but it still appears :(

terion

07/20/2018, 12:01 PM

The problem is still there even on 1.12. Migrations have status SUCCESS, but nothing works until the container is recreated

terion

07/20/2018, 12:04 PM

@dpetrick Initial migrate command returns this:

{
  "data" : {
    "deploy" : {
      "clientMutationId" : null,
      "migration" : {
        "projectId" : "test-otg-enterprises-registry-0-0-8@staging",
        "status" : "PENDING",
        "applied" : 0
      },
      "errors" : [ ]
    }
  }
}

It looks like if the application tries to access the project before the migration succeeds, the server caches an empty schema and does not rebuild it after the migration
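
If that hypothesis holds, one possible workaround is to keep the application away from the project until the migration reports SUCCESS. A minimal sketch, assuming the status is read from a response shaped like the one above; fetch_status is a hypothetical caller-supplied command (for example a curl call against the management endpoint), and the retry count and delay are arbitrary:

```shell
#!/usr/bin/env bash
# Pull the first "status" value out of a JSON response read from stdin.
extract_status() {
  sed -n 's/.*"status"[[:space:]]*:[[:space:]]*"\([A-Z_]*\)".*/\1/p' | head -n 1
}

# Poll until the migration status is SUCCESS.
# $1: command that prints the raw JSON response (hypothetical, caller-supplied)
# $2: maximum number of attempts (default 30)
# $3: seconds to sleep between attempts (default 2)
# Returns 0 on SUCCESS, 1 if the attempts run out first.
wait_for_migration() {
  local fetch=$1 tries=${2:-30} delay=${3:-2} attempt=0 status
  while [ "$attempt" -lt "$tries" ]; do
    status=$("$fetch" | extract_status)
    [ "$status" = SUCCESS ] && return 0
    attempt=$((attempt + 1))
    sleep "$delay"
  done
  return 1
}

# Usage in a startup script, before the application is launched:
#   wait_for_migration fetch_status 30 2 || exit 1
```

This only enforces the ordering; nothing in the thread confirms that it resolves the issue, since the failure was also reported with migrations already in status SUCCESS.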

dpetrick

07/20/2018, 1:43 PM

Thanks, that helps

terion

11/29/2018, 4:09 PM

@dpetrick this issue is still present in 1.19

dpetrick

11/29/2018, 4:25 PM

I will read up on the issue again in a bit

terion

11/29/2018, 4:56 PM

Moreover, it is happening more and more often

terion

11/29/2018, 4:56 PM

This error occurs after adding a project and then migrating it, one after the other with a 15-second interval
