This message was deleted Permit #opal

Join Slack

This message was deleted.

# opal

Slackbot

04/18/2023, 1:11 PM

This message was deleted.

Shuvy Ankor

04/18/2023, 1:13 PM

Hi @David Hamilton I’ll check with the team members and get back to you

❤️ 1

Oded Bd

04/18/2023, 1:31 PM

Hi @David Hamilton can you share why did you set

INLINE_OPA_ENABLED=false

David Hamilton

04/18/2023, 1:31 PM

whoops, sorry, that was in order to run separately for now. With that removed is when it was not working for me

Oded Bd

04/18/2023, 1:32 PM

So I would set it to: # enable inline OPA OPAL_INLINE_OPA_ENABLED=true

David Hamilton

04/18/2023, 1:33 PM

Correct. That's how it was when I was testing

✅ 1

David Hamilton

04/18/2023, 1:51 PM

Output when starting:

Copy code

opal_client_1        | 2023-04-18T13:36:50.285126+0000 | 16 | opal_client.opa.runner                  | INFO  | Running OPA inline: opa run --server --addr=:8181 --authentication=off --authorization=off --config-file=/opal/config/config.yaml --log-level=debug
opal_client_1        | 2023-04-18T13:36:50.312097+0000 | 16 | opal_client.opa.logger                  | INFO  | Initializing server. {"addrs": [":8181"], "diagnostic-addrs": [], "time": "2023-04-18T13:36:50Z"}
opal_client_1        | 2023-04-18T13:36:50.313281+0000 | 16 | opal_client.opa.logger                  | INFO  | Starting decision logger. {"plugin": "decision_logs", "time": "2023-04-18T13:36:50Z"}

Not only do I not see decision logs, but I also don't ee the request/response pair when i make a query. However, I see logs like this

Copy code

opal_client_1        | 2023-04-18T13:46:06.269338+0000 | 16 | opal_client.opa.logger                  | INFO  | Received request.    {"client_addr": "127.0.0.1:54582", "req_body": "", "req_id": 2, "req_method": "GET", "req_params": {}, "req_path": "/v1/policies", "time": "2023-04-18T13:46:06Z"}
opal_client_1        | 2023-04-18T13:46:06.270038+0000 | 16 | opal_client.opa.logger                  | INFO  | Sent response.       {"client_addr": "127.0.0.1:54582", "req_id": 2, "req_method": "GET", "req_path": "/v1/policies", "resp_body": "{\"result\":[]}\n", "resp_bytes": 14, "resp_duration": 0.7213, "resp_status": 200, "time": "2023-04-18T13:46:06Z"}

Oded Bd

04/18/2023, 1:57 PM

Do you see those logs?

Copy code

docker-opal_client-1               | 2023-04-18T13:40:26.633528+0000 | 56 | opal_client.opa.logger                  | INFO  | Received request.    GET /v1/policies
docker-opal_client-1               | 2023-04-18T13:40:26.635689+0000 | 56 | opal_client.opa.logger                  | INFO  | Sent response.       GET /v1/policies -> 200
docker-opal_client-1               | 2023-04-18T13:40:26.649214+0000 | 56 | opal_client.opa.logger                  | INFO  | Received request.    PUT /v1/data
docker-opal_client-1               | 2023-04-18T13:40:26.653022+0000 | 56 | opal_client.opa.logger                  | INFO  | Sent response.       PUT /v1/data -> 204
docker-opal_client-1               | 2023-04-18T13:40:26.661507+0000 | 56 | opal_client.opa.logger                  | INFO  | Received request.    PUT /v1/policies/utils.rego

Oded Bd

04/18/2023, 1:59 PM

I got it when I run

docker compose -f docker-compose-example.yml up

without any config change

Oded Bd

04/18/2023, 2:05 PM

When I change it to be similar the values you posted before it is even better, and I see the full response in the logs as well, eg:

Copy code

docker-opal_client-1               | 2023-04-18T14:04:25.406845+0000 | 56 | opal_client.opa.logger                  | INFO  | Sent response.       {"client_addr": "172.18.0.1:59754", "req_id": 13, "req_method": "GET", "req_path": "/v1/data/app/rbac", "resp_body": "{\"result\":{\"allow\":false,\"user_is_granted\":[]}}", "resp_bytes": 47, "resp_duration": 19.641375, "resp_status": 200, "time": "2023-04-18T14:04:25Z"}

Oded Bd

04/18/2023, 2:06 PM

This is my docker compose with few changes:

Copy code

version: "3.8"
services:
  # When scaling the opal-server to multiple nodes and/or multiple workers, we use
  # a *broadcast* channel to sync between all the instances of opal-server.
  # Under the hood, this channel is implemented by encode/broadcaster (see link below).
  # At the moment, the broadcast channel can be either: postgresdb, redis or kafka.
  # The format of the broadcaster URI string (the one we pass to opal server as `OPAL_BROADCAST_URI`) is specified here:
  # <https://github.com/encode/broadcaster#available-backends>
  broadcast_channel:
    image: postgres:alpine
    environment:
      - POSTGRES_DB=postgres
      - POSTGRES_USER=postgres
      - POSTGRES_PASSWORD=postgres
  opal_server:
    # by default we run opal-server from latest official image
    image: permitio/opal-server:latest
    environment:
      # the broadcast backbone uri used by opal server workers (see comments above for: broadcast_channel)
      - OPAL_BROADCAST_URI=<postgres://postgres:postgres@broadcast_channel:5432/postgres>
      # number of uvicorn workers to run inside the opal-server container
      - UVICORN_NUM_WORKERS=4
      # the git repo hosting our policy
      # - if this repo is not public, you can pass an ssh key via `OPAL_POLICY_REPO_SSH_KEY`)
      # - the repo we pass in this example is *public* and acts as an example repo with dummy rego policy
      # - for more info, see: <https://docs.opal.ac/tutorials/track_a_git_repo>
      - OPAL_POLICY_REPO_URL=<https://github.com/permitio/opal-example-policy-repo>
      # in this example we will use a polling interval of 30 seconds to check for new policy updates (git commits affecting the rego policy).
      # however, it is better to utilize a git *webhook* to trigger the server to check for changes only when the repo has new commits.
      # for more info see: <https://docs.opal.ac/tutorials/track_a_git_repo>
      - OPAL_POLICY_REPO_POLLING_INTERVAL=30
      # configures from where the opal client should initially fetch data (when it first goes up, after disconnection, etc).
      # the data sources represents from where the opal clients should get a "complete picture" of the data they need.
      # after the initial sources are fetched, the client will subscribe only to update notifications sent by the server.
      - OPAL_DATA_CONFIG_SOURCES={"config":{"entries":[{"url":"<http://opal_server:7002/policy-data>","topics":["policy_data"],"dst_path":"/static"}]}}
      - OPAL_LOG_FORMAT_INCLUDE_PID=true
    ports:
      # exposes opal server on the host machine, you can access the server at: <http://localhost:7002>
      - "7002:7002"
    depends_on:
      - broadcast_channel
  opal_client:
    # by default we run opal-client from latest official image
    image: permitio/opal-client:latest
    environment:
      - OPAL_SERVER_URL=<http://opal_server:7002>
      - OPAL_LOG_FORMAT_INCLUDE_PID=true
      - OPAL_INLINE_OPA_LOG_FORMAT=http
      - OPAL_INLINE_OPA_LOG_FORMAT=full
      - OPAL_INLINE_OPA_CONFIG={"log_level":"debug"}
    ports:
      # exposes opal client on the host machine, you can access the client at: <http://localhost:7000>
      - "7004:7000"
      # exposes the OPA agent (being run by OPAL) on the host machine
      # you can access the OPA api that you know and love at: <http://localhost:8181>
      # OPA api docs are at: <https://www.openpolicyagent.org/docs/latest/rest-api/>
      - "8181:8181"
    depends_on:
      - opal_server
    # this command is not necessary when deploying OPAL for real, it is simply a trick for dev environments
    # to make sure that opal-server is already up before starting the client.
    command: sh -c "./wait-for.sh opal_server:7002 --timeout=20 -- ./start.sh"

Oded Bd

04/18/2023, 2:06 PM

This is an example request that I've sent

Copy code

curl --location --request GET '<http://localhost:8181/v1/data/app/rbac>' \
--data-raw ''

David Hamilton

04/18/2023, 2:07 PM

I appreciate it, i'll give that a go

Oded Bd

04/18/2023, 2:12 PM

And this is how I checked permissions

David Hamilton

04/18/2023, 2:25 PM

Ok, well your dockerfile works for me with the petstore API. When I move to my own repository, that's where it begins to fail. OPA is getting all of the policies correctly, and my authn request/responses all work correctly. However, nothing gets logged. The only properties I changed are those below: In the OPAL server config

Copy code

- OPAL_POLICY_REPO_URL
      - OPAL_POLICY_REPO_SSH_KEY
      - OPAL_POLICY_REPO_MAIN_BRANCH

and in the client:

Copy code

- OPAL_POLICY_SUBSCRIPTION_DIRS

David Hamilton

04/18/2023, 2:26 PM

So either it is my policies (which seem to work fine, but perhaps break the logging?) or it is the MAIN_BRANCH or SUBSCRIPTION_DIRS properties. I'll play a bit to test it out

Oded Bd

04/18/2023, 2:31 PM

Glad to hear that my docker (with the params you shared) works for you. I will ask around to see if anyone has an hint about the params that you shared and a possible affect on the logging, please share more data points if you find any so we can better help 🙂

David Hamilton

04/18/2023, 2:33 PM

Will do

David Hamilton

04/18/2023, 6:07 PM

Ok, I have confirmed it has something to do with my policies/data. Because I get the decision logging still working with the properties above. I've set up a copy of the sample here on my development branch. https://github.com/david-hamilton-bah/opal-example-policy-repo/tree/development There are multiple directories and using the property

OPAL_POLICY_SUBSCRIPTION_DIRS=common:tenant_petshop

and having a util function within the

common

directory. I've included a docker-compose and postman collection. To be clear, this sample DOES work as expected. It is only when I apply my real policies and data that it breaks, so I'll just have to keep digging.

David Hamilton

04/18/2023, 6:09 PM

I see now that after doing several requests with my real policies, requests begin to hang with no response from OPA. Part of the problem is probably that i didn't realize the data would be prefixed with the directory name, whereas the policies won't be prefixed unless it is set in the package name in the rego file

David Hamilton

04/18/2023, 6:11 PM

so just copying someone's existing policies into a directory and "subscribing" to that will require all references to `data.*`to be changed to

data.tenant_petshop.*

for example

David Hamilton

05/12/2023, 6:38 PM

Coming back to this after doing some other work, I have simplified my scenario a bit. If I run the embedded OPA I will always see the server hanging when decision logs are turned on, but if I run OPA separately it runs fine. Starting OPA with

opa run --server --log-format json-pretty -c config.yaml ./server -l debug

Then in my docker-compose opal_client env

Copy code

- OPAL_INLINE_OPA_ENABLED=false
      - OPAL_POLICY_STORE_URL=<http://host.docker.internal:8181>

David Hamilton

05/12/2023, 6:39 PM

./server

is an empty directory, so I am starting up OPA with no policies

David Hamilton

05/12/2023, 6:40 PM

OPAL client sends all the policy and static data and my policies work propertly, however, just doing the same with the embedded OPA, all my requests just load forever with no logging from OPA or OPAL client

Oded Bd

05/14/2023, 7:55 AM

@Ro'e Katz / @Ori Shavit do you have any idea why this can happen? or any idea about how to debug it?

Ro'e Katz

05/14/2023, 8:33 AM

I’m not familiar with something like that. @David Hamilton just to make sure I understand correctly - when using @Oded Bd’s example, you’ve been able to see decision logs? Then when moving to your own repo, things start to hang? although when running OPA separately it does work with the same policy? Hmm… Is your repo heavy in terms of either amount/size of rego files or size of json file?

David Hamilton

05/15/2023, 1:02 PM

Yes, your understanding is correct. The data at this point is all static from the repository, with http calls to retrieve extra data as needed. data.json is 265KB

David Hamilton

05/15/2023, 1:05 PM

repository is 2.3MB

David Hamilton

05/15/2023, 1:13 PM

Got policy bundle with 49 rego files, 1 data files

Docker stats prior to running any requests

Copy code

CONTAINER ID   NAME                                      CPU %     MEM USAGE / LIMIT     MEM %     NET I/O           BLOCK I/O         PIDS
9350f5de83e1   bip-baaz-opal-opal_client-1               0.33%     101.4MiB / 1.938GiB   5.11%     390kB / 19.2kB    35.9MB / 0B      15

then after

Copy code

CONTAINER ID   NAME                                      CPU %     MEM USAGE / LIMIT     MEM %     NET I/O           BLOCK I/O         PIDS
9350f5de83e1   bip-baaz-opal-opal_client-1               0.28%     103.5MiB / 1.938GiB   5.21%     401kB / 550kB     38.2MB / 0B       15

No significant changes to resources

David Hamilton

05/15/2023, 1:41 PM

Let me know if there is anything else I can do to debug the embedded OPA instance or any other metrics I could give you.

👀 1

Ro'e Katz

05/16/2023, 8:01 AM

There’s a known issue of long loading times when having a lot of rego files - OPA is currently recompiling everything every time you load a new policy file. I feel like that’s related, although I know you had OPAL load policies into OPA even when you ran it separately. When you saw things hang - have you checked in logs to see that the repeated`Received request. PUT ...` &&

Sent response.       PUT …

logs lines where not still being printed?

Ro'e Katz

05/16/2023, 8:11 AM

In the command line you’ve shared (

opa run --server --log-format json-pretty -c config.yaml ./server -l debug

), does the

./server

directory contain all policies? (that are also stored in your git repo)

David Hamilton

05/16/2023, 10:39 AM

no, ./server is empty

David Hamilton

05/16/2023, 10:40 AM

And the received/sent messages stop. There is no received message for the request that freezes

David Hamilton

05/16/2023, 10:40 AM

but I get all the messages showing that OPAL sent the policies and OPA received them, and i still get the keepalive logs from opal client

👀 1

5 Views

Open in Slack

Previous Next