brave-tomato-16287
04/14/2022, 8:19 AM
brave-forest-5974
04/14/2022, 8:41 AM
bland-orange-13353
04/14/2022, 11:33 AM
famous-match-44342
04/14/2022, 1:32 PM
famous-match-44342
04/14/2022, 1:35 PM
cool-architect-34612
04/15/2022, 12:48 AM
nutritious-bird-77396
04/15/2022, 4:16 PM
GroupMembership
for users...
Even though I insert the same ingestion record, the record should be skipped instead of raising an error.
Because of this one error, all the other updates in the batch update are dropped as well, causing the users and groups information to be mismatched across environments.
Error details in 🧵
icy-ram-1893
04/16/2022, 7:04 AM
orange-coat-2879
04/18/2022, 2:14 AM
source:
  type: mssql
  config:
    # Coordinates
    host_port: localhost:1433
    database: TutorialDB
    schema_pattern:
      allow:
        - "QQ"
    table_pattern:
      allow:
        - "accessories"
        - "raw_account"
    # Credentials
    username: sa
    password: pwd
    profiling:
      enabled: true
sink:
  # sink configs
  type: "datahub-rest"
  config:
    server: "http://localhost:8080"
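The allow entries under schema_pattern and table_pattern in the recipe above are regular expressions. Below is a minimal sketch for trying such patterns locally before an ingestion run. It assumes the patterns are applied with Python's re.match (anchored at the start of the name, case-insensitive), which is an assumption about DataHub's behavior and not something confirmed in this thread. The helper is_allowed and the candidate names are hypothetical.

```python
import re


def is_allowed(name: str, allow_patterns: list) -> bool:
    """Return True if any allow pattern matches from the start of `name`.

    Assumption: patterns are applied with re.match and IGNORECASE.
    """
    return any(re.match(p, name, re.IGNORECASE) for p in allow_patterns)


# Allow list copied from the recipe above.
table_allow = ["accessories", "raw_account"]

# Hypothetical candidate names: one bare, one fully qualified.
bare = is_allowed("accessories", table_allow)
qualified = is_allowed("TutorialDB.QQ.accessories", table_allow)

# A leading wildcard lets the pattern reach past the database and schema.
wildcard = is_allowed("TutorialDB.QQ.accessories", [r".*\.accessories"])

print(bare, qualified, wildcard)
```

If the source compares patterns against the fully qualified name, a bare table name in the allow list would filter every table out; a leading `.*` would be one way around that.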
cool-architect-34612
04/18/2022, 5:11 AM
silly-application-87541
04/18/2022, 9:17 AM
better-orange-49102
04/18/2022, 9:36 AM
fresh-electrician-85277
04/18/2022, 12:31 PM
delightful-barista-90363
04/18/2022, 9:23 PM
source:
  type: "s3"
rich-policeman-92383
04/19/2022, 9:24 AM
acoustic-quill-54426
04/19/2022, 10:58 AM
Failed to match table read event XXX with job; try increasing query_log_delay or max_query_duration
but there is nothing wrong with those: please make sure your job/QueryEvent is not being filtered here
quaint-lighter-81058
04/19/2022, 6:19 PM
best-umbrella-24804
04/21/2022, 4:12 AM
alert-football-80212
04/21/2022, 8:33 AM
curl 'http://localhost:8080/entitiesV2/<url-encoded-entity-urn>'.
According to the documentation, the response should be JSON describing the entity, but I receive the page HTML instead (with status 200).
Does anyone use the Rest.li API who can help?
billions-twilight-48559
04/21/2022, 2:54 PM
early-librarian-13786
04/21/2022, 3:54 PM
'entities_profiled': 1
I tried different sink types (datahub-kafka, datahub-rest) and Postgres tables with different column types and row counts, but the result was the same.
Has anyone else faced this issue, and is there a solution?
red-pizza-28006
04/21/2022, 4:54 PM
lemon-terabyte-66903
04/21/2022, 8:16 PM
data-lake
source. How do I merge them all into one, so that it shows as one dataset in the UI?
nutritious-bird-77396
04/21/2022, 8:49 PM
curved-football-28924
04/22/2022, 5:27 AM
dazzling-alarm-64985
04/22/2022, 6:00 AM
'xxxxxx': ['The schema registry subject for the value schema is not found. The topic is "
"either '\n" 'schema-less, or no messages have been written to the topic yet.']
rich-policeman-92383
04/22/2022, 6:59 AM
---
source:
  type: hive
  config:
    host_port: hive:10000
    env: "PROD"
    table_pattern:
      allow:
        - "A.B\\$"
    options:
      connect_args: {'auth': 'KERBEROS','kerberos_service_name': 'hive'}
    profiling:
      enabled: true
    profile_pattern:
      allow:
        - "A.B\\$"
sink:
  type: "datahub-rest"
  config:
    server: "https://datahub:8080"
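In the double-quoted YAML strings above, `\\` is the escape for a single backslash, so the regex that reaches the matcher is `A.B\$`. Below is a short sketch of what that pattern accepts under Python's re module. The table names are hypothetical, and re.match is assumed as the matching function.

```python
import re

# "A.B\\$" in double-quoted YAML decodes to the regex A.B\$
pattern = r"A.B\$"

# \$ is a literal dollar sign, not an end-of-string anchor.
matches_literal_dollar = bool(re.match(pattern, "A.B$"))
matches_plain_name = bool(re.match(pattern, "A.B"))

# The unescaped dot matches any single character, not only ".".
matches_any_separator = bool(re.match(pattern, "AxB$"))

print(matches_literal_dollar, matches_plain_name, matches_any_separator)
```

If the intent was to match exactly table B in schema A, a pattern such as `A\.B$` (written `"A\\.B$"` in double-quoted YAML) would be one option. Whether that was the intent is not stated in the thread.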
magnificent-hospital-52323
04/22/2022, 7:53 AM
datahub-gms | 07:49:26.623 [qtp544724190-14] ERROR c.l.m.filter.RestliLoggingFilter:38 - Rest.li error:
datahub-gms | com.linkedin.restli.server.RestLiServiceException: Failed to validate record with class com.linkedin.assertion.AssertionInfo: ERROR :: /datasetAssertion/nativeParameters :: unrecognized field found but not allowed
datahub-gms | ERROR :: /datasetAssertion/nativeType :: unrecognized field found but not allowed
datahub-gms | ERROR :: /datasetAssertion/aggregation :: unrecognized field found but not allowed
datahub-gms | ERROR :: /datasetAssertion/parameters :: unrecognized field found but not allowed
datahub-gms | ERROR :: /datasetAssertion/dataset :: unrecognized field found but not allowed
datahub-gms | ERROR :: /datasetAssertion/operator :: unrecognized field found but not allowed
datahub-gms |
datahub-gms | at com.linkedin.metadata.resources.entity.AspectResource.lambda$ingestProposal$3(AspectResource.java:140)
datahub-gms | at com.linkedin.metadata.restli.RestliUtil.toTask(RestliUtil.java:30)
datahub-gms | at com.linkedin.metadata.restli.RestliUtil.toTask(RestliUtil.java:50)
datahub-gms | at com.linkedin.metadata.resources.entity.AspectResource.ingestProposal(AspectResource.java:133)
datahub-gms | at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
datahub-gms | at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
datahub-gms | at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
datahub-gms | at java.lang.reflect.Method.invoke(Method.java:498)
datahub-gms | at com.linkedin.restli.internal.server.RestLiMethodInvoker.doInvoke(RestLiMethodInvoker.java:172)
datahub-gms | at com.linkedin.restli.internal.server.RestLiMethodInvoker.invoke(RestLiMethodInvoker.java:326)
datahub-gms | at com.linkedin.restli.internal.server.filter.FilterChainDispatcherImpl.onRequestSuccess(FilterChainDispatcherImpl.java:47)
datahub-gms | at com.linkedin.restli.internal.server.filter.RestLiFilterChainIterator.onRequest(RestLiFilterChainIterator.java:86)
datahub-gms | at com.linkedin.restli.internal.server.filter.RestLiFilterChainIterator.lambda$onRequest$0(RestLiFilterChainIterator.java:73)
datahub-gms | at java.util.concurrent.CompletableFuture.uniAccept(CompletableFuture.java:670)
datahub-gms | at java.util.concurrent.CompletableFuture.uniAcceptStage(CompletableFuture.java:683)
datahub-gms | at java.util.concurrent.CompletableFuture.thenAccept(CompletableFuture.java:2010)
datahub-gms | at com.linkedin.restli.internal.server.filter.RestLiFilterChainIterator.onRequest(RestLiFilterChainIterator.java:72)
datahub-gms | at com.linkedin.restli.internal.server.filter.RestLiFilterChain.onRequest(RestLiFilterChain.java:55)
datahub-gms | at com.linkedin.restli.server.BaseRestLiServer.handleResourceRequest(BaseRestLiServer.java:218)
datahub-gms | at com.linkedin.restli.server.RestRestLiServer.handleResourceRequestWithRestLiResponse(RestRestLiServer.java:242)
datahub-gms | at com.linkedin.restli.server.RestRestLiServer.handleResourceRequest(RestRestLiServer.java:211)
datahub-gms | at com.linkedin.restli.server.RestRestLiServer.handleResourceRequest(RestRestLiServer.java:181)
datahub-gms | at com.linkedin.restli.server.RestRestLiServer.doHandleRequest(RestRestLiServer.java:164)
datahub-gms | at com.linkedin.restli.server.RestRestLiServer.handleRequest(RestRestLiServer.java:120)
datahub-gms | at com.linkedin.restli.server.RestLiServer.handleRequest(RestLiServer.java:132)
datahub-gms | at com.linkedin.restli.server.DelegatingTransportDispatcher.handleRestRequest(DelegatingTransportDispatcher.java:70)
datahub-gms | at com.linkedin.r2.filter.transport.DispatcherRequestFilter.onRestRequest(DispatcherRequestFilter.java:70)
datahub-gms | at com.linkedin.r2.filter.TimedRestFilter.onRestRequest(TimedRestFilter.java:72)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator$FilterChainRestIterator.doOnRequest(FilterChainIterator.java:146)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator$FilterChainRestIterator.doOnRequest(FilterChainIterator.java:132)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator.onRequest(FilterChainIterator.java:62)
datahub-gms | at com.linkedin.r2.filter.TimedNextFilter.onRequest(TimedNextFilter.java:55)
datahub-gms | at com.linkedin.r2.filter.transport.ServerQueryTunnelFilter.onRestRequest(ServerQueryTunnelFilter.java:58)
datahub-gms | at com.linkedin.r2.filter.TimedRestFilter.onRestRequest(TimedRestFilter.java:72)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator$FilterChainRestIterator.doOnRequest(FilterChainIterator.java:146)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator$FilterChainRestIterator.doOnRequest(FilterChainIterator.java:132)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator.onRequest(FilterChainIterator.java:62)
datahub-gms | at com.linkedin.r2.filter.TimedNextFilter.onRequest(TimedNextFilter.java:55)
datahub-gms | at com.linkedin.r2.filter.message.rest.RestFilter.onRestRequest(RestFilter.java:50)
datahub-gms | at com.linkedin.r2.filter.TimedRestFilter.onRestRequest(TimedRestFilter.java:72)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator$FilterChainRestIterator.doOnRequest(FilterChainIterator.java:146)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator$FilterChainRestIterator.doOnRequest(FilterChainIterator.java:132)
datahub-gms | at com.linkedin.r2.filter.FilterChainIterator.onRequest(FilterChainIterator.java:62)
datahub-gms | at com.linkedin.r2.filter.FilterChainImpl.onRestRequest(FilterChainImpl.java:96)
datahub-gms | at com.linkedin.r2.filter.transport.FilterChainDispatcher.handleRestRequest(FilterChainDispatcher.java:75)
datahub-gms | at com.linkedin.r2.util.finalizer.RequestFinalizerDispatcher.handleRestRequest(RequestFinalizerDispatcher.java:61)
datahub-gms | at com.linkedin.r2.transport.http.server.HttpDispatcher.handleRequest(HttpDispatcher.java:101)
datahub-gms | at com.linkedin.r2.transport.http.server.AbstractR2Servlet.service(AbstractR2Servlet.java:105)
datahub-gms | at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)
datahub-gms | at com.linkedin.restli.server.spring.ParallelRestliHttpRequestHandler.handleRequest(ParallelRestliHttpRequestHandler.java:63)
datahub-gms | at org.springframework.web.context.support.HttpRequestHandlerServlet.service(HttpRequestHandlerServlet.java:73)
datahub-gms | at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)
datahub-gms | at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:852)
datahub-gms | at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1604)
datahub-gms | at com.datahub.authentication.filter.AuthenticationFilter.doFilter(AuthenticationFilter.java:77)
datahub-gms | at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1591)
datahub-gms | at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:542)
datahub-gms | at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
datahub-gms | at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:536)
datahub-gms | at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
datahub-gms | at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:235)
datahub-gms | at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:1581)
datahub-gms | at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:233)
datahub-gms | at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1307)
datahub-gms | at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:188)
datahub-gms | at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:482)
datahub-gms | at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:1549)
datahub-gms | at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:186)
datahub-gms | at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1204)
datahub-gms | at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
datahub-gms | at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:221)
datahub-gms | at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:146)
datahub-gms | at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
datahub-gms | at org.eclipse.jetty.server.Server.handle(Server.java:494)
datahub-gms | at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:374)
datahub-gms | at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:268)
datahub-gms | at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:311)
datahub-gms | at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:103)
datahub-gms | at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:117)
datahub-gms | at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:336)
datahub-gms | at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:313)
datahub-gms | at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:171)
datahub-gms | at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.run(EatWhatYouKill.java:129)
datahub-gms | at org.eclipse.jetty.util.thread.ReservedThreadExecutor$ReservedThread.run(ReservedThreadExecutor.java:367)
datahub-gms | at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:782)
datahub-gms | at org.eclipse.jetty.util.thread.QueuedThreadPool$Runner.run(QueuedThreadPool.java:918)
datahub-gms | at java.lang.Thread.run(Thread.java:748)
What could the issue be? I don't quite understand the error message. Thanks.
mammoth-fountain-32989
04/22/2022, 12:23 PM
bright-beard-86474
04/22/2022, 6:38 PM
python3 -m datahub ingest -c ./examples/recipes/example_to_datahub_rest.yml
The output log says "Pipeline finished successfully", with no warnings and no errors, but I don't see any records in the DataHub UI. Could someone please help figure out where the blocker is? Thanks!