Hi all, can I configure the use of ParallelRunner only on the command line or also in parameters.yml?
Hi all, is there a way to directly pass a value to a function that I use in a node (i.e. include the value in the parameter `inputs` of the node) without defining it in a config file?
Hi all! I have a pipeline where all datasets are PartitionedDataset. I want to implement a global filter on which partitions to process. I hoped I could drop some partitions in an after_dataset_loaded hook but that's not supported. Is there some other way for a global partition filter?
Has anyone successfully implemented a custom expectation for the use with kedro-expectations? When I copy an example of a custom exception (https://github.com/great-expectations/great_expectations/blob/develop/contrib/expe[…]erimental/expectations/expect_multicolumn_values_to_be_equal.py) to gx/plugins/expectations, gx is not able to find it and throws an exception.
Hi everyone! Has anyone implemented a customer log handler and successfully configured it in logging.yml? I'm getting a "No module named 'myproject.handlers'" error. I guess the logging is instantiated at a point where the project hasn't been loaded yet. Any idea how to get a custom logger running?
What is the right place to validate command line parameters to kedro run? Let's say I want to check if parameter A has been passed (kedro run --params=a=1) if "env" is local and parameter C exists when env is "prod". It would be great if the process just stops with an error message and not a stack trace.
Hi all! The Kedro documentation has a nice example of how to validate data with great expectations. But it only looks at one dataset at a time. But what would I do if I need to validate the data of a node that merges two datasets? Let's say one table is a lookup table and the other table may only contain entries that exists in the lookup table? Has anyone every checked multiple datasets at a time? Do you have an example for that?
Hi all! I have an issue with credentials not being inserted into the dataset configuration:
credentials.yml
my_connection_string: "<connection>"<br /><br />catalog.yml<br />table1:<br /> type: pandas.CSVDataSet<br /> filepath: <a target="_blank" rel="noopener noreferrer">abfs://my_container/table1.csv</a><br /> credentials:<br /> connection_string: my_connection_string<br /><br />I get "unable to connect to account for Connection string is either blank or malformed.." errors. I guess "my_connection_string" is used as the connection string not "<connection>".
Hi all, when providing an abfs:// path (Azure Data Lake Storage) in the catalog, what credentials exactly do I have to provide? An example would be really helpful.