How can we schedule the Dataiku DSS flows How can we schedule the Dataiku DSS flows ? I have created a flow for scoring pipeline which I want to execute automatically every given interval of time
Refresh Reload Dataset - Dataiku Community Welcome to the Dataiku Community! To answer your question you can manually refresh the sample for a SQL dataset In DSS when you add a SQL dataset it will fetch a sample by default first 10,000 rows returned You can save and refresh this sample as needed by clicking on the dataset and going to the Sample Settings - Save and Refresh Sample
Home - Dataiku Community Welcome to the Dataiku Community: a peer-to-peer community to discuss data preparation, analytics, machine learning and AI on the Dataiku platform
Remove Duplicates based on one column — Dataiku Community Muhanned Partner, Dataiku DSS Core Designer, Dataiku DSS ML Practitioner, Dataiku DSS Adv Designer, Registered Posts: 2 Partner March 2022 Yes, I agree with @Jurre You achieve the task of removing duplicates based on one column while keeping all other columns data in the output dataset by using the trick she explained in "Group" recipe
How to return current date and not time — Dataiku Community data set full of jobs and each job has a start date, so I want to create a formula that says filter out my data where my start date hasn't happened yet less than the current date