Sprinkle

Sprinkle

  • Docs
  • Tutorials
  • API
  • FAQ's
  • Blog
  • Go to sprinkledata.com

›Best Practices

Best Practices

  • Best Practices in Data Source

Best Practices

Best Practices in Data Source

  1. If table is a transactional, log based or event track type then data will be very huge as it generates millions of records every day. So, these type of tables can be ingested under incremental mode instead of complete loading. This decreases the run time of the ingestion.
  • Please follow this link for better understanding of incremental ingestion http://docs.sprinkledata.com/docs/feature_data_source_ingestion/#incremental-ingestion-mode.
  1. Create different data sources based on schedule frequency. For example if tables need to pulled on real time then add that table in real time scheduled data source. If tables need to pulled on hourly basis then add that table in hourly scheduled data source.

  2. Avoid adding same table in different data sources.

  3. Increase concurrency if you are ingesting multiple tables. If you are ingesting 5 tables then you can fix concurrency as 5. So, that all the five tables ingest parallely. For the best performance max concurrency you can use is 7.

  • Best Practices in Data Source

Product

FeaturesHow it worksIntegrationsDeploymentPricing

Industries

Retail & EcommerceUrban MobilityFinanceEducation

Departments

MarketingOperationsTechnology

Connect

Free trialAbout Us

Actionable Insights. Faster.

Sprinkle offers self-service analytics by unlocking enterprise scale data via simple search and powerful reporting service.


Copyright © 2021 Sprinkle data