Skip to main content

How to Collect an Excel File into Dadosfera

Excel is one of the most widely used tools in the professional world. Because it makes it easy to organize data in tables, your organization may already have valuable information in Excel files that you want to analyze in Dadosfera.

At the moment, it is not possible to upload an Excel file directly to Dadosfera. However, if you have a static Excel file and want to analyze it in Dadosfera, there are a few ways to collect that data:

📘

Before collecting the file into Dadosfera, make sure the data is in tabular format and import one worksheet at a time.

1) Export the file as CSV and import it manually

Recommended when your organization does not want to store files in another cloud tool. This is the fastest option.

2) Upload the file to Google Drive and collect it with the Google Sheets connector

Recommended when your organization already uses Google accounts.

  • First, upload the Excel file to Google Drive:
    • Open Google Drive and sign in with your Google account.
    • Click the New button in the upper-left corner.
    • Select File upload.
    • Browse to the location on your computer where the Excel file is stored.
    • Select the file and click Open.
    • After the upload finishes, the Excel file will appear in your Google Drive folder.
    • Note: a simpler alternative is to drag and drop the Excel file directly into the Google Drive folder, as shown in the GIF below.
  • To convert the Excel file into Google Sheets, follow these steps:
    • Right-click the Excel file you just uploaded to Google Drive.
    • Select Open with.
    • Click Google Sheets. This opens the Excel file inside Google Sheets.
    • To save it as a native Google Sheets document, click File in the upper-left corner.
    • Select Save as Google Sheets.
    • Give the new document a name if needed and click Save.
Uploading an Excel file to Google Drive and converting it to Google Sheets

3) Upload the file to AWS storage and collect it with the S3 connector

Recommended when your organization uses AWS and the data volume is relevant.

  • Export the Excel file as CSV: File -> Save As -> CSV.
  • Choose or create an S3 bucket.
  • Click Upload.
  • Select Add files and then Upload.
  • Create a pipeline using Amazon S3 as the source.

This workflow is also recommended for collecting static data from tools or file formats that Dadosfera does not support through a native connector.

If your data is not static, you can also build an automated collection script for S3, a database, or even Google Sheets, and then create a pipeline for recurring collection in Dadosfera.