Skip to main content

Packing Sheets with Webpack

Webpack is a modern build tool for generating static sites. It has a robust JavaScript-powered plugin system1

SheetJS is a JavaScript library for reading and writing data from spreadsheets.

This demo uses Webpack and SheetJS to pull data from a spreadsheet and display the content in an HTML table. We'll explore how to load SheetJS in a Webpack 5 Asset Plugin and generate data for use in webpages.

The "Webpack 5 Demo" creates a complete website powered by a XLSX spreadsheet.

This demo covers static asset imports. For processing files in the browser, the "Bundlers" demo includes an example of importing the SheetJS library in a browser script.

Webpack 5 Asset Module

Webpack 5 supports asset modules. With a special option, the loader will receive NodeJS Buffers that can be parsed. The dev server will even watch the files and reload the page in development mode!

The SheetJS NodeJS module can be imported from Webpack loader scripts.

The following diagram depicts the workbook waltz:

Webpack Config

A special rule should be added to module.rules:

webpack.config.js
// ...
module.exports = {
// ...
module: {
rules: [
{
/* `test` matches file extensions */
test: /\.(numbers|xls|xlsx|xlsb)/,
/* use the loader script */
use: [ { loader: './sheetjs-loader' } ]
}
]
}
};

Hot Module Replacement enables reloading when files are updated:

webpack.config.js
// ...
module.exports = {
// ...
devServer: {
static: './dist',
hot: true,
}
};

It is strongly recommended to add an alias to simplify imports:

webpack.config.js
// ...
module.exports = {
// ...
resolve: {
alias: {
/* `~` root of the project */
"~": __dirname
}
},
};

SheetJS Loader

The SheetJS loader script must export a raw property that is set to true.

The base export is expected to be the loader function. The loader receives the file bytes as a Buffer, which can be parsed with the SheetJS read method2. read returns a SheetJS workbook object3.

The loader in this demo will parse the workbook, pull the first worksheet, and generate an array of row objects using the sheet_to_json method4:

sheetjs-loader.js
const XLSX = require("xlsx");

function loader(content) {
/* since `loader.raw` is true, `content` is a Buffer */
const wb = XLSX.read(content);
/* pull data from first worksheet */
var data = XLSX.utils.sheet_to_json(wb.Sheets[wb.SheetNames[0]]);
return `export default JSON.parse('${JSON.stringify(data)}')`;
}
/* ensure the function receives a Buffer */
loader.raw = true;
module.exports = loader;

Asset Imports

Spreadsheets can be imported using the plugin. Assuming pres.xlsx is stored in the data subfolder, ~/data/pres.xlsx can be imported from any script:

src/index.js
import data from '~/data/pres.xlsx';
/* `data` is an array of objects from data/pres.xlsx */

const elt = document.createElement('div');
elt.innerHTML = "<table><tr><th>Name</th><th>Index</th></tr>" +
data.map((row) => `<tr>
<td>${row.Name}</td>
<td>${row.Index}</td>
</tr>`).join("") +
"</table>";
document.body.appendChild(elt);

Webpack 5 Demo

Tested Deployments

This demo was last tested on 2023 December 04 against Webpack 5.89.0

Initial Setup

0) Create a new skeleton project:

mkdir sheetjs-wp5
cd sheetjs-wp5
npm init -y
npm install [email protected] [email protected] [email protected] --save
mkdir -p dist
mkdir -p src
mkdir -p data

1) Install the SheetJS NodeJS module:

npm i --save https://cdn.sheetjs.com/xlsx-0.20.1/xlsx-0.20.1.tgz

2) Save the following to dist/index.html:

dist/index.html
<!DOCTYPE html>
<html>
<head>
<title>SheetJS + Webpack 5</title>
</head>
<body>
<script src="main.js"></script>
</body>
</html>

3) Save the following to src/index.js:

src/index.js
import data from '~/data/pres.xlsx';

const elt = document.createElement('div');
elt.innerHTML = "<table><tr><th>Name</th><th>Index</th></tr>" +
data.map((row) => `<tr>
<td>${row.Name}</td>
<td>${row.Index}</td>
</tr>`).join("") +
"</table>";
document.body.appendChild(elt);

4) Save the following to webpack.config.js:

webpack.config.js
const path = require('path');

module.exports = {
entry: './src/index.js',
output: {
filename: 'main.js',
path: path.resolve(__dirname, 'dist'),
},
devServer: {
static: './dist',
hot: true,
},
resolve: {
alias: {
"~": __dirname
}
},
module: {
rules: [
{
test: /\.(numbers|xls|xlsx|xlsb)/,
use: [ { loader: './sheetjs-loader' } ]
}
]
}
};

5) Save the following to sheetjs-loader.js:

sheetjs-loader.js
const XLSX = require("xlsx");

function loader(content) {
/* since `loader.raw` is true, `content` is a Buffer */
const wb = XLSX.read(content);
/* pull data from first worksheet */
var data = XLSX.utils.sheet_to_json(wb.Sheets[wb.SheetNames[0]]);
return `export default JSON.parse('${JSON.stringify(data)}')`;
}
/* ensure the function receives a Buffer */
loader.raw = true;
module.exports = loader;

6) Download https://sheetjs.com/pres.xlsx and save to the data folder:

curl -L -o data/pres.xlsx https://sheetjs.com/pres.xlsx

Live Reload Test

7) Open the test file data/pres.xlsx in a spreadsheet editor like Excel.

8) Start the development server:

npx webpack serve --mode=development

The terminal will print URLs for the development server:

<i> [webpack-dev-server] Project is running at:
<i> [webpack-dev-server] Loopback: http://localhost:8080/

9) Open the Loopback address (http://localhost:8080) in a web browser.

It should display a table of Presidents with "Name" and "Index" columns

10) Add a new row to the spreadsheet and save the file.

Upon saving, the page should refresh with the new data.

Static Site Test

11) Stop Webpack and build the site:

npx webpack --mode=production

The final site will be placed in the dist folder.

12) Start a local web server to host the dist folder:

npx http-server dist

The command will print a list of URLs.

13) Open one of the URLs printed in the previous step (http://localhost:8080) and confirm that the same data is displayed.

To verify that the page is independent of the spreadsheet, make some changes to the file and save. The page will not automatically update.

To verify that the data was added to the page, append main.js to the URL (http://localhost:8080/main.js) and view the source. The source will include president names. It will not include SheetJS library references!