Commit

lint
Signed-off-by: Scott Lee <sjl@anyscale.com>
scottjlee committed Feb 21, 2024
1 parent 6a67e6a commit 439c5ed
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions configs/text-embeddings/README.md
@@ -10,7 +10,7 @@ The high-level steps involved are:
![Embeddings Computation Overview](diagram.jpg "Embeddings Computation Overview")

## Running the code
-The provided script will work as-is; no need to modify anything the underlying script, unless you want to customize the logic to meet your specific requirements.
+The provided script will work as-is; no need to modify anything the underlying script, unless you want to customize the logic to meet your specific requirements.

Sample usage:
```
@@ -55,7 +55,7 @@ For example, if the input file contains columns `["text", "rank", "is_dupe"]`:
- With `--output-text-embeddings-only` flag: output file would have columns `["id", "text", "values"]`.

## Sample Input / Output Files
-We have also provided **sample parquet files** under `sample_input/` and corresponding output files under `sample_output/` (both of the input files contain the same raw text so the output embeddings are the same, but the `-multi-column.parquet` file has additional metadata).
+We have also provided **sample parquet files** under `sample_input/` and corresponding output files under `sample_output/` (both of the input files contain the same raw text so the output embeddings are the same, but the `-multi-column.parquet` file has additional metadata).

Examining the sample input file:

@@ -64,7 +64,7 @@ Examining the sample input file:
>>> sample_input
Dataset(num_blocks=768, num_rows=10, schema={content: string})
-```
+```

Examining the sample (expected) output file:
