Skip to content

Commit f27697e

Browse files
authored
add careers (#1176)
1 parent 460cfd5 commit f27697e

File tree

350 files changed

+921
-10695
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

350 files changed

+921
-10695
lines changed

README.md

Lines changed: 13 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -108,7 +108,7 @@ SELECT pgml.transform(
108108
```
109109

110110
## Tabular data
111-
- [47+ classification and regression algorithms](https://postgresml.org/docs/guides/training/algorithm_selection)
111+
- [47+ classification and regression algorithms](https://postgresml.org/docs/training/algorithm_selection)
112112
- [8 - 40X faster inference than HTTP based model serving](https://postgresml.org/blog/postgresml-is-8x-faster-than-python-http-microservices)
113113
- [Millions of transactions per second](https://postgresml.org/blog/scaling-postgresml-to-one-million-requests-per-second)
114114
- [Horizontal scalability](https://github.com/postgresml/pgcat)
@@ -154,7 +154,7 @@ docker run \
154154
sudo -u postgresml psql -d postgresml
155155
```
156156

157-
For more details, take a look at our [Quick Start with Docker](https://postgresml.org/docs/guides/developer-docs/quick-start-with-docker) documentation.
157+
For more details, take a look at our [Quick Start with Docker](https://postgresml.org/docs/developer-docs/quick-start-with-docker) documentation.
158158

159159
# Getting Started
160160

@@ -214,7 +214,7 @@ SELECT pgml.transform(
214214

215215
Text classification involves assigning a label or category to a given text. Common use cases include sentiment analysis, natural language inference, and the assessment of grammatical correctness.
216216

217-
![text classification](pgml-docs/docs/images/text-classification.png)
217+
![text classification](pgml-cms/docs/images/text-classification.png)
218218

219219
### Sentiment Analysis
220220
Sentiment analysis is a type of natural language processing technique that involves analyzing a piece of text to determine the sentiment or emotion expressed within it. It can be used to classify a text as positive, negative, or neutral, and has a wide range of applications in fields such as marketing, customer service, and political analysis.
@@ -383,7 +383,7 @@ SELECT pgml.transform(
383383
## Zero-Shot Classification
384384
Zero Shot Classification is a task where the model predicts a class that it hasn't seen during the training phase. This task leverages a pre-trained language model and is a type of transfer learning. Transfer learning involves using a model that was initially trained for one task in a different application. Zero Shot Classification is especially helpful when there is a scarcity of labeled data available for the specific task at hand.
385385

386-
![zero-shot classification](pgml-docs/docs/images/zero-shot-classification.png)
386+
![zero-shot classification](pgml-cms/docs/images/zero-shot-classification.png)
387387

388388
In the example provided below, we will demonstrate how to classify a given sentence into a class that the model has not encountered before. To achieve this, we make use of `args` in the SQL query, which allows us to provide `candidate_labels`. You can customize these labels to suit the context of your task. We will use `facebook/bart-large-mnli` model.
389389

@@ -417,7 +417,7 @@ SELECT pgml.transform(
417417
## Token Classification
418418
Token classification is a task in natural language understanding, where labels are assigned to certain tokens in a text. Some popular subtasks of token classification include Named Entity Recognition (NER) and Part-of-Speech (PoS) tagging. NER models can be trained to identify specific entities in a text, such as individuals, places, and dates. PoS tagging, on the other hand, is used to identify the different parts of speech in a text, such as nouns, verbs, and punctuation marks.
419419

420-
![token classification](pgml-docs/docs/images/token-classification.png)
420+
![token classification](pgml-cms/docs/images/token-classification.png)
421421

422422
### Named Entity Recognition
423423
Named Entity Recognition (NER) is a task that involves identifying named entities in a text. These entities can include the names of people, locations, or organizations. The task is completed by labeling each token with a class for each named entity and a class named "0" for tokens that don't contain any entities. In this task, the input is text, and the output is the annotated text with named entities.
@@ -467,7 +467,7 @@ select pgml.transform(
467467
## Translation
468468
Translation is the task of converting text written in one language into another language.
469469

470-
![translation](pgml-docs/docs/images/translation.png)
470+
![translation](pgml-cms/docs/images/translation.png)
471471

472472
You have the option to select from over 2000 models available on the Hugging Face <a href="https://huggingface.co/models?pipeline_tag=translation" target="_blank">hub</a> for translation.
473473

@@ -490,7 +490,7 @@ select pgml.transform(
490490
## Summarization
491491
Summarization involves creating a condensed version of a document that includes the important information while reducing its length. Different models can be used for this task, with some models extracting the most relevant text from the original document, while other models generate completely new text that captures the essence of the original content.
492492

493-
![summarization](pgml-docs/docs/images/summarization.png)
493+
![summarization](pgml-cms/docs/images/summarization.png)
494494

495495
```sql
496496
select pgml.transform(
@@ -534,7 +534,7 @@ select pgml.transform(
534534
## Question Answering
535535
Question Answering models are designed to retrieve the answer to a question from a given text, which can be particularly useful for searching for information within a document. It's worth noting that some question answering models are capable of generating answers even without any contextual information.
536536

537-
![question answering](pgml-docs/docs/images/question-answering.png)
537+
![question answering](pgml-cms/docs/images/question-answering.png)
538538

539539
```sql
540540
SELECT pgml.transform(
@@ -558,12 +558,12 @@ SELECT pgml.transform(
558558
}
559559
```
560560
<!-- ## Table Question Answering
561-
![table question answering](pgml-docs/docs/images/table-question-answering.png) -->
561+
![table question answering](pgml-cms/docs/images/table-question-answering.png) -->
562562

563563
## Text Generation
564564
Text generation is the task of producing new text, such as filling in incomplete sentences or paraphrasing existing text. It has various use cases, including code generation and story generation. Completion generation models can predict the next word in a text sequence, while text-to-text generation models are trained to learn the mapping between pairs of texts, such as translating between languages. Popular models for text generation include GPT-based models, T5, T0, and BART. These models can be trained to accomplish a wide range of tasks, including text classification, summarization, and translation.
565565

566-
![text generation](pgml-docs/docs/images/text-generation.png)
566+
![text generation](pgml-cms/docs/images/text-generation.png)
567567

568568
```sql
569569
SELECT pgml.transform(
@@ -725,7 +725,7 @@ SELECT pgml.transform(
725725
```
726726
## Text-to-Text Generation
727727
Text-to-text generation methods, such as T5, are neural network architectures designed to perform various natural language processing tasks, including summarization, translation, and question answering. T5 is a transformer-based architecture pre-trained on a large corpus of text data using denoising autoencoding. This pre-training process enables the model to learn general language patterns and relationships between different tasks, which can be fine-tuned for specific downstream tasks. During fine-tuning, the T5 model is trained on a task-specific dataset to learn how to perform the specific task.
728-
![text-to-text](pgml-docs/docs/images/text-to-text-generation.png)
728+
![text-to-text](pgml-cms/docs/images/text-to-text-generation.png)
729729

730730
*Translation*
731731
```sql
@@ -762,7 +762,7 @@ SELECT pgml.transform(
762762
```
763763
## Fill-Mask
764764
Fill-mask refers to a task where certain words in a sentence are hidden or "masked", and the objective is to predict what words should fill in those masked positions. Such models are valuable when we want to gain statistical insights about the language used to train the model.
765-
![fill mask](pgml-docs/docs/images/fill-mask.png)
765+
![fill mask](pgml-cms/docs/images/fill-mask.png)
766766

767767
```sql
768768
SELECT pgml.transform(
@@ -859,7 +859,7 @@ SELECT * FROM items, query ORDER BY items.embedding <-> query.embedding LIMIT 5;
859859

860860
<!-- ## Sentence Similarity
861861
Sentence Similarity involves determining the degree of similarity between two texts. To accomplish this, Sentence similarity models convert the input texts into vectors (embeddings) that encapsulate semantic information, and then measure the proximity (or similarity) between the vectors. This task is especially beneficial for tasks such as information retrieval and clustering/grouping.
862-
![sentence similarity](pgml-docs/docs/images/sentence-similarity.png)
862+
![sentence similarity](pgml-cms/docs/images/sentence-similarity.png)
863863
864864
<!-- ## Conversational -->
865865
<!-- # Regression

docker/dashboard.sh

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,8 +3,7 @@ set -e
33

44
export DATABASE_URL=postgres://postgresml:postgresml@127.0.0.1:5432/postgresml
55
export DASHBOARD_STATIC_DIRECTORY=/usr/share/pgml-dashboard/dashboard-static
6-
export DASHBOARD_CONTENT_DIRECTORY=/usr/share/pgml-dashboard/dashboard-content
7-
export DASHBOARD_DOCS_DIRECTORY=/usr/share/pgml-docs
6+
export DASHBOARD_CMS_DIRECTORY=/usr/share/pgml-cms
87
export SEARCH_INDEX_DIRECTORY=/var/lib/pgml-dashboard/search-index
98
export ROCKET_SECRET_KEY=$(openssl rand -hex 32)
109
export ROCKET_ADDRESS=0.0.0.0

packages/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -53,7 +53,7 @@ The version of PostgresML is set in many places, and all of them need to be upda
5353

5454
#### Documentation
5555

56-
Additionally, we mention the version of the extension in our documentation. It would be very helpful to update it there as well, so our users are always instructed to install the latest and greatest version. Our documentation is located in `pgml-docs`. If you search it for the current version number, you should find all the places where we mention it.
56+
Additionally, we mention the version of the extension in our documentation. It would be very helpful to update it there as well, so our users are always instructed to install the latest and greatest version. Our documentation is located in `pgml-cms`. If you search it for the current version number, you should find all the places where we mention it.
5757

5858
#### Github Actions
5959

packages/cargo-pgml-components/src/local_dev.rs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,7 @@ static PG_PGVECTOR: &str = "
8282
static PG_PGML: &str = "To install PostgresML into your PostgreSQL database,
8383
follow the instructions on:
8484
85-
\thttps://postgresml.org/docs/guides/setup/v2/installation
85+
\thttps://postgresml.org/docs/setup/v2/installation
8686
";
8787

8888
#[cfg(target_os = "linux")]

packages/postgresml-dashboard/build.sh

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,9 +24,8 @@ rm "$deb_dir/release.sh"
2424
( cd ${SCRIPT_DIR}/../../pgml-dashboard && \
2525
cargo build --release && \
2626
cp target/release/pgml-dashboard "$deb_dir/usr/bin/pgml-dashboard" && \
27-
cp -R content "$deb_dir/usr/share/pgml-dashboard/dashboard-content" && \
2827
cp -R static "$deb_dir/usr/share/pgml-dashboard/dashboard-static" && \
29-
cp -R ../pgml-docs "$deb_dir/usr/share/pgml-docs" )
28+
cp -R ../pgml-cms "$deb_dir/usr/share/pgml-cms" )
3029

3130
(cat ${SCRIPT_DIR}/DEBIAN/control | envsubst) > "$deb_dir/DEBIAN/control"
3231
(cat ${SCRIPT_DIR}/etc/systemd/system/pgml-dashboard.service | envsubst) > "$deb_dir/etc/systemd/system/pgml-dashboard.service"

packages/postgresml-dashboard/etc/systemd/system/pgml-dashboard.service

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,7 @@ StartLimitIntervalSec=0
66
[Service]
77
Environment=RUST_LOG=info
88
Environment=DASHBOARD_STATIC_DIRECTORY=/usr/share/pgml-dashboard/dashboard-static
9-
Environment=DASHBOARD_CONTENT_DIRECTORY=/usr/share/pgml-dashboard/dashboard-content
10-
Environment=DASHBOARD_DOCS_DIRECTORY=/usr/share/pgml-docs
9+
Environment=DASHBOARD_CMS_DIRECTORY=/usr/share/pgml-cms
1110
Environment=ROCKET_ADDRESS=0.0.0.0
1211
Environment=GITHUB_STARS=${GITHUB_STARS}
1312
Environment=SEARCH_INDEX_DIRECTORY=/var/lib/pgml-dashboard/search-index

0 commit comments

Comments
 (0)