46 questions
1
vote
2
answers
635
views
apache beam rows to tfrecord in order to GenerateStatistics
I have built a pipeline that reads some data, does some manipulations, and creates some Apache Beam Row objects (Steps 1 and 2 in the code below). I would then like to generate statistics and write them ...
0
votes
1
answer
284
views
Tensorflow-data-validation installing issue
I'm using Python 3.12, and I'm trying to install the latest version (1.15.1) of TensorFlow Data Validation. However, while installing, I encountered the following error:
ERROR: Could not find a ...
3
votes
4
answers
4k
views
How can I return blank cells as it is instead of printing as Zeros while using filter function
Referring to the data shown in the image: while using the filter function and getting the return values, how can I return blank cells as they are instead of printing them as zeros while using filter ...
0
votes
2
answers
130
views
relaxing the anomaly detection constraints in MLOps
I am stuck trying to understand this block of statements. In my serving set there are some anomalies.
I do not understand how the code below removes the anomalies.
payer_code = tfdv.get_feature(...
0
votes
1
answer
100
views
By excel vba adding checkmarks to each options selected in a multi-select datavalidation dropdown list
With the following Excel VBA code, which I found on the internet, I can select more than one option within a cell from the DV dropdown list at the same time. What I am trying to do is to add a checkmark ...
0
votes
1
answer
259
views
How to define features presence in a TensorFlow Data Validation schema?
I want to create a new TensorFlow Data Validation schema from scratch with fixed feature names, types, and presence.
import tensorflow_data_validation as tfdv
from tensorflow_metadata.proto.v0 import ...
1
vote
1
answer
421
views
TensorFlow Extended: Specifying Valency Of Features In Schema
I am presently trying to feed a dataset with a few multivalent feature columns through a TensorFlow Extended (TFX) pipeline. Here is a row from my sample data:
user_id 29601
...
1
vote
1
answer
453
views
In TFX, is it possible to infer Schema with dates?
I'm using TFX (more precisely, TensorFlow Data Validation) with the infer_schema method documented at https://www.tensorflow.org/tfx/data_validation/api_docs/python/tfdv/infer_schema. It generates a ...
-1
votes
1
answer
437
views
Problem of tensorflow data validation on google colab
When I import tensorflow data validation using
import tensorflow_data_validation as tfdv
I get
ModuleNotFoundError: No module named 'tensorflow_data_validation'
When I try to install it using
!pip ...
0
votes
1
answer
412
views
Installing tensorflow_data_validation is causing error #tensorflow_data_validation
I'm trying to install tensorflow_data_validation, but it shows the error below.
It says that I need Python version 2.7 or 3.4+, but I'm using version 3.10.9.
Also, I have version 2.11.0 of ...
4
votes
1
answer
615
views
Infer multivalent features with tfdv from pandas dataframe
I want to infer a schema with tensorflow data validation (tfdv) based on a pandas dataframe of the training data. The dataframe contains a column with a multivalent feature, where multiple values (or ...
0
votes
1
answer
117
views
tfx.components.StatisticsGen display train and eval in two different figures, is it possible to have them in a single figure as tfdv does?
A superimposed display for train/val splits using StatisticsGen
Hi,
I'm currently using a TFX pipeline inside Kubeflow. I am struggling to get StatisticsGen to show a single graph with train and validation ...
0
votes
4
answers
888
views
How do I save a TFDV stats in the correct format for them to be loaded back in?
It is puzzling to me that there is a tfdv.load_statistics() function, but no corresponding tfdv.write_statistics() function. How do I go about saving the statistics, and then loading them again?
e.g.
...
5
votes
1
answer
816
views
Is it possible to run tensorflow-data-validation on MacOS with M1 chip?
Question: Is it possible to run tensorflow-data-validation on MacOS with M1 chip?
Steps taken: I have created a conda environment (tfdv38) in which I have installed the Mac-optimized TensorFlow.
I ...
0
votes
1
answer
361
views
Tensorflow TFDV does not work with Specific NaN values
I'm using TensorFlow Data Validation to generate stats from the data and infer a schema to feed into TFX.
I didn't find any option to specify the NaN values; e.g., in pandas there is a field "...